THAP PROTEINS AS NUCLEAR RECEPTORS FOR CHEMOKINES
AND ROLES IN TRANSCRIPTIONAL REGULATION,
CELL PROLIFERATION AND CELL DIFFERENTIATION



FIELD OF THE INVENTION 

The present invention relates to genes and proteins of the THAP (THanatos (death)- 
Associated Protein) family, and uses thereof. In particular, the invention relates to the role of 
THAP-type chemokine-binding agents, such as THAP-family polypeptides, in transcriptional 
regulation and other chemokine-mediated cellular activities.

BACKGROUND 

Coordination of cell proliferation and cell death is required for normal development and 
tissue homeostasis in multicellular organisms. A defect in the normal coordination of these two 
processes is a fundamental requirement for tumorigenesis. 

Progression through the cell cycle is highly regulated, requiring the transit of numerous 
checkpoints (for review, see Hunter, 1993). The extent of cell death is physiologically controlled by 
activation of a programmed suicide pathway that results in a morphologically recognizable form of
death termed apoptosis (Jacobson et al., 1997; Vaux et al., 1994). Both extracellular signals, such as
tumor necrosis factor, and intracellular signals, like p53, can induce apoptotic cell death. Although 
many proteins involved in apoptosis or the cell cycle have been identified, the mechanisms by 
which these two processes are coordinated are not well understood. 

It is well established that molecules which modulate apoptosis have the potential to treat a 
wide range of conditions relating to cell death and cell proliferation. For example, such molecules 
may be used for inducing cell death for the treatment of cancers, inhibiting cell death for the 
treatment of neurodegenerative disorders, and inhibiting or inducing cell death for regulating 
angiogenesis. However, because many biological pathways controlling cell cycle and apoptosis 
have not yet been fully elucidated, there is a need for the identification of biological targets for the 
development of therapeutic molecules for the treatment of these disorders.

PML nuclear bodies

PML nuclear bodies (PML-NBs), also known as PODs (PML oncogenic domains), ND10 
(nuclear domain 10) and Kr bodies, are discrete subnuclear domains that are specifically disrupted 
in cells from acute promyelocytic leukemia (APL), a distinct subtype of human myeloid leukemia
(Maul et al., 2000; Ruggero et al., 2000; Zhong et al., 2000a). Their name derives from their most
intensively studied protein component, the promyelocytic leukemia protein (PML), a RING finger
IFN-inducible protein encoded by a gene originally cloned as the t(15;17) chromosomal
translocation partner of the retinoic acid receptor (RAR) locus in APL. In APL cells, the presence
of the leukemogenic fusion protein, PML-RAR, leads to the disruption of PML-NBs and the
delocalization of PML and other PML-NB proteins into aberrant nuclear structures (Zhong et al.,
2000a). Treatment of both APL cell lines and patients with retinoic acid, which induces the
degradation of the PML-RAR oncoprotein, results in relocalization of PML and other NB
components into PML-NBs and complete remission of clinical disease, respectively. The
deregulation of the PML-NBs by PML-RAR thus appears to play a critical role in tumorigenesis. 
The analysis of mice in which the PML gene was disrupted by homologous recombination has
revealed that PML functions as a tumor suppressor in vivo (Wang et al., 1998a) and is essential for
multiple apoptotic pathways (Wang et al., 1998b). Pml -/- mice and cells are protected from Fas,
TNFα, ceramide and IFN-induced apoptosis as well as from DNA damage-induced apoptosis.
However, the molecular mechanisms through which PML modulates the response to pro-apoptotic
stimuli are not well understood (Wang et al., 1998b; Quignon et al., 1998). Recent studies indicate
that PML can participate in both p53-dependent and p53-independent apoptosis pathways (Guo et
al., 2000; Fogal et al., 2000). p53-dependent DNA-damage induced apoptosis, transcriptional
activation by p53 and induction of p53 target genes are all impaired in PML -/- primary cells (Guo
et al., 2000). PML physically interacts with p53 and acts as a transcriptional co-activator for p53.
This co-activatory role of PML is absolutely dependent on its ability to recruit p53 in the PML-NBs
(Guo et al., 2000; Fogal et al., 2000). The existence of a cross-talk between PML- and p53-
dependent growth suppression pathways implies an important role for PML-NBs and PML-NBs-
associated proteins as modulators of p53 functions. In addition to p53, the pro-apoptotic factor
Daxx could be another important mediator of PML pro-apoptotic activities (Ishov et al., 1999;
Zhong et al., 2000b; Li et al., 2000). Daxx was initially identified by its ability to enhance Fas-
induced cell death. Daxx interacts with PML and localizes preferentially in the nucleus where it
accumulates in the PML-NBs (Ishov et al., 1999; Zhong et al., 2000b; Li et al., 2000). Inactivation
of PML results in delocalization of Daxx from PML-NBs and complete abrogation of Daxx pro-
apoptotic activity (Zhong et al., 2000b). Daxx has recently been found to possess strong
transcriptional repressor activity (Li et al., 2000). By recruiting Daxx to the PML-NBs, PML may
inhibit Daxx-mediated transcriptional repression, thus allowing the expression of certain pro-
apoptotic genes.

PML-NBs contain several other proteins in addition to Daxx and p53. These include the
autoantigens Sp100 (Sternsdorf et al., 1999) and Sp100-related protein Sp140 (Bloch et al., 1999),
the retinoblastoma tumor suppressor pRB (Alcalay et al., 1998), the transcriptional co-activator
CBP (LaMorte et al., 1998), the Bloom syndrome DNA helicase BLM (Zhong et al., 1999) and the
small ubiquitin-like modifier SUMO-1 (also known as sentrin-1 or PIC1; for recent reviews see
Yeh et al., 2000; Melchior, 2000; Jentsch and Pyrowolakis, 2000). Covalent modification of PML
by SUMO-1 (sumoylation) appears to play a critical role in PML accumulation into NBs (Muller et
al., 1998) and the recruitment of other NB components to PML-NBs (Ishov et al., 1999; Zhong et
al., 2000c).

Prostate apoptosis response-4

Prostate apoptosis response-4 (PAR4) is a 38 kDa protein initially identified as the product
of a gene specifically upregulated in prostate tumor cells undergoing apoptosis (for reviews see
Rangnekar, 1998; Mattson et al., 1999). Consistent with an important role of PAR4 in apoptosis,
induction of PAR4 in cultured cells is found exclusively during apoptosis and ectopic expression of
PAR4 in NIH-3T3 cells (Diaz-Meco et al., 1996), neurons (Guo et al., 1998), prostate cancer and
melanoma cells (Sells et al., 1997) has been shown to sensitize these cells to apoptotic stimuli. In
addition, downregulation of PAR4 is critical for ras-induced survival and tumor progression
(Barradas et al., 1999) and suppression of PAR4 production by antisense technology prevents
apoptosis in several systems (Sells et al., 1997; Guo et al., 1998), including different models of
neurodegenerative disorders (Mattson et al., 1999), further emphasizing the critical role of PAR4 in
apoptosis. At the carboxy terminus, PAR4 contains both a leucine zipper domain (Par4LZ, amino
acids 290-332), and a partially overlapping death domain (Par4DD, amino acids 258-332).
Deletion of this carboxy-terminal part abrogates the pro-apoptotic function of PAR4 (Diaz-Meco et
al., 1996; Sells et al., 1997; Guo et al., 1998). On the other hand, overexpression of PAR4 leucine
zipper/death domain acts in a dominant negative manner to prevent apoptosis induced by full-length
PAR4 (Sells et al., 1997; Guo et al., 1998). The PAR4 leucine zipper/death domain mediates
PAR4 interaction with other proteins by recognizing two different kinds of motifs: zinc fingers of
the Wilms tumor suppressor protein WT1 (Johnstone et al., 1996) and the atypical isoforms of
protein kinase C (Diaz-Meco et al., 1996), and an arginine-rich domain from the death-associated-
protein (DAP)-like kinase Dlk (Page et al., 1999). Among these interactions, the binding of PAR4
to aPKCs and the resulting inhibition of their enzymatic activity is of particular functional relevance
because the aPKCs are known to play a key role in cell survival and their overexpression has been
shown to abrogate the ability of PAR4 to induce apoptosis (Diaz-Meco et al., 1996; Berra et al.,
1997).

Chemokines

Chemokines (chemoattractant cytokines) are small secreted polypeptides of about 70-110 amino
acids that regulate trafficking and effector functions of leukocytes, and play an important role in
inflammation and host defense against pathogens (reviewed in Baggiolini M., et al. (1997) Annu.
Rev. Immunol. 15: 675-705; Proost P., et al. (1996) Int. J. Clin. Lab. Res. 26: 211-223; Premack
et al. (1996) Nature Medicine 2: 1174-1178; Yoshie, et al. (1997) J. Leukocyte Biol. 62: 634-644).
Over 45 different human chemokines have been described to date. They vary in their specificities
for different leukocyte types (neutrophils, monocytes, eosinophils, basophils, lymphocytes,
dendritic cells, etc.), and in the types of cells and tissues where the chemokines are synthesized.
Chemokines are typically produced at sites of tissue injury or stress, where they promote the
infiltration of leukocytes into tissues and facilitate an inflammatory response. Some chemokines act
selectively on immune system cells such as subsets of T-cells or B lymphocytes or antigen-presenting
cells, and may thereby promote immune responses to antigens. Some chemokines also
have the ability to regulate the growth or migration of hematopoietic progenitor and stem cells that
normally differentiate into specific leukocyte types, thereby regulating leukocyte numbers in the
blood.

The activities of chemokines are mediated by cell surface receptors which are members of
the family of seven-transmembrane, G-protein-coupled receptors. At present, over fifteen different
human chemokine receptors are known, including CCR1, CCR2, CCR3, CCR4, CCR5, CCR6,
CCR7, CCR8, CCR9, CCR10, CCR11, CXCR1, CXCR2, CXCR3, CXCR4 and CXCR5. These
receptors vary in their specificities for specific chemokines. Some receptors bind to a single known
chemokine, while others bind to multiple chemokines. Binding of a chemokine to its receptor
typically induces intracellular signaling responses such as a transient rise in cytosolic calcium
concentration, followed by cellular biological responses such as chemotaxis.

Chemokine SLC/CCL21 (also known as SLC, CKβ-9, 6Ckine, and exodus-2) is a member
of the CC (beta)-chemokine subfamily, which shows 21-33% identity to other CC chemokines
(Nagira, et al. (1997) J. Biol. Chem. 272:19518-19524; Hromas, et al. (1997) J. Immunol.
159:2554-2558; Hedrick, et al. (1997) J. Immunol. 159:1589-1593). SLC/CCL21 contains the four
conserved cysteines characteristic of beta chemokines plus two additional cysteines in its unusually
long carboxyl-terminal domain. Human SLC/CCL21 cDNA encodes a 134 amino acid residue,
highly basic, precursor protein with a 23 amino acid residue signal peptide that is cleaved to form
the predicted 111 amino acid residue mature protein. Mouse SLC/CCL21 cDNA encodes a 133
amino acid residue protein with a 23 residue signal peptide that is cleaved to generate the 110
residue mature protein. Human and mouse SLC/CCL21 are highly conserved, exhibiting 86% amino
acid sequence identity. The gene for human SLC/CCL21 has been localized at human chromosome
9p13 rather than chromosome 17, where the genes of many human CC chemokines are clustered.
The SLC/CCL21 gene is located within the same region of about 100 kb as the gene for MIP-3
beta/ELC/CCL19, another recently identified CC chemokine. SLC/CCL21 was previously known
to be highly expressed in lymphoid tissues at the mRNA level, and to be a chemoattractant for T
and B lymphocytes (Nagira, et al. (1997) J. Biol. Chem. 272:19518-19524; Hromas, et al. (1997) J.
Immunol. 159:2554-2558; Hedrick, et al. (1997) J. Immunol. 159:1589-1593; Gunn, et al. (1998)
Proc. Natl. Acad. Sci. 95:258-263). SLC/CCL21 also induces both adhesion of lymphocytes to
intercellular adhesion molecule-1 and arrest of rolling cells (Campbell, et al. (1998) Science
279:381-384). All of the above properties are consistent with a role for SLC/CCL21 in regulating
trafficking of lymphocytes through lymphoid tissues. Unlike most CC chemokines, SLC/CCL21 is
not chemotactic for monocytes. However, it has been reported to inhibit hemopoietic progenitor
colony formation in a dose-dependent manner (Hromas et al. (1997) J. Immunol. 159: 2554-58).
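The residue counts given above for human and mouse SLC/CCL21 can be checked with simple arithmetic: the mature protein is the precursor minus the cleaved signal peptide. The following is a minimal sketch of that check only; the dictionary layout and the function name `mature_length` are illustrative assumptions and are not part of the disclosure.

```python
# Residue counts as stated in the text above; names are illustrative only.
PRECURSORS = {
    "human SLC/CCL21": {"precursor": 134, "signal_peptide": 23},
    "mouse SLC/CCL21": {"precursor": 133, "signal_peptide": 23},
}


def mature_length(precursor: int, signal_peptide: int) -> int:
    """Return the length of the mature protein after signal peptide cleavage."""
    if signal_peptide >= precursor:
        raise ValueError("signal peptide must be shorter than the precursor")
    return precursor - signal_peptide


# Mature lengths derived from the stated precursor and signal peptide lengths.
MATURE = {name: mature_length(**counts) for name, counts in PRECURSORS.items()}
```

Under these stated counts, the derived lengths agree with the 111 residue (human) and 110 residue (mouse) mature proteins described in the text.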

Chemokine SLC/CCL21 is a ligand for chemokine receptor CCR7 (Rossi et al. (1997) J.
Immunol. 158:1033; Yoshida et al. (1997) J. Biol. Chem. 272:13803; Yoshida et al. (1998) J. Biol.
Chem. 273:7118; Campbell et al. (1998) J Cell Biol 141:1053). CCR7 is expressed on T cells and
dendritic cells (DC), consistent with the chemotactic action of SLC/CCL21 for both lymphocytes
and mature DC. Both memory (CD45RO+) and naive (CD45RA+) CD4+ and CD8+ T cells express
the CCR7 receptor (Sallusto et al. (1999) Nature 401:708). Within the memory T cell population,
CCR7 expression discriminates between T cells with effector function that can migrate to inflamed
tissues (CCR7-) vs. T cells that require a secondary stimulus prior to displaying effector functions
(CCR7+) (Sallusto et al. (1999) Nature 401:708). Unlike mature DC, immature DC do not express
CCR7 nor do they respond to the chemotactic action of CCL21 (Sallusto et al. (1998) Eur. J.
Immunol. 28:2760; Dieu et al. (1998) J. Exp. Med. 188:373).

A key function of CCR7 and its two ligands SLC/CCL21 and MIP3b/CCL19 is facilitating
recruitment and retention of cells to secondary lymphoid organs in order to promote efficient
antigen exposure to T cells. CCR7-deficient mice demonstrate poorly developed secondary organs
and exhibit an irregular distribution of lymphocytes within lymph nodes, Peyer's patches, and
splenic periarteriolar lymphoid sheaths (Forster et al. (1999) Cell 99:23). These animals have
severely impaired primary T cell responses largely due to the inability of interdigitating DC to
migrate to the lymph nodes (Forster et al. (1999) Cell 99:23). The overall findings to date support
the notion that CCR7 and its two ligands, CCL19 and CCL21, are key regulators of T cell responses
via their control of T cell/DC interactions. CCR7 is an important regulatory molecule with an
instructive role in determining the migration of cells to secondary lymphoid organs (Forster et al.
(1999) Cell 99:23; Nakano et al. (1998) Blood 91:2886).

SUMMARY OF THE INVENTION 
THAP1 (THanatos-Associated-Protein-1) 

In the past few years, the inventors have focused on the molecular characterization of novel
genes expressed in the specialized endothelial cells (HEVECs) of post-capillary high endothelial
venules (Girard and Springer, 1995a; Girard and Springer, 1995b; Girard et al., 1999). In the
present invention, they report the analysis of THAP1 (for THanatos (death)-Associated Protein-1), a
protein that localizes to PML-NBs. Two-hybrid screening of an HEVEC cDNA library with the
THAP1 bait led to the identification of a unique interacting partner, the pro-apoptotic protein
PAR4. PAR4 is also found to accumulate into PML-NBs and targeting of the THAP1 / PAR4
complex to PML-NBs is mediated by PML. Similarly to PAR4, THAP1 is a pro-apoptotic
polypeptide. Its pro-apoptotic activity requires a novel protein motif in the amino-terminal part
called the THAP domain. Together these results define a novel PML-NBs pathway for apoptosis that
involves the THAP1/PAR4 pro-apoptotic complex.

Embodiments of the present invention include genes, proteins and biological pathways
involved in apoptosis. In some embodiments, the genes, proteins, and pathways disclosed herein
may be used for the development of polypeptide, nucleic acid or small molecule therapeutics.

One embodiment of the present invention provides a novel protein motif, the THAP
domain. The present inventors initially identified the THAP domain as a 90 residue protein motif in
the amino-terminal part of THAP1 which is essential for THAP1 pro-apoptotic activity.
THAP1 (THanatos (death) Associated Protein-1), as determined by the present inventors, is a pro-
apoptotic polypeptide which forms a complex with the pro-apoptotic protein PAR4 and localizes in
discrete subnuclear domains known as PML nuclear bodies. However, the THAP domain also
defines a novel family of proteins, the THAP family, and the inventors have also provided at least
twelve distinct members in the human genome (THAP-0 to THAP-11), all of which contain a THAP
domain (typically 80-90 amino acids) in their amino-terminal part. The present invention thus
includes nucleic acid molecules, including in particular the complete cDNA sequences, encoding
members of the THAP family, portions thereof encoding the THAP domain or polypeptides
homologous thereto, as well as polypeptides encoded by the THAP family genes. The invention
thus also includes diagnostic and activity assays, and uses in therapeutics, for THAP family proteins
or portions thereof, as well as drug screening assays for identifying compounds capable of
inhibiting (or stimulating) pro-apoptotic activity of a THAP family member.

In one example of a THAP family member, THAP1 is determined to be an apoptosis-
inducing polypeptide expressed in human endothelial cells (HEVECs), providing characterization
of the THAP sequences required for apoptosis activity in the THAP1 polypeptide. In further
aspects, the invention is also directed to the interaction of THAP1 with the pro-apoptotic protein
PAR4 and with PML-NBs, including methods of modulating THAP1 / PAR4 interactions for the
treatment of disease. The invention also concerns interaction between PAR4 and PML-NBs,
diagnostics for detection of said interaction (or localization) and modulation of said interactions for
the treatment of disease.

Compounds which modulate interactions between a THAP family member and a THAP-
family target molecule, a THAP domain or THAP-domain target molecule, or a PAR4 and a PML-
NBs protein may be used in inhibiting (or stimulating) apoptosis of different cell types in various
human diseases. For example, such compounds may be used to inhibit or stimulate apoptosis of
endothelial cells in angiogenesis-dependent diseases including but not limited to cancer,
cardiovascular diseases, inflammatory diseases, and to inhibit apoptosis of neurons in acute and
chronic neurodegenerative disorders, including but not limited to Alzheimer's, Parkinson's and
Huntington's diseases, amyotrophic lateral sclerosis, HIV encephalitis, stroke, epileptic seizures.

Oligonucleotide probes or primers hybridizing specifically with a THAP1 genomic DNA or
cDNA sequence are also part of the present invention, as well as DNA amplification and detection
methods using said primers and probes.

Fragments of THAP family members or THAP domains include fragments encoded by 
nucleic acids comprising at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 
500, or 1000 consecutive nucleotides selected from the group consisting of SEQ ID NOs: 160-175,
or polypeptides comprising at least 8, 10, 12, 15, 20, 25, 30, 40, 50, 100, 150 or 200 consecutive
amino acids selected from the group consisting of SEQ ID NOs: 1-114.

A further aspect of the invention includes recombinant vectors comprising any of the nucleic
acid sequences described above, and in particular recombinant vectors comprising a THAP1
regulatory sequence or a sequence encoding a THAP1 protein, THAP family member, THAP
domain, fragments of THAP family members and THAP domains, homologues of THAP family
members/THAP domains, as well as cell hosts and transgenic non-human animals comprising
said nucleic acid sequences or recombinant vectors.

Another aspect of the invention relates to methods for the screening of substances or 
molecules that inhibit or increase the expression of the THAP1 gene or genes encoding THAP 
family members, as well as to methods for the screening of substances or molecules that interact
with and/or inhibit or increase the activity of a THAP1 polypeptide or THAP family polypeptide. 

In accordance with another aspect, the present invention provides a medicament comprising
an effective amount of a THAP family protein, e.g. THAP1, or a SLC/CCL21-binding fragment
thereof, together with a pharmaceutically acceptable carrier. The medicaments described herein
may be useful for treatment and/or prophylaxis.

As related to another aspect, the invention is concerned in particular with the use of a THAP
family protein, homologs thereof and fragments thereof, for example THAP1, or a SLC/CCL21-
binding fragment thereof as an anti-inflammatory agent. The THAP family protein, for example,
THAP1 and fragments thereof will be useful for the treatment of conditions mediated by
SLC/CCL21.

In a further aspect, the present invention provides a detection method comprising the steps of
providing a SLC/CCL21 chemokine-binding molecule which is a THAP family protein, for
example, THAP1, or an SLC/CCL21-binding fragment thereof, contacting the SLC/CCL21-binding
THAP1 molecule with a sample, and detecting an interaction of the SLC/CCL21-binding THAP1
molecule with SLC/CCL21 chemokine in the sample.

In one example, the invention may be used to detect the presence of SLC/CCL21 chemokine
in a biological sample. The SLC/CCL21-binding THAP1 molecule may be usefully immobilized
on a solid support, for example as a THAP1/Fc fusion.

In accordance with another aspect, the present invention provides a method for inhibiting the
activity of SLC/CCL21 chemokine in a sample, which method comprises contacting the sample
with an effective amount of a SLC/CCL21 chemokine-binding molecule which is a THAP1 protein
or a SLC/CCL21-binding fragment thereof.

In further aspects the invention provides a purified THAP1 protein or a SLC/CCL21-binding
fragment thereof, or a THAP1/Fc fusion, for use in a method or a medicament as described herein;
and a kit comprising such a purified THAP1 protein or fragment.

Some embodiments of the invention also envisage the use of fragments of the THAP1 
protein, which fragments have SLC/CCL21 chemokine-binding properties. The fragments may be 
peptides derived from the protein. Use of such peptides can be preferable to the use of an entire
protein or a substantial part of a protein, for example because of the reduced immunogenicity of a
peptide compared to a protein. Such peptides may be prepared by a variety of techniques including 
recombinant DNA techniques and synthetic chemical methods. 

In addition to the above properties, THAP1 has the capability to bind to several additional
chemokines. Such chemokines include, but are not limited to, ELC/CCL19, RANTES/CCL5,
MIG/CXCL9 and IP10/CXCL10. As such, further aspects of the present invention relate to the
binding of chemokines by THAP1, a chemokine binding domain of THAP1, and polypeptides
having at least 30% amino acid identity to THAP1 or a chemokine-binding domain of THAP1.
Also contemplated is the binding of chemokines to oligomers and Fc immunoglobulin fusions of
the above-listed polypeptides.
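The "at least 30% amino acid identity" criterion above can be illustrated with a short calculation. The following is a minimal sketch, assuming the two sequences have already been aligned (gaps written as "-") and assuming that identity is counted over aligned, non-gap positions; the text does not specify an alignment method or denominator, so both choices are assumptions. The function names and the toy sequences in the usage note are hypothetical and are not THAP sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned positions where neither sequence has a gap.

    Assumes the inputs are two rows of an existing pairwise alignment.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip gapped columns (an assumed convention)
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 30.0) -> bool:
    """True if the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, `percent_identity("MKT-AYLL", "MRTQAYIL")` compares seven ungapped columns, five of which are identical, so this toy pair would satisfy a 30% threshold. Other conventions (for example, dividing by the full alignment length) would give different values.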

According to some aspects of the present invention, a THAP1 polypeptide, a chemokine- 
binding domain of THAP1, polypeptides having at least 30% amino acid identity to THAP1 or a 
chemokine-binding domain of THAP1 as well as oligomers or Fc immunoglobulin fusions of these 
proteins can be used in pharmaceutical compositions and/or medicaments for reducing the
symptoms associated with inflammation and/or inflammatory diseases. As such, some aspects of
the present invention include pharmaceutical compositions and/or medicaments comprising THAP1 
protein, a chemokine-binding domain of THAP1, polypeptides having at least 30% amino acid 
identity to THAP1 or a chemokine-binding domain of THAP1 as well as oligomers or Fc 
immunoglobulin fusions of these proteins. 

Yet other aspects of the invention relate to THAP-family polypeptides, chemokine binding
domains of THAP-family peptides, fusions of a THAP-family polypeptide with an immunoglobulin
Fc region, fusions of a chemokine-binding domain of a THAP-family peptide with an
immunoglobulin Fc region, oligomers of THAP family polypeptides, chemokine-binding domains
of THAP family peptides, THAP-family peptide-Fc fusions, and chemokine-binding domain of
THAP-family peptide-Fc fusions as well as polypeptides having at least 30% amino acid identity to
any of the above-listed polypeptides. Pharmaceutical compositions which include one or more of
these polypeptides are also contemplated.

Aspects of the invention relate to methods of binding a chemokine, inhibiting the activity of a
chemokine, reducing or ameliorating the symptoms of a condition mediated or influenced by one or
more chemokines, preventing the symptoms of a condition mediated or influenced by one or more
chemokines and detecting a chemokine by using chemokine-binding agents such as THAP-family
polypeptides, chemokine binding domains of THAP-family peptides, fusions of a THAP-family
polypeptide with an immunoglobulin Fc region, fusions of a chemokine-binding domain of a
THAP-family peptide with an immunoglobulin Fc region, oligomers of THAP family polypeptides,
chemokine-binding domains of THAP family peptides, THAP-family peptide-Fc fusions, and
chemokine-binding domain of THAP-family peptide-Fc fusions as well as polypeptides having at
least 30% amino acid identity to any of the above-listed polypeptides.

Still other aspects of the present invention relate to methods of modulating chemokine
interactions with cellular receptors. Such receptors can be extracellular or can be molecules that are
present within the cell. In some embodiments, chemokine interaction with one or more cellular
receptors is modulated with one or more chemokine-binding agents, such as THAP-family
polypeptides, chemokine binding domains of THAP-family peptides, fusions of a THAP-family
polypeptide with an immunoglobulin Fc region, fusions of a chemokine-binding domain of a
THAP-family peptide with an immunoglobulin Fc region, oligomers of THAP family polypeptides,
chemokine-binding domains of THAP family peptides, THAP-family peptide-Fc fusions, and
chemokine-binding domain of THAP-family peptide-Fc fusions as well as polypeptides having at
least 30% amino acid identity to any of the above-listed polypeptides.

Some embodiments of the present invention relate to chemokines or chemokine complexes 
that are present within the nucleus of the cell and which modulate transcription. In some 
embodiments, complexes that are capable of modulating transcription comprise chemokines and 
chemokine-binding agents, such as THAP-family polypeptides, chemokine binding domains of 
THAP-family peptides, fusions of a THAP-family polypeptide with an immunoglobulin Fc region, 
fusions of a chemokine-binding domain of a THAP-family peptide with an immunoglobulin Fc 
region, oligomers of THAP family polypeptides, chemokine-binding domains of THAP family 
peptides, THAP-family peptide-Fc fusions, and chemokine-binding domain of THAP-family 
peptide-Fc fusions as well as polypeptides having at least 30% amino acid identity to any of the 
above-listed polypeptides. In some embodiments, the expression of one or more genes that are 
under the control of a THAP responsive promoter is modulated.

It will also be evident that the THAP-family proteins for use in the invention may be 
prepared in a variety of ways, in particular as recombinant proteins in a variety of expression 
systems. Any standard systems may be used such as baculovirus expression systems or mammalian 
cell line expression systems.

Other aspects of the invention are described in the following numbered paragraphs: 
1. A method of identifying a candidate modulator of apoptosis comprising:

(a) contacting a THAP-family polypeptide or a biologically active fragment thereof with a 
test compound, wherein said THAP-family polypeptide comprises at least 30% amino acid identity 
to an amino acid sequence selected from the group consisting of SEQ ID NOs: 1-114; and 

(b) determining whether said compound selectively modulates the activity of said 
polypeptide; 

wherein a determination that said test compound selectively modulates the activity of said 
polypeptide indicates that said compound is a candidate modulator of apoptosis. 
2. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 3, or a biologically active fragment thereof.

3. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 4, or a biologically active fragment thereof.

4. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 5, or a biologically active fragment thereof.

5. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 6, or a biologically active fragment thereof.

6. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 7, or a biologically active fragment thereof.

7. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 8, or a biologically active fragment thereof.

8. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 9, or a biologically active fragment thereof.

9. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 10, or a biologically active fragment thereof.

10. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 11, or a biologically active fragment thereof.

11. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 12, or a biologically active fragment thereof.

12. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 13, or a biologically active fragment thereof.

13. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence of SEQ ID NO: 14, or a biologically active fragment thereof.

14. The method of Paragraph 1, wherein the THAP-family polypeptide comprises the
amino acid sequence selected from the group consisting of SEQ ID NOs: 15-114, and biologically
active fragments thereof.

15. The method of Paragraph 1, wherein said biologically active fragment of said 
THAP-family protein has at least one biological activity selected from the group consisting of 
interaction with a THAP-family target protein, binding to a nucleic acid sequence, binding to PAR-4,
binding to PML, binding to a polypeptide found in PML-NBs, localization to PML-NBs,
targeting a THAP-family target protein to PML-NBs, and inducing apoptosis.

16. The method of any one of Paragraphs 2-15, wherein said THAP-family
polypeptide has at least one biological activity selected from the group consisting of interaction
with a THAP-family target protein, binding to a nucleic acid sequence, binding to PAR-4, binding
to PML, binding to a polypeptide found in PML-NBs, localization to PML-NBs, targeting a
THAP-family target protein to PML-NBs, and inducing apoptosis.

17. An isolated nucleic acid encoding a polypeptide having apoptotic activity, said 
polypeptide consisting essentially of an amino acid sequence selected from the group consisting of:

(a) amino acid positions 1-90 of SEQ ID NO: 2, a fragment thereof having
apoptotic activity, or a polypeptide having at least 30% amino acid identity thereto;

(b) a polypeptide comprising a THAP-family domain consisting essentially of 
amino acid positions 1 to 89 of SEQ ID NO: 3, a fragment thereof having apoptotic
activity, or a polypeptide having at least 30% amino acid identity thereto;

(c) a polypeptide comprising a THAP-family domain consisting essentially of 
amino acid positions 1 to 89 of SEQ ID NO: 4, a fragment thereof having apoptotic 
activity, or a polypeptide having at least 30% amino acid identity thereto;

(d) a polypeptide comprising a THAP-family domain consisting essentially of 
amino acid positions 1 to 89 of SEQ ID NO: 5, a fragment thereof having apoptotic
activity, or a polypeptide having at least 30% amino acid identity thereto;

(e) a polypeptide comprising a THAP-family domain consisting essentially of 
amino acid positions 1 to 90 of SEQ ID NO: 6, a fragment thereof having apoptotic 
activity, or a polypeptide having at least 30% amino acid identity thereto;

(f) a polypeptide comprising a THAP-family domain consisting essentially of
amino acid positions 1 to 90 of SEQ ID NO: 7, a fragment thereof having apoptotic
activity, or a polypeptide having at least 30% amino acid identity thereto;

(g) a polypeptide comprising a THAP-family domain consisting essentially of 
amino acid positions 1 to 90 of SEQ ID NO: 8, a fragment thereof having apoptotic
activity, or a polypeptide having at least 30% amino acid identity thereto;

(h) a polypeptide comprising a THAP-family domain consisting essentially of 
amino acid positions 1 to 90 of SEQ ID NO: 9, a fragment thereof having apoptotic 
activity, or a polypeptide having at least 30% amino acid identity thereto;

(i) a polypeptide comprising a THAP-family domain consisting essentially of 
amino acid positions 1 to 92 of SEQ ID NO: 10, a fragment thereof having apoptotic
activity, or a polypeptide having at least 30% amino acid identity thereto;

(j) a polypeptide comprising a THAP-family domain consisting essentially of
amino acid positions 1 to 90 of SEQ ID NO: 11, a fragment thereof having apoptotic
activity, or a polypeptide having at least 30% amino acid identity thereto;

(k) a polypeptide comprising a THAP-family domain consisting essentially of
amino acid positions 1 to 90 of SEQ ID NO: 12, or a fragment thereof having apoptotic
activity, or a polypeptide having at least 30% amino acid identity thereto;

(l) a polypeptide comprising a THAP-family domain consisting essentially of
amino acid positions 1 to 90 of SEQ ID NO: 13, a fragment thereof having apoptotic
activity, or a polypeptide having at least 30% amino acid identity thereto; and

(m) a polypeptide comprising a THAP-family domain consisting essentially of
amino acid positions 1 to 90 of SEQ ID NO: 14, a fragment thereof having apoptotic
activity, or a polypeptide having at least 30% amino acid identity thereto.

18. An isolated nucleic acid encoding a THAP-family polypeptide having apoptotic
activity selected from the group consisting of:

(i) a nucleic acid molecule encoding a polypeptide comprising the amino acid 
sequence of a sequence selected from the group consisting of SEQ ID NOs: 1-114; 

(ii) a nucleic acid molecule comprising the nucleic acid sequence of a sequence
selected from the group consisting of SEQ ID NOs: 160-175 and the sequences complementary
thereto; and

(iii) a nucleic acid the sequence of which is degenerate as a result of the genetic 
code to the sequence of a nucleic acid as defined in (i) and (ii). 

19. The nucleic acid of Paragraph 18, wherein said nucleic acid comprises a nucleic 
acid selected from the group consisting of SEQ ID NOs. 5, 7, 8 and 11.

20. The nucleic acid of Paragraph 18, wherein said nucleic acid comprises a nucleic
acid selected from the group consisting of SEQ ID NOs. 162, 164, 165 and 168.

21. An isolated nucleic acid encoding a THAP-family polypeptide having apoptotic 
activity comprising: 

(i) the nucleic acid sequence of SEQ ID NOs: 1-2 or the sequence complementary thereto; or

(ii) a nucleic acid molecule encoding a polypeptide comprising the amino acid sequence of
SEQ ID NOs: 1-2.

22. An isolated nucleic acid, said nucleic acid comprising a nucleotide sequence 
encoding: 

i) a polypeptide comprising an amino acid sequence having at least about 80% identity to a
sequence selected from the group consisting of the polypeptides of SEQ ID NOs: 1-114 and the
polypeptides encoded by the nucleic acids of SEQ ID NOs: 160-175; or

ii) a fragment of said polypeptide which possesses apoptotic activity. 

23. The nucleic acid of Paragraph 23, wherein said nucleic acid encodes a
polypeptide comprising an amino acid sequence having at least about 80% identity to a sequence
selected from the group consisting of the polypeptides of SEQ ID NOs: 5, 7, 8 and 11 and the
polypeptides encoded by the nucleic acids of SEQ ID NOs: 162, 164, 165 and 168 or a fragment of
said polypeptide which possesses apoptotic activity.

24. The nucleic acid of Paragraph 23, wherein said polypeptide comprises an amino 
acid sequence selected from the group consisting of the sequences of SEQ ID NOs: 5, 7, 8 and 11
and the polypeptides encoded by the nucleic acids of SEQ ID NOs: 162, 164, 165 and 168.

25. The nucleic acid of Paragraph 23, wherein polypeptide identity is determined using 
an algorithm selected from the group consisting of XBLAST with the parameters score=50 and
wordlength=3, Gapped BLAST with the default parameters of XBLAST, and BLAST with the
default parameters of XBLAST. 
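
By way of non-limiting illustration of the percent-identity thresholds recited in Paragraphs 22 to 25, the following Python sketch computes identity over two sequences that have already been aligned. It is not XBLAST or Gapped BLAST and does not reproduce their scoring. The function names, the use of "-" as the gap symbol, the choice of alignment columns as the denominator, and the toy sequences are all assumptions made for this example only.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences.

    Assumption: '-' marks a gap, and the denominator is the number of
    alignment columns that are not gaps in both sequences.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Drop columns where both sequences carry a gap.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    # A column counts as identical only if both residues match and are not gaps.
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True if the aligned pair reaches the given percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from any SEQ ID NO.
    print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
    print(meets_identity_threshold("ACDEFGHIKL", "ACDEFGHIKV"))
```

Other conventions for the denominator exist (for example, the length of the shorter sequence), so a reported identity value should always state which convention was used.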

26. The nucleic acid of Paragraph 17, wherein said nucleic acid is operably linked to a
promoter. 

27. An expression cassette comprising the nucleic acid of Paragraph 26. 

28. A host cell comprising the expression cassette of Paragraph 27. 

29. A method of making a THAP-family polypeptide, said method comprising:

providing a population of host cells comprising a recombinant nucleic acid encoding said
THAP-family protein of any one of SEQ ID NOs. 1-114; and

culturing said population of host cells under conditions conducive to the expression of said 
recombinant nucleic acid; 

whereby said polypeptide is produced within said population of host cells. 

30. The method of Paragraph 29 wherein said providing step comprises providing a 
population of host cells comprising a recombinant nucleic acid encoding said THAP-family protein 
of any one of SEQ ID NOs. 5, 7, 8 and 11.

31. The method of Paragraph 29, further comprising purifying said polypeptide from 
said population of cells. 

32. An isolated THAP polypeptide encoded by the nucleic acid of any one of SEQ ID 
NOs. 160-175.

33. The polypeptide of Paragraph 32, wherein said polypeptide is encoded by a nucleic 
acid selected from the group consisting of SEQ ID NOs. 5, 7, 8, 11, 162, 164, 165 and 168.

34. The polypeptide of Paragraph 32, wherein said polypeptide has at least one activity 
selected from the group consisting of interaction with a THAP-family target protein, binding to a 
nucleic acid sequence, binding to PAR-4, binding to PML, binding to a polypeptide found in PML- 
NBs, localization to PML-NBs, targeting a THAP-family target protein to PML-NBs, and inducing 
apoptosis. 

35. An isolated THAP polypeptide or fragment thereof, said polypeptide comprising at 
least 12 contiguous amino acids of a sequence selected from the group consisting of SEQ ID NOs: 
1-114. 

36. The polypeptide of Paragraph 35, wherein said polypeptide comprises at least 12 
contiguous amino acids of a sequence selected from the group consisting of SEQ ID NOs. 5, 7, 8, 
and 11. 
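
By way of non-limiting illustration of the "at least 12 contiguous amino acids" criterion of Paragraphs 35 and 36, the following Python sketch tests whether a candidate sequence shares an exact contiguous span of a given length with a reference sequence. The function name, the one-letter residue strings, and the toy sequences are hypothetical and are not drawn from any SEQ ID NO.

```python
def shares_contiguous_span(candidate: str, reference: str, min_len: int = 12) -> bool:
    """True if candidate contains at least min_len contiguous residues
    that also occur contiguously, in the same order, in reference."""
    if min_len <= 0:
        raise ValueError("min_len must be positive")
    if len(candidate) < min_len or len(reference) < min_len:
        return False
    # Every window of length min_len found in the reference sequence.
    reference_windows = {
        reference[i:i + min_len]
        for i in range(len(reference) - min_len + 1)
    }
    # Slide the same window size over the candidate and look for a hit.
    return any(
        candidate[i:i + min_len] in reference_windows
        for i in range(len(candidate) - min_len + 1)
    )


if __name__ == "__main__":
    reference = "ACDEFGHIKLMNPQRSTVWY"          # hypothetical toy sequence
    candidate = "GG" + reference[3:15] + "GG"   # embeds 12 reference residues
    print(shares_contiguous_span(candidate, reference))
```

The same function, called with min_len=6, corresponds to the "contiguous span of at least 6 contiguous amino acids" language used in the screening methods.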

37. The polypeptide of Paragraph 35, wherein said polypeptide has at least one activity 
selected from the group consisting of interaction with a THAP-family target protein, binding to a 
nucleic acid sequence, binding to PAR-4, binding to PML, binding to a polypeptide found in PML- 
NBs, localization to PML-NBs, targeting a THAP-family target protein to PML-NBs, and inducing 
apoptosis.

38. An isolated THAP polypeptide or fragment thereof, said polypeptide comprising an
amino acid sequence having at least about 80% amino acid sequence identity to a sequence selected 
from the group consisting of SEQ ID NOs: 1-114 or a fragment thereof, said polypeptide or 
fragment thereof having at least one activity selected from the group consisting of interaction with a
THAP-family target protein, binding to a nucleic acid sequence, binding to PAR-4, binding to
PML, binding to a polypeptide found in PML-NBs, localization to PML-NBs, targeting a THAP- 
family target protein to PML-NBs, and inducing apoptosis. 

39. The polypeptide of Paragraph 38, wherein said THAP polypeptide or fragment 
thereof comprises an amino acid sequence having at least about 80% amino acid sequence identity
to a sequence selected from the group consisting of SEQ ID NOs: 5, 7, 8 and 11 or a fragment
thereof having at least one activity selected from the group consisting of interaction with a THAP- 
family target protein, binding to a nucleic acid sequence, binding to PAR-4, binding to PML, 
binding to a polypeptide found in PML-NBs, localization to PML-NBs, targeting a THAP-family 
target protein to PML-NBs, and inducing apoptosis.

40. The polypeptide of Paragraph 38, wherein said polypeptide is selectively bound by
an antibody raised against an antigenic polypeptide, or antigenic fragment thereof, said antigenic
polypeptide comprising the polypeptide of any one of SEQ ID NOs: 1-114. 

41. The polypeptide of Paragraph 38, wherein said polypeptide is selectively bound by 
an antibody raised against an antigenic polypeptide, or antigenic fragment thereof, said antigenic 
polypeptide comprising the polypeptide of any one of SEQ ID NOs: 5, 7, 8 and 11.

42. The polypeptide of Paragraph 38, wherein said polypeptide comprises the
polypeptide of SEQ ID NOs: 1-114.

43. The polypeptide of Paragraph 38, wherein said polypeptide comprises a 
polypeptide selected from the group consisting of SEQ ID NOs. 5, 7, 8 and 11.

44. An antibody that selectively binds to the polypeptide of Paragraph 38.

45. An antibody according to Paragraph 44, wherein said antibody is capable of 
inhibiting binding of said polypeptide to a THAP-family target polypeptide. 

46. An antibody according to Paragraph 44, wherein said antibody is capable of 
inhibiting apoptosis mediated by said polypeptide. 

47. The polypeptide of Paragraph 38, wherein identity is determined using an algorithm
selected from the group consisting of XBLAST with the parameters score=50 and wordlength=3,
Gapped BLAST with the default parameters of XBLAST, and BLAST with the default parameters 
of XBLAST. 

48. A method of assessing the biological activity of a THAP-family polypeptide 
comprising:

(a) providing a THAP-family polypeptide or a fragment thereof; and 

(b) assessing the ability of the THAP-family polypeptide to induce apoptosis of a cell.

49. A method of assessing the biological activity of a THAP-family polypeptide
comprising: 

(a) providing a THAP-family polypeptide or a fragment thereof; and 

(b) assessing the DNA binding activity of the THAP-family polypeptide.

50. The method of Paragraphs 48 or 49, wherein step (a) comprises introducing to a
cell a recombinant vector comprising a nucleic acid encoding a THAP-family polypeptide.

51. The method of Paragraphs 49 or 50, wherein the THAP-family polypeptide 
comprises a THAP consensus amino acid sequence depicted in SEQ ID NOs: 1-2, or a fragment
thereof having at least one activity selected from the group consisting of interaction with a
THAP-family target protein, binding to a nucleic acid sequence, binding to PAR-4, binding to PML,
binding to a polypeptide found in PML-NBs, localization to PML-NBs, targeting a THAP-family
target protein to PML-NBs, and inducing apoptosis. 

52. The method of Paragraph 49, wherein the THAP-family polypeptide comprises an 
amino acid sequence selected from the group of sequences consisting of SEQ ID NOs: 1-114 or a
fragment thereof having at least one activity selected from the group consisting of interaction with a
THAP-family target protein, binding to a nucleic acid sequence, binding to PAR-4, binding to
PML, binding to a polypeptide found in PML-NBs, localization to PML-NBs, targeting a
THAP-family target protein to PML-NBs, and inducing apoptosis.

53. The method of Paragraph 49, wherein the THAP-family polypeptide comprises a 
native THAP-family polypeptide, or a fragment thereof having at least one activity selected from
the group consisting of interaction with a THAP-family target protein, binding to a nucleic acid
sequence, binding to PAR-4, binding to PML, binding to a polypeptide found in PML-NBs,
localization to PML-NBs, targeting a THAP-family target protein to PML-NBs, and inducing
apoptosis. 

54. The method of Paragraph 49, wherein the THAP-family polypeptide comprises a
THAP-family polypeptide or a fragment thereof having at least one activity selected from the group
consisting of interaction with a THAP-family target protein, binding to a nucleic acid sequence,
binding to PAR-4, binding to PML, binding to a polypeptide found in PML-NBs, localization to
PML-NBs, targeting a THAP-family target protein to PML-NBs, and inducing apoptosis, wherein
said THAP-family polypeptide or fragment thereof comprises at least one amino acid deletion,
substitution or insertion. 

55. An isolated THAP-family polypeptide comprising an amino acid sequence of SEQ
ID NOs: 1-114, wherein said polypeptide comprises at least one amino acid deletion, substitution or
insertion with respect to said amino acid sequence of SEQ ID NOs. 1-114.

56. A THAP-family polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NOs: 1-114, wherein said polypeptide comprises at least one amino
acid deletion, substitution or insertion with respect to said amino acid sequence of one of SEQ ID
NOs. 1-114 and displays a reduced ability to induce apoptosis or bind DNA compared to the wild-
type polypeptide. 

57. A THAP-family polypeptide comprising an amino acid sequence of SEQ ID NOs: 
1-114, wherein said polypeptide comprises at least one amino acid deletion, substitution or insertion
with respect to said amino acid sequence of one of SEQ ID NOs. 1-114 and displays an increased
ability to induce apoptosis or bind DNA compared to the wild-type polypeptide. 

58. A method of determining whether a THAP-family polypeptide is expressed within 
a biological sample, said method comprising the steps of:

(a) contacting a biological sample from a subject with: 

a polynucleotide that hybridizes under stringent conditions to a nucleic acid of SEQ ID NOs:
160-175 or

a detectable polypeptide that selectively binds to the polypeptide of SEQ ID NOs: 1-114; and

(b) detecting the presence or absence of hybridization between said polynucleotide and an RNA 
species within said sample, or the presence or absence of binding of said detectable polypeptide to a
polypeptide within said sample;

wherein a detection of said hybridization or of said binding indicates that said THAP-family 
polypeptide is expressed within said sample. 

59. The method of Paragraph 58, wherein said subject suffers from, is suspected of 
suffering from, or is susceptible to a cell proliferative disorder. 

60. The method of Paragraph 59, wherein said cell proliferative disorder is a disorder
related to regulation of apoptosis.

61. The method of Paragraph 58, wherein said polynucleotide is a primer, and wherein 
said hybridization is detected by detecting the presence of an amplification product comprising said 
primer sequence. 

62. The method of Paragraph 58, wherein said detectable polypeptide is an antibody.

63. A method of assessing THAP-family activity in a biological sample, said method 
comprising the steps of:

(a) contacting a nucleic acid molecule comprising a binding site for a THAP-family polypeptide
with:

(i) a biological sample from a subject; or

(ii) a THAP-family polypeptide isolated from a biological sample from a subject, the
polypeptide comprising the amino acid sequences of one of SEQ ID NOs: 1-114; and

(b) assessing the binding between said nucleic acid molecule and a THAP-family polypeptide;
wherein a detection of decreased binding compared to a reference THAP-family nucleic acid
binding level indicates that said sample comprises a deficiency in THAP-family activity.

64. A method of determining whether a mammal has an elevated or reduced level of 
THAP-family expression, said method comprising the steps of:

(a) providing a biological sample from said mammal; and

(b) comparing the amount of a THAP-family polypeptide of SEQ ID NOs: 1-114 or of a
THAP-family RNA species encoding a polypeptide of SEQ ID NOs: 1-114 within said biological
sample with a level detected in or expected from a control sample;

wherein an increased amount of said THAP-family polypeptide or said THAP-family RNA 
species within said biological sample compared to said level detected in or expected from said 
control sample indicates that said mammal has an elevated level of THAP-family expression, and 
wherein a decreased amount of said THAP-family polypeptide or said THAP-family RNA species 
within said biological sample compared to said level detected in or expected from said control 
sample indicates that said mammal has a reduced level of THAP-family expression.
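
By way of non-limiting illustration of the comparison recited in Paragraph 64, the following Python sketch classifies a measured sample level against a control level. The function name, the "unchanged" outcome, and the optional tolerance parameter are assumptions introduced for this example; Paragraph 64 itself recites only an increased or a decreased amount relative to the control.

```python
def classify_expression(sample_level: float, control_level: float,
                        tolerance: float = 0.0) -> str:
    """Classify THAP-family expression relative to a control level.

    Assumption: levels are non-negative quantities in the same units, and
    differences no larger than `tolerance` are reported as "unchanged".
    """
    if sample_level < 0 or control_level < 0 or tolerance < 0:
        raise ValueError("levels and tolerance must be non-negative")
    difference = sample_level - control_level
    if difference > tolerance:
        return "elevated"   # increased amount relative to the control
    if difference < -tolerance:
        return "reduced"    # decreased amount relative to the control
    return "unchanged"


if __name__ == "__main__":
    # Hypothetical measurements in arbitrary units.
    print(classify_expression(12.0, 8.0))
    print(classify_expression(3.0, 8.0))
```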

65. The method of Paragraph 64, wherein said mammal suffers from, is suspected of 
suffering from, or is susceptible to a cell proliferative disorder. 

66. A method of identifying a candidate inhibitor of a THAP-family polypeptide, a 
candidate inhibitor of apoptosis, or a candidate compound for the treatment of a cell proliferative
disorder, said method comprising:

(a) contacting a THAP-family polypeptide according to SEQ ID NOs: 1-114 or a fragment 
comprising a contiguous span of at least 6 contiguous amino acids of a polypeptide according to 
SEQ ID NOs: 1-114 with a test compound; and 

(b) determining whether said compound selectively binds to said polypeptide;

wherein a determination that said compound selectively binds to said polypeptide indicates that
said compound is a candidate inhibitor of a THAP-family polypeptide, a candidate inhibitor of
apoptosis, or a candidate compound for the treatment of a cell proliferative disorder.

67. A method of identifying a candidate inhibitor of apoptosis, a candidate compound 
for the treatment of a cell proliferative disorder, or a candidate inhibitor of a THAP-family
polypeptide of SEQ ID NOs: 1-114 or a fragment comprising a contiguous span of at least 6
contiguous amino acids of a polypeptide according to SEQ ID NOs: 1-114, said method 
comprising: 

(a) contacting said THAP-family polypeptide with a test compound; and 

(b) determining whether said compound selectively inhibits at least one biological activity 
selected from the group consisting of interaction with a THAP-family target protein, binding to a 
nucleic acid sequence, binding to PAR-4, binding to PML, binding to a polypeptide found in
PML-NBs, localization to PML-NBs, targeting a THAP-family target protein to PML-NBs, and inducing
apoptosis; 

wherein a determination that said compound selectively inhibits said at least one biological activity 
of said polypeptide indicates that said compound is a candidate inhibitor of a THAP-family
polypeptide, a candidate inhibitor of apoptosis, or a candidate compound for the treatment of a cell 
proliferative disorder.

68. A method of identifying a candidate inhibitor of apoptosis, a candidate compound
for the treatment of a cell proliferative disorder, or a candidate inhibitor of a THAP-family 
polypeptide of SEQ ID NOs: 1-114 or a fragment comprising a contiguous span of at least 6 
contiguous amino acids of a polypeptide according to SEQ ID NOs: 1-114, said method
comprising:

(a) contacting a cell comprising said THAP-family polypeptide with a test compound; and 

(b) determining whether said compound selectively inhibits at least one biological activity 
selected from the group consisting of interaction with a THAP-family target protein, binding to a 
nucleic acid sequence, binding to PAR-4, binding to PML, binding to a polypeptide found in
PML-NBs, localization to PML-NBs, targeting a THAP-family target protein to PML-NBs, and inducing
apoptosis; 

wherein a determination that said compound selectively inhibits said at least one biological 
activity of said polypeptide indicates that said compound is a candidate inhibitor of a THAP-family 
polypeptide, a candidate inhibitor of apoptosis, or a candidate compound for the treatment of a cell 
proliferative disorder.

69. The method of Paragraphs 67 or 68, wherein step (b) comprises assessing apoptotic 
activity, and wherein a determination that said compound inhibits apoptosis indicates that said 
compound is a candidate inhibitor of said THAP-family polypeptide. 

70. The method of Paragraph 68 comprising introducing a nucleic acid comprising the 
nucleotide sequence encoding said THAP-family polypeptide according to any one of Paragraphs
32-43 into said cell.

71. A polynucleotide according to any one of Paragraphs 17-25 attached to a solid
support.

72. An array of polynucleotides comprising at least one polynucleotide according to 
Paragraph 71.

73. An array according to Paragraph 72, wherein said array is addressable. 

74. A polynucleotide according to any one of Paragraphs 17 to 25 further comprising a
label.

75. A method of identifying a candidate activator of a THAP-family polypeptide, said 
method comprising:

a) contacting a THAP-family polypeptide according to SEQ ID NOs: 1-114 or a fragment
comprising a contiguous span of at least 6 contiguous amino acids of a polypeptide according to
SEQ ID NOs: 1-114 with a test compound; and

b) determining whether said compound selectively binds to said polypeptide; 

wherein a determination that said compound selectively binds to said polypeptide indicates
that said compound is a candidate activator of said polypeptide.

76. A method of identifying a candidate activator of a THAP-family polypeptide of
SEQ ID NOs: 1-114 or a fragment comprising a contiguous span of at least 6 contiguous amino
acids of a polypeptide according to SEQ ID NOs: 1-114, said method comprising:

(a) contacting said polypeptide with a test compound; and 

(b) determining whether said compound selectively activates at least one biological activity 
selected from the group consisting of interaction with a THAP-family target protein, binding to a 
nucleic acid sequence, binding to PAR-4, binding to PML, binding to a polypeptide found in PML- 
NBs, localization to PML-NBs, targeting a THAP-family target protein to PML-NBs, and inducing 
apoptosis; 

wherein a determination that said compound selectively activates said at least one biological 
activity of said polypeptide indicates that said compound is a candidate activator of said 
polypeptide. 

77. A method of identifying a candidate activator of a THAP-family polypeptide of 
SEQ ID NOs: 1-114 or a fragment comprising a contiguous span of at least 6 contiguous amino
acids of a polypeptide according to SEQ ID NOs: 1-114, said method comprising:

(a) contacting a cell comprising said THAP-family polypeptide with a test compound; and 

(b) determining whether said compound selectively activates at least one biological activity 
selected from the group consisting of interaction with a THAP-family target protein, binding to a 
nucleic acid sequence, binding to PAR-4, binding to PML, binding to a polypeptide found in PML- 
NBs, localization to PML-NBs, targeting a THAP-family target protein to PML-NBs, and inducing 
apoptosis;

wherein a determination that said compound selectively activates said at least one biological 
activity of said polypeptide indicates that said compound is a candidate activator of said 
polypeptide. 

78. The method of Paragraphs 76 or 77, wherein said determining step comprises
assessing apoptotic activity, and wherein a determination that said compound increases apoptosis
activity indicates that said compound is a candidate activator of said THAP-family polypeptide. 

79. The method of Paragraph 77 wherein step a) comprises introducing a nucleic acid 
comprising the nucleotide sequence encoding said THAP-family polypeptide according to any one
of Paragraphs 17-25 into said cell.

80. A method of identifying a candidate modulator of PAR4 activity, said method 
comprising: 

(a) providing a PAR4 polypeptide or a fragment thereof; and 

(b) providing a PML-NB polypeptide, or a polypeptide associated with PML-NBs, or a 
fragment thereof; and

(c) determining whether a test compound selectively modulates the ability of said PAR4 
polypeptide to bind to said PML-NB polypeptide or polypeptide associated with PML-NBs;

wherein a determination that said test compound selectively inhibits the ability of said PAR4
polypeptide to bind to said PML-NB polypeptide or polypeptide associated with PML-NBs 
indicates that said compound is a candidate modulator of PAR4 activity. 

81. A method of identifying a candidate modulator of PAR4 activity, said method 
comprising:

(a) providing a PAR4 polypeptide or a fragment thereof; and 

(b) determining whether a test compound selectively modulates the ability of said PAR4 
polypeptide to localise in PML-NBs; 

wherein a determination that said test compound selectively inhibits the ability of said PAR4 
polypeptide to localise in PML-NBs indicates that said compound is a candidate modulator of
PAR4 activity. 

82. A method of identifying a candidate inhibitor of THAP-family activity, said 
method comprising:

(a) providing a THAP-family polypeptide of SEQ ID NOs: 1-114 or a fragment comprising a
contiguous span of at least 6 contiguous amino acids of a polypeptide according to SEQ ID NOs:
1-114; and

(b) providing a THAP-family target polypeptide or a fragment thereof; and 

(c) determining whether a test compound selectively inhibits the ability of said THAP-family 
polypeptide to bind to said THAP-family target polypeptide; 

wherein a determination that said test compound selectively inhibits the ability of said
THAP-family polypeptide to bind to said THAP-family target polypeptide indicates that said compound is
a candidate inhibitor of THAP-family activity. 

83. The method of Paragraph 82, comprising providing a cell comprising: 

(a) a first expression vector comprising a nucleic acid encoding a THAP-family polypeptide of 
SEQ ID NOs: 1-114 or a fragment comprising a contiguous span of at least 6 contiguous
amino acids of a polypeptide according to SEQ ID NOs: 1-114; and

(b) a second expression vector comprising a nucleic acid encoding a THAP-family target 
polypeptide, or a fragment thereof. 

84. The method of Paragraph 82, wherein said THAP-family activity is apoptosis
activity.

85. The method of Paragraph 82, wherein said THAP-family target protein is PAR-4. 

86. The method of Paragraph 82, wherein said THAP-family polypeptide is a THAP-1, 
THAP-2 or THAP-3 protein and said THAP-family target protein is PAR-4. 

87. A method of modulating apoptosis in a cell comprising modulating the activity of a 
THAP-family protein.

88. The method of Paragraph 87, wherein said THAP-family protein is selected from 
the group consisting of SEQ ID NOs. 1-114.

89. A method of modulating apoptosis in a cell comprising modulating the recruitment
of PAR-4 to a PML nuclear body. 

90. The method of Paragraph 89 wherein modulating the activity of a THAP-family 
protein comprises modulating the interaction of a THAP-family protein and a THAP-family target
protein.

91. The method of Paragraph 89 wherein modulating the activity of a THAP-family 
protein comprises modulating the interaction of a THAP-family protein and a PAR4 protein. 

92. The method of Paragraph 91 comprising modulating the interaction between a
THAP-1, THAP-2, or THAP-3 protein and a PAR-4 protein. 

93. A method of modulating the recruitment of PAR-4 to a PML nuclear body
comprising modulating the interaction of said PAR-4 protein and a THAP-family protein.

94. The method of Paragraph 93, wherein said THAP-family protein is selected from 
the group consisting of SEQ ID NOs. 1-114. 

95. A method of modulating angiogenesis in an individual comprising modulating the 
activity of a THAP-family protein in said individual.

96. The method of Paragraph 95, wherein said THAP-family protein is selected from 
the group consisting of SEQ ID NOs. 1-114.

97. A method of preventing cell death in an individual comprising inhibiting the 
activity of a THAP-family protein in said individual. 

98. The method of Paragraph 97, wherein said THAP-family protein is selected from
the group consisting of SEQ ID NOs. 1-114.

99. The method according to Paragraph 97, wherein the activity of said THAP-family 
protein is inhibited in the CNS. 

100. A method of inducing angiogenesis in an individual comprising inhibiting the 
activity of a THAP-family protein in said individual.

101. The method of Paragraph 100, wherein said THAP-family protein is selected from 
the group consisting of SEQ ID NOs. 1-114.

102. A method according to Paragraph 100, wherein the activity of said THAP-family 
protein is inhibited in endothelial cells. 

103. A method of inhibiting angiogenesis or treating cancer in an individual comprising
increasing the activity of a THAP-family protein in said individual.

104. The method of Paragraph 103, wherein said THAP-family protein is selected from
the group consisting of SEQ ID NOs. 1-114. 

105. A method of treating inflammation or an inflammatory disorder in an individual 
comprising increasing the activity of a THAP-family protein in said individual.

106. The method of Paragraph 105, wherein said THAP-family protein is selected from
the group consisting of SEQ ID NOs. 1-114.

107. A method according to Paragraphs 103 or 105, wherein the activity of said
THAP-family protein is increased in endothelial cells.

108. A method of treating cancer in an individual comprising increasing the activity of a 
THAP-family protein in said individual. 

109. The method of Paragraph 108, wherein said THAP-family protein is selected from
the group consisting of SEQ ID NOs. 1-114.

110. The method of Paragraph 108, wherein increasing the activity of said THAP family
protein induces apoptosis, inhibits cell division, inhibits metastatic potential, reduces tumor burden,
increases sensitivity to chemotherapy or radiotherapy, kills a cancer cell, inhibits the growth of a
cancer cell, kills an endothelial cell, inhibits the growth of an endothelial cell, inhibits angiogenesis,
or induces tumor regression.

111. A method according to any one of Paragraphs 87-110, comprising contacting said
subject with a recombinant vector encoding a THAP-family protein according to any one of 
Paragraphs 32-43 operably linked to a promoter that functions in said cell. 

112. The method of Paragraph 111, wherein said promoter functions in an endothelial
cell.

113. A viral composition comprising a recombinant viral vector encoding a THAP- 
family protein according to Paragraphs 32-43. 

114. The composition of Paragraph 113, wherein said recombinant viral vector is an 
adenoviral, adeno-associated viral, retroviral, herpes viral, papilloma viral, or hepatitis B viral
vector.

115. A method of obtaining a nucleic acid sequence which is recognized by a THAP- 
family polypeptide comprising contacting a pool of random nucleic acids with said THAP-family 
polypeptide or a portion thereof and isolating a complex comprising said THAP-family polypeptide
and at least one nucleic acid from said pool.

116. The method of Paragraph 115, wherein said pool of nucleic acids is labeled.

117. The method of Paragraph 116 wherein said complex is isolated by performing a gel 
shift analysis. 

118. A method of identifying a nucleic acid sequence which is recognized by a THAP-
family polypeptide comprising:

(a) incubating a THAP-family polypeptide with a pool of labeled random nucleic
acids;

(b) isolating a complex between said THAP-family polypeptide and at least one
nucleic acid from said pool;

(c) performing an amplification reaction to amplify the at least one nucleic acid
present in said complex;

(d) incubating said at least one amplified nucleic acid with said THAP-family
polypeptide;

(e) isolating a complex between said at least one amplified nucleic acid and said
THAP-family polypeptide;

(f) repeating steps (c), (d) and (e) a plurality of times;

(g) determining the sequence of said nucleic acid in said complex.
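
By way of illustration only, the iterative select-and-amplify loop of steps (a) through (g) can be sketched in a few lines of Python. This is a toy model and not part of the described method: the binding test `toy_binds`, the placeholder motif `MOTIF`, the function names and the fixed copy number used for "amplification" are all assumptions introduced for the example. A real experiment would involve probabilistic binding and progressive enrichment over successive rounds.

```python
import random

# Hypothetical placeholder motif; it is NOT taken from the specification.
MOTIF = "GATTACA"


def toy_binds(seq):
    """Toy stand-in for complex formation: 'binds' if the motif is present."""
    return MOTIF in seq


def select_bound(pool, binds):
    """Steps (b) and (e): keep only sequences found in a complex."""
    return [seq for seq in pool if binds(seq)]


def amplify(seqs, copies=3):
    """Step (c): stand-in for an amplification reaction (simple duplication)."""
    return [seq for seq in seqs for _ in range(copies)]


def enrich(pool, binds, rounds=3):
    """Run steps (a)-(g) on an in-memory pool of sequences."""
    bound = select_bound(pool, binds)                 # steps (a)-(b)
    for _ in range(rounds):                           # step (f)
        bound = select_bound(amplify(bound), binds)   # steps (c)-(e)
    return sorted(set(bound))                         # step (g): report sequences


# Pool of random 12-mers plus one sequence known to carry the toy motif.
rng = random.Random(0)
pool = ["".join(rng.choice("ACGT") for _ in range(12)) for _ in range(200)]
pool.append("CCGATTACATGG")

selected = enrich(pool, toy_binds)
```

Because the toy binding test is deterministic, a single round already isolates every "binder"; the loop is retained only to mirror the repetition recited in step (f).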

119. A method of identifying a compound which inhibits the ability of a THAP-family 
polypeptide to bind to a nucleic acid comprising: incubating a THAP-family polypeptide or a
fragment thereof which recognizes a binding site in a nucleic acid with a nucleic acid containing 
said binding site in the presence or absence of a test compound and determining whether the level 
of binding of said THAP-family polypeptide to said nucleic acid in the presence of said test 
compound is less than the level of binding in the absence of said test compound. 
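
The comparison recited in Paragraph 119 amounts to checking whether the binding level measured with the test compound is lower than the level measured without it. A minimal sketch follows, assuming that binding has already been quantified as replicate numeric readings in arbitrary units. The function name, the use of replicate means and the optional `min_fraction_drop` margin are assumptions made for this example and are not specified in the text.

```python
from statistics import mean


def inhibits_binding(without_compound, with_compound, min_fraction_drop=0.0):
    """Return True if mean binding with the compound is below mean binding without it.

    without_compound, with_compound: replicate binding readings (arbitrary units).
    min_fraction_drop: hypothetical extra margin; 0.0 means any decrease counts.
    """
    baseline = mean(without_compound)
    if baseline <= 0:
        raise ValueError("baseline binding must be positive")
    return mean(with_compound) < baseline * (1.0 - min_fraction_drop)
```

For example, `inhibits_binding([100, 98, 102], [40, 45, 38])` evaluates to `True`, whereas identical readings in both conditions evaluate to `False`.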

120. A method of identifying a test compound that modulates THAP-mediated activities 
comprising: 

contacting a THAP-family polypeptide or a biologically active fragment thereof 
with a test compound, wherein said THAP-family polypeptide comprises an amino acid 
sequence having at least 30% amino acid identity to an amino acid sequence of SEQ ID 
NO: 1; and 

determining whether said test compound selectively modulates the activity of said
THAP-family polypeptide or biologically active fragment thereof, wherein a determination
that said test compound selectively modulates the activity of said polypeptide indicates that
said test compound is a candidate modulator of THAP-mediated activities.

121. The method of Paragraph 120, wherein the THAP-family polypeptide comprises
the amino acid sequence of SEQ ID NO: 1, or a biologically active fragment thereof.

122. The method of Paragraph 120, wherein the THAP-family polypeptide comprises
the amino acid sequence of SEQ ID NO: 2, or a biologically active fragment thereof.

123. The method of Paragraph 120, wherein the THAP-family polypeptide comprises 
the amino acid sequence of SEQ ID NO: 3, or a biologically active fragment thereof. 

124. The method of Paragraph 120, wherein the THAP-family polypeptide comprises 
the amino acid sequence of SEQ ID NO: 4, or a biologically active fragment thereof.

125. The method of Paragraph 120, wherein the THAP-family polypeptide comprises
the amino acid sequence of SEQ ID NO: 5, or a biologically active fragment thereof.

126. The method of Paragraph 120, wherein the THAP-family polypeptide comprises
the amino acid sequence of SEQ ID NO: 6, or a biologically active fragment thereof.

127. The method of Paragraph 120, wherein the THAP-family polypeptide comprises
the amino acid sequence of SEQ ID NO: 7, or a biologically active fragment thereof.

128. The method of Paragraph 120, wherein the THAP-family polypeptide comprises
the amino acid sequence of SEQ ID NO: 8, or a biologically active fragment thereof.

129. The method of Paragraph 120, wherein the THAP-family polypeptide comprises 
the amino acid sequence of SEQ ID NO: 9, or a biologically active fragment thereof. 

130. The method of Paragraph 120, wherein the THAP-family polypeptide comprises 
the amino acid sequence of SEQ ID NO: 10, or a biologically active fragment thereof.

131. The method of Paragraph 120, wherein the THAP-family polypeptide comprises 
the amino acid sequence of SEQ ID NO: 11, or a biologically active fragment thereof.

132. The method of Paragraph 120, wherein the THAP-family polypeptide comprises 
the amino acid sequence of SEQ ID NO: 12, or a biologically active fragment thereof. 

133. The method of Paragraph 120, wherein the THAP-family polypeptide comprises 
the amino acid sequence of SEQ ID NO: 13, or a biologically active fragment thereof. 

134. The method of Paragraph 120, wherein the THAP-family polypeptide comprises 
the amino acid sequence of SEQ ID NO: 14, or a biologically active fragment thereof.

135. The method of Paragraph 120, wherein the THAP-family polypeptide comprises
an amino acid sequence selected from the group consisting of SEQ ID NOs: 15-114, or a
biologically active fragment thereof.

136. The method of Paragraph 120, wherein said THAP-mediated activity is selected 
from the group consisting of interaction with a THAP-family target protein, binding to a nucleic 
acid, binding to PAR-4, binding to SLC, binding to PML, binding to a polypeptide found in PML- 
NBs, localization to PML-NBs, targeting a THAP-family target protein to PML-NBs, and inducing
apoptosis.

137. The method of Paragraph 136, wherein said THAP-mediated activity is binding to
PAR-4.

138. The method of Paragraph 136, wherein said THAP-mediated activity is binding to
SLC.

139. The method of Paragraph 136, wherein said THAP-mediated activity is inducing 
apoptosis. 

140. The method of Paragraph 136, wherein said nucleic acid comprises a nucleotide 
sequence selected from the group consisting of SEQ ID NOs: 140-159.

141. The method of Paragraph 120, wherein said amino acid identity is determined using
an algorithm selected from the group consisting of XBLAST with the parameters, score=50 and
wordlength=3, Gapped BLAST with the default parameters of XBLAST, and BLAST with the
default parameters of XBLAST.
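
As a hedged illustration of the "at least 30% amino acid identity" criterion, the sketch below computes percent identity from two sequences that have already been aligned, with gaps written as `-`. It does not perform the XBLAST or Gapped BLAST alignment itself; the alignment step is assumed to have been carried out separately. The function names and the choice to count only gap-free columns in the denominator are assumptions of this example, and other conventions for the denominator exist.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of gap-free alignment columns in which both residues are identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where neither sequence has a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != gap and b != gap]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a, aligned_b, threshold=30.0):
    """True if the aligned pair reaches the stated identity threshold (default 30%)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For instance, the toy aligned pair `"MKTA"` and `"MKGG"` shares two of four columns, giving 50% identity, which satisfies a 30% threshold.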

142. An isolated or purified THAP domain polypeptide consisting essentially of an
amino acid sequence selected from the group consisting of SEQ ID NOs: 1-2, amino acids 1-89 of
SEQ ID NOs: 3-5, amino acids 1-90 of SEQ ID NOs: 6-9, amino acids 1-92 of SEQ ID NO: 10,
amino acids 1-90 of SEQ ID NOs: 11-14 and homologs having at least 30% amino acid identity to
any aforementioned sequence, wherein said polypeptide binds to a nucleic acid.

143. The isolated or purified THAP domain polypeptide of Paragraph 142 consisting 
essentially of SEQ ID NO: 1. 

144. The isolated or purified THAP domain polypeptide of Paragraph 142, wherein said
amino acid identity is determined using an algorithm selected from the group consisting of
XBLAST with the parameters, score=50 and wordlength=3, Gapped BLAST with the default
parameters of XBLAST, and BLAST with the default parameters of XBLAST.

145. The isolated or purified THAP domain polypeptide of Paragraph 142, wherein said
nucleic acid comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs:
140-159.

146. An isolated or purified nucleic acid which encodes the THAP domain polypeptide 
of Paragraph 142 or a complement thereof. 

147. An isolated or purified PAR4-binding domain polypeptide consisting essentially of 
an amino acid sequence selected from the group consisting of amino acids 143-192 of SEQ ID NO: 
3, amino acids 132-181 of SEQ ID NO: 4, amino acids 186-234 of SEQ ID NO: 5, SEQ ID NO:
15 and homologs having at least 30% amino acid identity to any aforementioned sequence, wherein 
said polypeptide binds to PAR4. 

148. The isolated or purified PAR4-binding domain of Paragraph 147 consisting 
essentially of SEQ ID NO: 15.

149. The isolated or purified PAR4-binding domain of Paragraph 147 consisting 
essentially of amino acids 143-193 of SEQ ID NO: 3. 

150. The isolated or purified PAR4-binding domain of Paragraph 147 consisting 
essentially of amino acids 132-181 of SEQ ID NO: 4. 

151. The isolated or purified PAR4-binding domain of Paragraph 147 consisting
essentially of amino acids 186-234 of SEQ ID NO: 5.

152. The isolated or purified PAR4-binding domain polypeptide of Paragraph 147, 
wherein said amino acid identity is determined using an algorithm selected from the group 
consisting of XBLAST with the parameters, score=50 and wordlength=3, Gapped BLAST with the
default parameters of XBLAST, and BLAST with the default parameters of XBLAST.

153. An isolated or purified nucleic acid which encodes the PAR4-binding domain 
polypeptide of Paragraph 147 or a complement thereof. 

154. An isolated or purified SLC-binding domain polypeptide consisting essentially of 
an amino acid sequence selected from the group consisting of amino acids 143-213 of SEQ ID NO: 
3 and homologs thereof having at least 30% amino acid identity, wherein said polypeptide binds to 
SLC.

155. The isolated or purified SLC-binding domain polypeptide of Paragraph 154,
wherein said amino acid identity is determined using an algorithm selected from the group
consisting of XBLAST with the parameters, score=50 and wordlength=3, Gapped BLAST with the
default parameters of XBLAST, and BLAST with the default parameters of XBLAST.

156. An isolated or purified nucleic acid which encodes the SLC-binding domain
polypeptide of Paragraph 154 or a complement thereof.

157. A fusion protein comprising an Fc region of an immunoglobulin fused to a 
polypeptide comprising an amino acid sequence selected from the group consisting of amino acids 
143-213 of SEQ ID NO: 3 and homologs thereof having at least 30% amino acid identity.

158. An oligomeric THAP protein comprising a plurality of THAP polypeptides,
wherein each THAP polypeptide comprises an amino acid sequence selected from the group
consisting of amino acids 143-213 of SEQ ID NO: 3 and homologs thereof having at least 30%
amino acid identity.

159. A medicament comprising an effective amount of a THAP1 polypeptide or an 
SLC-binding fragment thereof, together with a pharmaceutically acceptable carrier.

160. An isolated or purified THAP dimerization domain polypeptide consisting 
essentially of an amino acid sequence selected from the group consisting of amino acids 143 and 
192 of SEQ ID NO: 3 and homologs thereof having at least 30% amino acid identity, wherein said 
polypeptide binds to a THAP-family polypeptide.

161. The isolated or purified THAP dimerization domain polypeptide of Paragraph 160,
wherein said amino acid identity is determined using an algorithm selected from the group
consisting of XBLAST with the parameters, score=50 and wordlength=3, Gapped BLAST with the
default parameters of XBLAST, and BLAST with the default parameters of XBLAST.

162. An isolated or purified nucleic acid which encodes the THAP dimerization domain 
polypeptide of Paragraph 160 or a complement thereof.

163. An expression vector comprising a promoter operably linked to a nucleic acid 
having a nucleotide sequence selected from the group consisting of SEQ ID NOs: 160-175 and 
portions thereof comprising at least 18 consecutive nucleotides. 

164. The expression vector of Paragraph 163, wherein said promoter is a promoter 
which is not operably linked to said nucleic acid selected from the group consisting of SEQ ID
NOs.: 160-175 in a naturally occurring genome.

165. A host cell comprising the expression vector of Paragraph 163. 

166. An expression vector comprising a promoter operably linked to a nucleic acid
encoding a polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NOs: 1-114 and portions thereof comprising at least 18 consecutive nucleotides.

167. The expression vector of Paragraph 166, wherein said promoter is a promoter
which is not operably linked to said nucleic acid selected from the group consisting of SEQ ID
NOs.: 160-175 in a naturally occurring genome.

168. A host cell comprising the expression vector of Paragraph 166. 

169. A method of identifying a candidate inhibitor of a THAP-family polypeptide, a 
candidate inhibitor of apoptosis, or a candidate compound for the treatment of a cell proliferative 
disorder, said method comprising: 

contacting a THAP-family polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 1-114 or a fragment comprising a span
of at least 6 contiguous amino acids of a polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NOs: 1-114 with a test compound; and

determining whether said compound selectively binds to said polypeptide, wherein 
a determination that said compound selectively binds to said polypeptide indicates that said 
compound is a candidate inhibitor of a THAP-family polypeptide, a candidate inhibitor of 
apoptosis, or a candidate compound for the treatment of a cell proliferative disorder.

170. A method of identifying a candidate inhibitor of apoptosis, a candidate compound 
for the treatment of a cell proliferative disorder, or a candidate inhibitor of a THAP-family 
polypeptide of SEQ ID NOs: 1-114 or a fragment comprising a span of at least 6 contiguous amino
acids of a polypeptide according to SEQ ID NOs: 1-114, said method comprising:

contacting said THAP-family polypeptide with a test compound; and

determining whether said compound selectively inhibits at least one biological 
activity selected from the group consisting of interaction with a THAP-family target 
protein, binding to a nucleic acid sequence, binding to PAR-4, binding to SLC, binding to 
PML, binding to a polypeptide found in PML-NBs, localization to PML-NBs, targeting a 
THAP-family target protein to PML-NBs, and inducing apoptosis, wherein a determination
that said compound selectively inhibits said at least one biological activity of said
polypeptide indicates that said compound is a candidate inhibitor of a THAP-family 
polypeptide, a candidate inhibitor of apoptosis, or a candidate compound for the treatment 
of a cell proliferative disorder.

171. A method of identifying a candidate inhibitor of apoptosis, a candidate compound
for the treatment of a cell proliferative disorder, or a candidate inhibitor of a THAP-family
polypeptide of SEQ ID NOs: 1-114 or a fragment comprising a span of at least 6 contiguous amino
acids of a polypeptide according to SEQ ID NOs: 1-114, said method comprising:

contacting a cell comprising said THAP-family polypeptide with a test compound;
and

determining whether said compound selectively inhibits at least one biological
activity selected from the group consisting of interaction with a THAP-family target
protein, binding to a nucleic acid sequence, binding to PAR-4, binding to SLC, binding to
PML, binding to a polypeptide found in PML-NBs, localization to PML-NBs, targeting a
THAP-family target protein to PML-NBs, and inducing apoptosis, wherein a determination
that said compound selectively inhibits said at least one biological activity of said
polypeptide indicates that said compound is a candidate inhibitor of a THAP-family
polypeptide, a candidate inhibitor of apoptosis, or a candidate compound for the treatment
of a cell proliferative disorder.

172. A method of identifying a candidate modulator of THAP-family activity, said 
method comprising: 

providing a THAP-family polypeptide of SEQ ID NOs: 1-114, or a fragment
comprising a span of at least 6 contiguous amino acids of a polypeptide according to SEQ 
ID NOs: 1-114; and 

providing a THAP-family target polypeptide or a fragment thereof; and 
determining whether a test compound selectively modulates the ability of said 
THAP-family polypeptide to bind to said THAP-family target polypeptide, wherein a 
determination that said test compound selectively modulates the ability of said THAP- 
family polypeptide to bind to said THAP-family target polypeptide indicates that said 
compound is a candidate modulator of THAP-family activity. 

173. The method of Paragraph 172, wherein said THAP-family polypeptide is provided 
by a first expression vector comprising a nucleic acid encoding a THAP-family polypeptide of SEQ 
ID NOs: 1-114, or a fragment comprising a span of at least 6 contiguous amino acids of
a polypeptide according to SEQ ID NOs: 1-114, and wherein said THAP-family target polypeptide
is provided by a second expression vector comprising a nucleic acid encoding a THAP-family 
target polypeptide, or a fragment thereof. 

174. The method of Paragraph 172, wherein said THAP-family activity is apoptosis
activity.

175. The method of Paragraph 172, wherein said THAP-family target protein is PAR-4. 

176. The method of Paragraph 172, wherein said THAP-family polypeptide is a THAP-1,
THAP-2 or THAP-3 protein and said THAP-family target protein is PAR-4.

177. The method of Paragraph 172, wherein said THAP-family target protein is SLC. 

178. A method of modulating apoptosis in a cell comprising modulating the activity of a 
THAP-family protein. 

179. The method of Paragraph 178, wherein said THAP-family protein is selected from 
the group consisting of SEQ ID NOs: 1-114.

180. The method of Paragraph 178, wherein modulating the activity of a THAP-family 
protein comprises modulating the interaction of a THAP-family protein and a THAP-family target 
protein.

181. The method of Paragraph 178, wherein modulating the activity of a THAP-family
protein comprises modulating the interaction of a THAP-family protein and a PAR4 protein.

182. A method of identifying a candidate activator of a THAP-family polypeptide, a 
candidate activator of apoptosis, or a candidate compound for the treatment of a cell proliferative
disorder, said method comprising:

contacting a THAP-family polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 1-98 or a fragment comprising a span 
of at least 6 contiguous amino acids of a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 1-98 with a test compound; and

determining whether said compound selectively binds to said polypeptide, wherein
a determination that said compound selectively binds to said polypeptide indicates that said
compound is a candidate activator of a THAP-family polypeptide, a candidate activator of 
apoptosis, or a candidate compound for the treatment of a cell proliferative disorder. 

183. A method of identifying a candidate activator of apoptosis, a candidate compound 
for the treatment of a cell proliferative disorder, or a candidate activator of a THAP-family
polypeptide of SEQ ID NOs: 1-98 or a fragment comprising a span of at least 6 contiguous amino
acids of a polypeptide according to SEQ ID NOs: 1-98, said method comprising: 

contacting said THAP-family polypeptide with a test compound; and

determining whether said compound selectively activates at least one biological
activity selected from the group consisting of interaction with a THAP-family target
protein, binding to a nucleic acid sequence, binding to PAR-4, binding to SLC, binding to
PML, binding to a polypeptide found in PML-NBs, localization to PML-NBs, targeting a
THAP-family target protein to PML-NBs, and inducing apoptosis, wherein a determination
that said compound selectively activates said at least one biological activity of said
polypeptide indicates that said compound is a candidate activator of a THAP-family
polypeptide, a candidate activator of apoptosis, or a candidate compound for the treatment
of a cell proliferative disorder.

184. A method of identifying a candidate activator of apoptosis, a candidate compound
for the treatment of a cell proliferative disorder, or a candidate activator of a THAP-family
polypeptide of SEQ ID NOs: 1-98 or a fragment comprising a span of at least 6 contiguous amino
acids of a polypeptide according to SEQ ID NOs: 1-98, said method comprising:

contacting a cell comprising said THAP-family polypeptide with a test compound;
and

determining whether said compound selectively activates at least one biological
activity selected from the group consisting of interaction with a THAP-family target
protein, binding to a nucleic acid sequence, binding to PAR-4, binding to SLC, binding to
PML, binding to a polypeptide found in PML-NBs, localization to PML-NBs, targeting a
THAP-family target protein to PML-NBs, and inducing apoptosis, wherein a determination
that said compound selectively activates said at least one biological activity of said
polypeptide indicates that said compound is a candidate activator of a THAP-family
polypeptide, a candidate activator of apoptosis, or a candidate compound for the treatment
of a cell proliferative disorder.

185. A method of ameliorating a condition associated with the activity of SLC in an 
individual comprising administering a polypeptide comprising the SLC binding domain of a THAP- 
family protein to said individual. 

186. The method of Paragraph 185, wherein said polypeptide comprises a fusion protein 
comprising an Fc region of an immunoglobulin fused to a polypeptide comprising an amino acid
sequence selected from the group consisting of amino acids 143-213 of SEQ ID NO: 3 and
homologs thereof having at least 30% amino acid identity. 

187. The method of Paragraph 185, wherein said polypeptide comprises an oligomeric 
THAP protein comprising a plurality of THAP polypeptides, wherein each THAP polypeptide
comprises an amino acid sequence selected from the group consisting of amino acids 143-213 of
SEQ ID NO: 3 and homologs thereof having at least 30% amino acid identity.

188. A method of modulating angiogenesis in an individual comprising modulating the 
activity of a THAP-family protein in said individual. 

189. The method of Paragraph 188, wherein said THAP-family protein is selected from 
the group consisting of SEQ ID NOs: 1-114.

190. The method of Paragraph 188, wherein said modulation is inhibition.

191. The method of Paragraph 188, wherein said modulation is induction.

192. A method of reducing cell death in an individual comprising inhibiting the activity 
of a THAP-family protein in said individual. 

193. The method of Paragraph 192, wherein said THAP-family protein is selected from
the group consisting of SEQ ID NOs: 1-114.

194. The method according to Paragraph 192, wherein the activity of said THAP-family 
protein is inhibited in the CNS. 

195. A method of reducing inflammation or an inflammatory disorder in an individual 
comprising modulating the activity of a THAP-family protein in said individual.

196. The method of Paragraph 195, wherein said THAP-family protein is selected from 
the group consisting of SEQ ID NOs: 1-114.

197. A method of reducing the extent of cancer in an individual comprising modulating 
the activity of a THAP-family protein in said individual. 

198. The method of Paragraph 197, wherein said THAP-family protein is selected from
the group consisting of SEQ ID NOs: 1-114.

199. The method of Paragraph 197, wherein increasing the activity of said THAP family
protein induces apoptosis, inhibits cell division, inhibits metastatic potential, reduces tumor burden,
increases sensitivity to chemotherapy or radiotherapy, kills a cancer cell, inhibits the growth of a
cancer cell, kills an endothelial cell, inhibits the growth of an endothelial cell, inhibits angiogenesis,
or induces tumor regression.

200. A method of forming a complex, said method comprising:

contacting a chemokine with a chemokine-binding agent comprising a polypeptide
selected from the group consisting of THAP-1, a polypeptide having at least 30% amino
acid identity to THAP-1, a chemokine-binding domain of THAP-1 and a polypeptide
having at least 30% amino acid identity to a chemokine-binding domain of THAP-1,
wherein said chemokine and said chemokine-binding agent form a complex.

201. The method of Paragraph 200, wherein said amino acid identity is determined using
an algorithm selected from the group consisting of XBLAST with the parameters, score=50 and
wordlength=3, Gapped BLAST with the default parameters of XBLAST, and BLAST with the
default parameters of XBLAST.

202. The method of Paragraph 200, wherein said polypeptide is fused to an Fc region of 
an immunoglobulin. 

203. The method of Paragraph 200, wherein said polypeptide comprises a THAP
dimerization domain. 

204. The method of Paragraph 203, wherein said THAP dimerization domain interacts
with one or more THAP dimerization domains to form a THAP oligomer.

205. The method of Paragraph 200, wherein said polypeptide is a recombinant 
polypeptide. 

206. The method of Paragraph 200, wherein said chemokine is selected from the group 
consisting of SLC, CCL19, CCL5, CXCL9 and CXCL10.

207. The method of Paragraph 200, wherein said chemokine is selected from the group 
consisting of SLC, CCL19 and CXCL9. 

208. The method of Paragraph 200, wherein said polypeptide comprises THAP-1.

209. The method of Paragraph 208, wherein said THAP-1 comprises the amino acid 
sequence of SEQ ID NO: 3.

210. The method of Paragraph 200, wherein said polypeptide comprises a polypeptide 
having at least 30% amino acid identity to THAP-1.

211. The method of Paragraph 200, wherein said polypeptide comprises a chemokine- 
binding domain of THAP-1. 

212. The method of Paragraph 211, wherein said chemokine-binding domain of THAP-1
comprises the amino acid sequence of amino acids 143-213 of SEQ ID NO: 3.

213. The method of Paragraph 200, wherein said polypeptide comprises a polypeptide
having at least 30% amino acid identity to a chemokine-binding domain of THAP-1.

214. A method of inhibiting the activity of a chemokine, said method comprising 
contacting a chemokine with an effective amount of an agent comprising a polypeptide selected
from the group consisting of THAP-1, a polypeptide having at least 30% amino acid identity to
THAP-1, a chemokine-binding domain of THAP-1 and a polypeptide having at least 30% amino
acid identity to a chemokine-binding domain of THAP-1, wherein the activity of said chemokine is 
inhibited. 

215. The method of Paragraph 214, wherein said amino acid identity is determined using
an algorithm selected from the group consisting of XBLAST with the parameters, score=50 and
wordlength=3, Gapped BLAST with the default parameters of XBLAST, and BLAST with the
default parameters of XBLAST.

216. The method of Paragraph 214, wherein said polypeptide is fused to an Fc region of 
an immunoglobulin. 

217. The method of Paragraph 214, wherein said polypeptide comprises a THAP
dimerization domain.

218. The method of Paragraph 217, wherein said THAP dimerization domain interacts 
with one or more THAP dimerization domains to form a THAP oligomer. 

219. The method of Paragraph 214, wherein said polypeptide is a recombinant 
polypeptide.

220. The method of Paragraph 214, wherein said polypeptide binds to a chemokine 
selected from the group consisting of SLC, CCL19, CCL5, CXCL9 and CXCL10. 

221. The method of Paragraph 214, wherein said polypeptide binds to a chemokine 
selected from the group consisting of SLC, CCL19 and CXCL9. 

222. The method of Paragraph 214, wherein said polypeptide comprises THAP-1.

223. The method of Paragraph 222, wherein said THAP-1 comprises the amino acid 
sequence of SEQ ID NO: 3. 

224. The method of Paragraph 214, wherein said polypeptide comprises a polypeptide 
having at least 30% amino acid identity to THAP-1.

225. The method of Paragraph 214, wherein said polypeptide comprises a chemokine-
binding domain of THAP-1.

226. The method of Paragraph 225, wherein said chemokine-binding domain of THAP-1 
comprises the amino acid sequence of amino acids 143-213 of SEQ ID NO: 3. 

227. The method of Paragraph 214, wherein said polypeptide comprises a polypeptide 
having at least 30% amino acid identity to a chemokine-binding domain of THAP-1.

228. A method of reducing inflammation comprising administering an effective amount
of a chemokine-binding agent to a subject afflicted with an inflammatory condition, wherein said
chemokine-binding agent comprises a polypeptide selected from the group consisting of THAP-1, a
polypeptide having at least 30% amino acid identity to THAP-1, a chemokine-binding domain of
THAP-1 and a polypeptide having at least 30% amino acid identity to a chemokine-binding domain
of THAP-1.

229. The method of Paragraph 228, wherein said amino acid identity is determined using
an algorithm selected from the group consisting of XBLAST with the parameters, score=50 and
wordlength=3, Gapped BLAST with the default parameters of XBLAST, and BLAST with the
default parameters of XBLAST.

230. The method of Paragraph 228, wherein said polypeptide is fused to an Fc region of 
an immunoglobulin.

231. The method of Paragraph 228, wherein said polypeptide comprises a THAP 
dimerization domain. 

232. The method of Paragraph 231, wherein said THAP dimerization domain interacts 
with one or more THAP dimerization domains to form a THAP oligomer. 

233. The method of Paragraph 228, wherein said polypeptide is a recombinant
polypeptide.

234. The method of Paragraph 228, wherein said polypeptide binds to a chemokine 
selected from the group consisting of SLC, CCL19, CCL5, CXCL9 and CXCL10. 

235. The method of Paragraph 228, wherein said polypeptide binds to a chemokine
selected from the group consisting of SLC, CCL19 and CXCL9.

236. The method of Paragraph 228, wherein said polypeptide comprises THAP-1.

237. The method of Paragraph 236, wherein said THAP-1 comprises the amino acid 
sequence of SEQ ID NO: 3. 

238. The method of Paragraph 228, wherein said polypeptide comprises a polypeptide
having at least 30% amino acid identity to THAP-1.

239. The method of Paragraph 228, wherein said polypeptide comprises a chemokine-
binding domain of THAP-1.

240. The method of Paragraph 239, wherein said chemokine-binding domain of THAP-1 
comprises the amino acid sequence of amino acids 143-213 of SEQ ID NO: 3. 

241. The method of Paragraph 228, wherein said polypeptide comprises a polypeptide
having at least 30% amino acid identity to a chemokine-binding domain of THAP-1.

242. A method of reducing one or more symptoms associated with an inflammatory 
disease, said method comprising administering to a subject afflicted with said inflammatory disease 
a therapeutically effective amount of an agent which reduces or eliminates the activity of one or 
more chemokines, wherein said agent comprises a polypeptide selected from the group consisting 
of THAP-1, a polypeptide having at least 30% amino acid identity to THAP-1, a chemokine-
binding domain of THAP-1 and a polypeptide having at least 30% amino acid identity to a
chemokine-binding domain of THAP-1.

243. The method of Paragraph 242, wherein said polypeptide is fused to an Fc region of 
an immunoglobulin. 

244. The method of Paragraph 242, wherein said polypeptide comprises a THAP 
dimerization domain. 

245. The method of Paragraph 244, wherein said THAP dimerization domain interacts 
with one or more THAP dimerization domains to form a THAP oligomer. 

246. The method of Paragraph 242, wherein said polypeptide is a recombinant 
polypeptide. 

247. The method of Paragraph 242, wherein said polypeptide binds to a chemokine 
selected from the group consisting of SLC, CCL19, CCL5, CXCL9 and CXCL10. 

248. The method of Paragraph 242, wherein said polypeptide binds to a chemokine 
selected from the group consisting of SLC, CCL19 and CXCL9. 

249. The method of Paragraph 242, wherein said polypeptide comprises THAP-1.

250. The method of Paragraph 249, wherein said THAP-1 comprises the amino acid 
sequence of SEQ ID NO: 3. 

251. The method of Paragraph 242, wherein said polypeptide comprises a polypeptide
having at least 30% amino acid identity to THAP-1. 

252. The method of Paragraph 242, wherein said polypeptide comprises a chemokine-
binding domain of THAP-1.

253. The method of Paragraph 252, wherein said chemokine-binding domain of THAP-1 
comprises the amino acid sequence of amino acids 143-213 of SEQ ID NO: 3. 

254. The method of Paragraph 242, wherein said polypeptide comprises a polypeptide 
having at least 30% amino acid identity to a chemokine-binding domain of THAP-1. 

255. The method of Paragraph 242, wherein said inflammatory disease is arthritis. 

256. The method of Paragraph 242, wherein said inflammatory disease is inflammatory 
bowel disease. 

257. A method of detecting a chemokine, said method comprising: 

contacting a chemokine with a chemokine-binding agent comprising a polypeptide 
selected from the group consisting of THAP-1, a polypeptide having at least 30% amino 
acid identity to THAP-1, a chemokine-binding domain of THAP-1 and a polypeptide 
having at least 30% amino acid identity to a chemokine-binding domain of THAP-1; and

detecting chemokine-binding agent bound to said chemokine. 

258. The method of Paragraph 257, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL9 and CXCL10.

259. The method of Paragraph 257, wherein said chemokine is selected from the group
consisting of SLC, CCL19 and CXCL9. 

260. A detection system comprising a chemokine-binding agent comprising a
polypeptide selected from the group consisting of THAP-1, a polypeptide having at least 30%
amino acid identity to THAP-1, a chemokine-binding domain of THAP-1 and a polypeptide having
at least 30% amino acid identity to a chemokine-binding domain of THAP-1, wherein said
chemokine-binding agent is coupled to a solid support.

261. The detection system of Paragraph 260, wherein said polypeptide comprises 
THAP-1. 

262. The detection system of Paragraph 261, wherein said THAP-1 comprises the amino
acid sequence of SEQ ID NO: 3.

263. The detection system of Paragraph 260, wherein said polypeptide comprises a 
polypeptide having at least 30% amino acid identity to THAP-1. 

264. The detection system of Paragraph 260, wherein said polypeptide comprises a
chemokine-binding domain of THAP-1.

265. The detection system of Paragraph 264, wherein said chemokine-binding domain of 
THAP-1 comprises the amino acid sequence of amino acids 143-213 of SEQ ID NO: 3. 

266. The detection system of Paragraph 260, wherein said polypeptide comprises a 
polypeptide having at least 30% amino acid identity to a chemokine-binding domain of THAP-1. 

267. A pharmaceutical composition comprising a chemokine-binding agent in a
pharmaceutically acceptable carrier, wherein said chemokine-binding agent comprises a polypeptide
selected from the group consisting of THAP-1, a polypeptide having at least 30% amino acid
identity to THAP-1, a chemokine-binding domain of THAP-1 and a polypeptide having at least
30% amino acid identity to a chemokine-binding domain of THAP-1.

268. The pharmaceutical composition of Paragraph 267, wherein said amino acid
identity is determined using an algorithm selected from the group consisting of XBLAST with the
parameters, score=50 and wordlength=3, Gapped BLAST with the default parameters of XBLAST,
and BLAST with the default parameters of XBLAST.

269. The pharmaceutical composition of Paragraph 267, wherein said polypeptide is
fused to an Fc region of an immunoglobulin.

270. The pharmaceutical composition of Paragraph 267, wherein said polypeptide 
comprises a THAP dimerization domain. 

271. The pharmaceutical composition of Paragraph 270, wherein said THAP
dimerization domain interacts with one or more THAP dimerization domains to form a THAP
oligomer.

272. The pharmaceutical composition of Paragraph 267, wherein said polypeptide binds 
to a chemokine selected from the group consisting of SLC, CCL19, CCL5, CXCL9 and CXCL10.

273. The pharmaceutical composition of Paragraph 267, wherein said polypeptide binds
to a chemokine selected from the group consisting of SLC, CCL19 and CXCL9. 

274. The pharmaceutical composition of Paragraph 267, wherein said polypeptide
comprises THAP-1.

275. The pharmaceutical composition of Paragraph 274, wherein said THAP-1
comprises the amino acid sequence of SEQ ID NO: 3.

276. The pharmaceutical composition of Paragraph 267, wherein said polypeptide
comprises a polypeptide having at least 30% amino acid identity to THAP-1.

277. The pharmaceutical composition of Paragraph 267, wherein said polypeptide
comprises a chemokine-binding domain of THAP-1.

278. The pharmaceutical composition of Paragraph 277, wherein said chemokine- 
binding domain of THAP-1 comprises the amino acid sequence of amino acids 143-213 of SEQ ID 
NO: 3. 

279. The pharmaceutical composition of Paragraph 267, wherein said polypeptide
comprises a polypeptide having at least 30% amino acid identity to a chemokine-binding domain of
THAP-1.

280. A device for administering an agent, said device comprising a container that
contains therein a chemokine-binding agent in a pharmaceutically acceptable carrier, wherein said
chemokine-binding agent comprises a polypeptide selected from the group consisting of THAP-1, a
polypeptide having at least 30% amino acid identity to THAP-1, a chemokine-binding domain of
THAP-1 and a polypeptide having at least 30% amino acid identity to a chemokine-binding domain
of THAP-1.

281. The device according to Paragraph 280, wherein said container is a syringe.

282. The device according to Paragraph 280, wherein said container is a patch for
transdermal administration.

283. The device according to Paragraph 280, wherein said container is a pressurized
canister.

284. A kit comprising: 

a chemokine-binding agent comprising a polypeptide selected from the group
consisting of THAP-1, a polypeptide having at least 30% amino acid identity to THAP-1, a
chemokine-binding domain of THAP-1 and a polypeptide having at least 30% amino acid
identity to a chemokine-binding domain of THAP-1; and

instructions for using said chemokine-binding agent for detecting or inhibiting 
chemokines. 

285. The kit of Paragraph 284, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL9 and CXCL10.

286. An isolated or purified chemokine-binding domain consisting essentially of a
portion of SEQ ID NO: 3 that binds to a chemokine. 

287. The isolated or purified chemokine-binding domain of Paragraph 286, wherein said 
chemokine is CCL19. 

288. The isolated or purified chemokine-binding domain of Paragraph 286, wherein said
chemokine is CCL5.

289. The isolated or purified chemokine-binding domain of Paragraph 286, wherein said 
chemokine is CXCL9. 

290. The isolated or purified chemokine-binding domain of Paragraph 286, wherein said
chemokine is CXCL10.

291. A method of modulating expression of a THAP responsive gene, said method 
comprising modulating the interaction of a THAP-family polypeptide or a biologically active 
fragment thereof with a nucleic acid, thereby enhancing or repressing expression of said THAP 
responsive gene. 

292. The method of Paragraph 291, wherein said THAP-family polypeptide is THAP1.

293. The method of Paragraph 291, wherein said nucleic acid is a THAP responsive 
promoter. 

294. The method of Paragraph 293, wherein said THAP responsive promoter comprises 
a THAP responsive element. 

295. The method of Paragraph 294, wherein said THAP responsive element is a DR-5
element.

296. The method of Paragraph 294, wherein said THAP responsive element is an ER-11
element.



297. The method of Paragraph 294, wherein said THAP responsive element is THRE.

298. The method of Paragraph 293, wherein said THAP responsive promoter does not
comprise a THAP responsive element.

299. The method of Paragraph 298, wherein said THAP responsive promoter is
modulated by a product of a gene that is under the control of a promoter which comprises a THAP 
responsive element. 

300. The method of Paragraph 291, wherein said THAP responsive gene is selected
from the group consisting of Survivin, PTTG1/Securin, PTTG2/Securin, PTTG3/Securin, CKS1,
MAD2L1, USP16/Ubp-M, HMMR/RHAMM, KIAA0008/HURP, CDCA7/JP01 and THAP1.

301. The method of Paragraph 291, wherein said THAP responsive gene encodes a
polypeptide involved in the G2 or M phase of the cell cycle. 

302. The method of Paragraph 291, wherein said THAP responsive gene encodes a
polypeptide involved in the S phase of the cell cycle.






WO 2004/055050 PCT/IB2003/006434 

303. The method of Paragraph 302, wherein said THAP responsive gene encodes a 
polypeptide involved in DNA replication. 

304. The method of Paragraph 302, wherein said THAP responsive gene encodes a 
polypeptide involved in DNA repair. 

305. The method of Paragraph 291, wherein said THAP responsive gene encodes a 
polypeptide involved in RNA splicing. 

306. The method of Paragraph 291, wherein said THAP responsive gene encodes a 
polypeptide involved in apoptosis. 

307. The method of Paragraph 291, wherein said THAP responsive gene encodes a 
polypeptide involved in angiogenesis. 

308. The method of Paragraph 291, wherein said THAP responsive gene encodes a 
polypeptide involved in the proliferation of cancer cells. 

309. The method of Paragraph 291, wherein said THAP responsive gene encodes a 
polypeptide involved in inflammatory disease. 

310. A method of modulating the expression of a gene responsive to a THAP/chemokine 
complex, said method comprising modulating the interaction of a chemokine with a THAP-family 
polypeptide or a biologically active fragment thereof, thereby enhancing or repressing expression of 
said gene. 

311. The method of Paragraph 310, wherein said THAP-family polypeptide is THAP1. 

312. The method of Paragraph 310, wherein said chemokine is selected from the group 
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9. 

313. The method of Paragraph 310, wherein said chemokine is SLC.

314. The method of Paragraph 310, wherein said chemokine is CXCL9.

315. The method of Paragraph 310, wherein the interaction between said chemokine and 
said THAP-family polypeptide is modulated by providing a THAP-type chemokine-binding agent. 

316. The method of Paragraph 315, wherein said THAP-type chemokine-binding agent 
comprises a polypeptide selected from the group consisting of a THAP1 polypeptide, a
chemokine-binding domain of a THAP1 polypeptide, a THAP1 polypeptide oligomer, an oligomer
comprising a THAP1 chemokine-binding domain, a THAP1 polypeptide-immunoglobulin fusion, a 
THAP1 chemokine-binding domain-immunoglobulin fusion and polypeptide homologs of any one 
of the aforementioned polypeptides. 

317. The method of Paragraph 316, wherein said chemokine-binding domain is an SLC- 
binding domain. 

318. The method of Paragraph 316, wherein said chemokine-binding domain is a 
CXCL9-binding domain. 

319. The method of Paragraph 310, wherein said gene encodes a polypeptide involved in 
the G2 or M phase of the cell cycle.

320. The method of Paragraph 310, wherein said gene encodes a polypeptide involved in
the S phase of the cell cycle. 

321. The method of Paragraph 310, wherein said gene encodes a polypeptide involved in
DNA replication. 

322. The method of Paragraph 310, wherein said gene encodes a polypeptide involved in
DNA repair.

323. The method of Paragraph 310, wherein said gene encodes a polypeptide involved in
RNA splicing. 

324. The method of Paragraph 310, wherein said gene encodes a polypeptide involved in
apoptosis.

325. The method of Paragraph 310, wherein said gene encodes a polypeptide involved in 
angiogenesis. 

326. The method of Paragraph 310, wherein said gene encodes a polypeptide involved in 
the proliferation of cancer cells. 

327. The method of Paragraph 310, wherein said gene encodes a polypeptide involved in
inflammatory disease.

328. A method of modulating the expression of a gene responsive to a THAP/chemokine
complex, said method comprising modulating the interaction of a THAP/chemokine complex with a
nucleic acid, thereby enhancing or repressing expression of said gene.

329. The method of Paragraph 328, wherein said THAP-family polypeptide is THAP1.

330. The method of Paragraph 328, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

331. The method of Paragraph 328, wherein said chemokine is SLC. 

332. The method of Paragraph 328, wherein said chemokine is CXCL9. 

333. The method of Paragraph 328, wherein said gene encodes a polypeptide involved in 
the G2 or M phase of the cell cycle. 

334. The method of Paragraph 328, wherein said gene encodes a polypeptide involved in 
the S phase of the cell cycle. 

335. The method of Paragraph 334, wherein said gene encodes a polypeptide involved in 
DNA replication. 

336. The method of Paragraph 334, wherein said gene encodes a polypeptide involved in 
DNA repair. 

337. The method of Paragraph 328, wherein said gene encodes a polypeptide involved in 
RNA splicing. 

338. The method of Paragraph 328, wherein said gene encodes a polypeptide involved in 
apoptosis.

339. The method of Paragraph 328, wherein said gene encodes a polypeptide involved in
angiogenesis. 

340. The method of Paragraph 328, wherein said gene encodes a polypeptide involved in 
the proliferation of cancer cells. 

341. The method of Paragraph 328, wherein said gene encodes a polypeptide involved in
inflammatory disease. 

342. The method of Paragraph 328, wherein said nucleic acid is a THAP responsive 
promoter. 

343. The method of Paragraph 342, wherein said THAP responsive promoter comprises 
a THAP responsive element. 

344. The method of Paragraph 343, wherein said THAP responsive element is a DR-5 
element. 

345. The method of Paragraph 343, wherein said THAP responsive element is an ER-11
element. 

346. The method of Paragraph 343, wherein said THAP responsive element is THRE. 

347. The method of Paragraph 342, wherein said THAP responsive promoter does not 
comprise a THAP responsive element. 

348. The method of Paragraph 347, wherein said THAP responsive promoter is 
modulated by a product of a gene that is under the control of a promoter which comprises a THAP 
responsive element. 

349. A pharmaceutical composition comprising a THAP responsive element in a
pharmaceutically acceptable carrier. 

350. The pharmaceutical composition of Paragraph 349, wherein said THAP responsive 
element is a DR-5 element. 

351. The pharmaceutical composition of Paragraph 349, wherein said THAP responsive
element is an ER-11 element.

352. The pharmaceutical composition of Paragraph 349, wherein said THAP responsive
element is a THRE.

353. A transcription factor decoy consisting essentially of a THAP responsive element. 

354. The transcription factor decoy of Paragraph 353, wherein said THAP responsive 
element is a DR-5 element. 

355. The transcription factor decoy of Paragraph 353, wherein said THAP responsive
element is an ER-11 element.

356. The transcription factor decoy of Paragraph 353, wherein said THAP responsive 
element is a THRE element. 

357. A cell comprising a transcription factor decoy of Paragraph 353.

358. A method of modulating the interaction between a nucleic acid and a THAP-family
polypeptide or a biologically active fragment thereof, said method comprising providing a
transcription factor decoy which comprises a THAP responsive element, thereby modulating the
interaction between said nucleic acid and said THAP-family polypeptide or a biologically active
fragment thereof.

359. The method of Paragraph 358, wherein said THAP-family polypeptide is THAP1. 

360. The method of Paragraph 358, wherein said THAP responsive element is a DR-5 
element. 

361. The method of Paragraph 358, wherein said THAP responsive element is an ER-11
element.

362. The method of Paragraph 358, wherein said THAP responsive element is THRE. 

363. A method of modulating the interaction between a nucleic acid and a 
THAP/chemokine complex, said method comprising providing a transcription factor decoy which 
comprises a THAP responsive element, thereby modulating the interaction between said nucleic
acid and said THAP/chemokine complex.

364. The method of Paragraph 363, wherein said THAP-family polypeptide is THAP1.

365. The method of Paragraph 363, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

366. The method of Paragraph 363, wherein said chemokine is SLC.

367. The method of Paragraph 363, wherein said chemokine is CXCL9.

368. The method of Paragraph 363, wherein said THAP responsive element is a DR-5
element.

369. The method of Paragraph 363, wherein said THAP responsive element is an ER-11
element.

370. The method of Paragraph 363, wherein said THAP responsive element is THRE.

371. A vector packaging cell line comprising a cell comprising a viral vector which 
comprises a promoter operably linked to a nucleic acid encoding a THAP-family polypeptide or a 
biologically active fragment thereof. 

372. The cell line of Paragraph 371, wherein said cell further comprises an introduced
nucleic acid construct comprising a nucleic acid encoding a chemokine operably linked to a
promoter.

373. The cell line of Paragraph 372, wherein said chemokine-encoding construct is 
included on the same vector as said nucleic acid encoding said THAP-family polypeptide or 
biologically active fragment thereof. 

374. The cell line of Paragraph 372, wherein said nucleic acid encoding said chemokine
encodes a chemokine selected from the group consisting of SLC, CCL19, CCL5, CXCL11,
CXCL10 and CXCL9.



375. The cell line of Paragraph 372, wherein said nucleic acid encoding said chemokine
encodes SLC. 

376. The cell line of Paragraph 372, wherein said nucleic acid encoding said chemokine 
encodes CXCL9. 

377. The cell line of Paragraph 371, wherein said THAP-family polypeptide is THAP1.

378. The cell line of Paragraph 371, wherein said cell is a mammalian cell. 

379. The cell line of Paragraph 378, wherein said cell is a human cell. 

380. The cell line of Paragraph 371, wherein said viral vector is an adenoviral vector. 

381. The cell line of Paragraph 371, wherein said viral vector is a retroviral vector. 

382. A cell which is genetically engineered to express a THAP-family polypeptide or a
biologically active fragment thereof.

383. The cell line of Paragraph 382, wherein said THAP-family polypeptide is THAP1. 

384. The cell line of Paragraph 382, wherein said cell is a mammalian cell. 

385. The cell line of Paragraph 382, wherein said cell is a human cell. 

386. The cell line of Paragraph 382, wherein said THAP family polypeptide is encoded
by a gene that is introduced into the cell on an adenoviral vector.

387. The cell line of Paragraph 382, wherein said THAP family polypeptide is encoded 
by a gene that is introduced into the cell on a retroviral vector. 

388. A method of constructing a cell which expresses a recombinant THAP-family
polypeptide, said method comprising introducing into a cell a vector comprising a nucleic acid
encoding a THAP-family polypeptide or a biologically active fragment thereof operably linked to a
promoter.

389. The method of Paragraph 388, further comprising introducing into a cell a nucleic 
acid construct comprising a nucleic acid encoding a chemokine operably linked to a promoter. 

390. The method of Paragraph 389, wherein said chemokine-encoding construct is
included on the same vector as said nucleic acid encoding said THAP-family polypeptide or
biologically active fragment thereof.

391. The method of Paragraph 389, wherein said nucleic acid encoding said chemokine
encodes a chemokine selected from the group consisting of SLC, CCL19, CCL5, CXCL11,
CXCL10 and CXCL9.

392. The method of Paragraph 389, wherein said nucleic acid encoding said chemokine 
encodes SLC. 

393. The method of Paragraph 389, wherein said nucleic acid encoding said chemokine 
encodes CXCL9. 

394. The method of Paragraph 388, wherein said THAP-family polypeptide is THAP1.

395. The method of Paragraph 388, wherein said cell is a mammalian cell. 

396. The method of Paragraph 395, wherein said cell is a human cell.

397. The method of Paragraph 388, wherein said vector is a viral vector.

398. The method of Paragraph 397, wherein said vector is an adenoviral vector. 

399. The method of Paragraph 397, wherein said vector is a retroviral vector. 

400. The method of Paragraph 388, wherein said vector is introduced into said cell by 
transfection. 

401. A method of ameliorating symptoms associated with a condition mediated by a 
THAP/chemokine complex, said method comprising: 

introducing into a cell a nucleic acid construct comprising a nucleic acid encoding a 
chemokine operably linked to a promoter and a nucleic acid construct comprising a nucleic 
acid encoding a THAP-family polypeptide or a biologically active fragment thereof 
operably linked to a promoter; and 

expressing said nucleic acid encoding said chemokine and said nucleic acid 
encoding said THAP-family polypeptide or biologically active fragment thereof. 

402. The method of Paragraph 401, wherein said nucleic acid constructs are present on a
single vector.

403. The method of Paragraph 401, wherein said nucleic acid constructs are present on
different vectors. 

404. The method of Paragraph 401, wherein said cell is a mammalian cell.

405. The method of Paragraph 404, wherein said cell is a human cell. 

406. The method of Paragraph 401, wherein said nucleic acid encoding said chemokine
encodes a chemokine selected from the group consisting of SLC, CCL19, CCL5, CXCL11,
CXCL10 and CXCL9.

407. The method of Paragraph 401, wherein said nucleic acid encoding said chemokine
encodes SLC. 

408. The method of Paragraph 401, wherein said nucleic acid encoding said chemokine
encodes CXCL9.

409. The method of Paragraph 401, wherein said THAP-family polypeptide is THAP1. 

410. A method of identifying a test compound that modulates transcription at a THAP 
responsive element, said method comprising: 

comparing the level of transcription from a THAP responsive promoter in the
presence and absence of a test compound, wherein a determination that the level of
transcription is increased or decreased in the presence of said test compound relative to the 
level of transcription in the absence of said test compound indicates that said test compound 
is a candidate modulator of transcription. 
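
The comparison recited in Paragraph 410 can be illustrated with a minimal sketch, offered only as an example and not as the method of the specification. It assumes that transcription from the THAP responsive promoter is read out as replicate reporter signals, and that a fold-change cutoff is used to decide whether the level is "increased or decreased"; the function name `classify_test_compound`, the replicate values and the 1.5-fold cutoff are all hypothetical and do not appear in the text.

```python
from statistics import mean


def classify_test_compound(with_compound, without_compound, min_fold_change=1.5):
    """Compare promoter activity with and without a test compound.

    with_compound / without_compound: replicate reporter readings (arbitrary units).
    min_fold_change: hypothetical cutoff; the text only requires an increase
    or a decrease relative to the no-compound level.
    Returns a (label, ratio) pair.
    """
    treated = mean(with_compound)
    control = mean(without_compound)
    if control <= 0 or treated < 0:
        raise ValueError("reporter readings must be non-negative, control positive")

    # Ratio of transcription in the presence vs. absence of the test compound.
    ratio = treated / control

    if ratio >= min_fold_change:
        return "candidate modulator (increased transcription)", ratio
    if ratio <= 1.0 / min_fold_change:
        return "candidate modulator (decreased transcription)", ratio
    return "no call", ratio


if __name__ == "__main__":
    label, ratio = classify_test_compound([300, 320, 310], [100, 105, 95])
    print(label, round(ratio, 2))
```

In practice the cutoff would be replaced by whatever statistical criterion the assay design calls for; the sketch only shows the presence-versus-absence comparison itself.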

411. The method of Paragraph 410, wherein the level of transcription from said THAP
responsive promoter in the presence and absence of the test compound is determined by performing
an in vitro transcription reaction using a construct comprising said THAP responsive promoter and
a THAP-family polypeptide or a biologically active fragment thereof, wherein said THAP-family
polypeptide comprises an amino acid sequence having at least 30% amino acid identity to an amino
acid sequence of SEQ ID NO: 1.

412. The method of Paragraph 410, wherein the level of transcription from said THAP
responsive promoter in the presence and the absence of the test compound is determined by
measuring the level of transcription from a THAP responsive promoter in a cell expressing a
THAP-family polypeptide or a biologically active fragment thereof, wherein said THAP-family
polypeptide comprises an amino acid sequence having at least 30% amino acid identity to an amino
acid sequence of SEQ ID NO: 1.

413. The method of Paragraph 410, wherein said THAP-family polypeptide or
biologically active fragment thereof is selected from the group consisting of SEQ ID NOs: 1-114
and biologically active fragments thereof.

414. The method of Paragraph 410, wherein said THAP responsive promoter comprises
a THAP responsive element having a nucleotide sequence selected from the group consisting of
SEQ ID NOs: 140-159, SEQ ID NO: 306, and homologs thereof having at least 60% nucleotide
identity.

415. The method of Paragraph 411 or Paragraph 412, wherein the level of transcription
in the presence or absence of said test compound is measured in the presence of a chemokine. 

416. The method of Paragraph 415, wherein said chemokine is selected from the group
consisting of CCL family chemokines and CXCL family chemokines.

417. The method of Paragraph 416, wherein said CCL family chemokine is selected
from the group consisting of SLC, CCL19 and CCL5.

418. The method of Paragraph 416, wherein said CXCL family chemokine is selected
from the group consisting of CXCL11, CXCL10 and CXCL9.

419. The method of Paragraph 415, wherein the level of transcription in the presence or
absence of said test compound is measured in a cell which expresses a receptor for said chemokine.

420. The method of Paragraph 419, wherein said chemokine receptor is selected from
the group consisting of CCR1, CCR3, CCR5, CCR7, CCR11 and CXCR3.

421. The method of Paragraph 420, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

422. The method of Paragraph 419, wherein said THAP-family polypeptide comprises 
THAP1 or a biologically active fragment thereof and said cell expresses the CCR7 receptor. 

423. The method of Paragraph 422, wherein said chemokine is SLC. 

424. The method of Paragraph 419, wherein said THAP-family polypeptide comprises
THAP1 or a biologically active fragment thereof and said cell expresses the CXCR3 receptor.

425. The method of Paragraph 424, wherein said chemokine is CXCL9.




426. The method of Paragraph 412, wherein said THAP responsive promoter is in a 
gene endogenous to said cell. 

427. The method of Paragraph 412, wherein said THAP responsive promoter has been
introduced into said cell. 

428. The method of Paragraph 412, wherein said THAP responsive promoter does not
comprise a THAP responsive element.

429. The method of Paragraph 428, wherein said THAP responsive promoter is 
modulated by a product of a gene that is under the control of a promoter which comprises a THAP 
responsive element. 

430. A method for reducing the symptoms associated with a condition selected from the
group consisting of excessive or insufficient angiogenesis, inflammation, metastasis of a cancerous
tissue, excessive or insufficient apoptosis, cardiovascular disease and neurodegenerative diseases 
comprising modulating the interaction between a THAP-family polypeptide and a chemokine in an 
individual suffering from said condition. 

431. The method of Paragraph 430, wherein said THAP-family polypeptide is selected
from a group consisting of polypeptides having an amino acid sequence of SEQ ID NOs: 1-114.

432. The method of Paragraph 430, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

433. The method of Paragraph 430, wherein said chemokine is SLC and the condition is
inflammation.

434. The method of Paragraph 430, wherein said chemokine is SLC and the condition is 
excessive or insufficient angiogenesis. 

435. The method of Paragraph 430, wherein said chemokine is CXCL9 and the 
condition is inflammation. 

436. The method of Paragraph 430, wherein said chemokine is CXCL9 and the
condition is excessive or insufficient angiogenesis.

437. A method for reducing the symptoms associated with a condition resulting from the 
activity of a chemokine in an individual comprising modulating the interaction between said 
chemokine and a THAP-family polypeptide in said individual.

438. The method of Paragraph 437, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

439. The method of Paragraph 437, wherein said chemokine is SLC. 

440. The method of Paragraph 437, wherein said chemokine is CXCL9. 

441. The method of Paragraph 437, wherein said THAP-family polypeptide is THAP-1.

442. The method of Paragraph 437, wherein the condition is inflammation.

443. The method of Paragraph 437, wherein the condition is excessive or insufficient 
angiogenesis.

444. The method of Paragraph 437, wherein the interaction between said chemokine and
said THAP-family polypeptide is modulated by administering to an individual a therapeutically
effective amount of a THAP-type chemokine-binding agent.

445. The method of Paragraph 444, wherein said THAP-type chemokine-binding agent
comprises a therapeutically effective amount of a polypeptide selected from the group consisting of
a THAP1 polypeptide, a chemokine-binding domain of a THAP1 polypeptide, a THAP1
polypeptide oligomer, an oligomer comprising a THAP1 chemokine-binding domain, a THAP1
polypeptide-immunoglobulin fusion, a THAP1 chemokine-binding domain-immunoglobulin fusion
and polypeptide homologs having at least 30% amino acid identity to any one of the
aforementioned polypeptides.

446. The method of Paragraph 445, wherein said chemokine-binding domain is an SLC- 
binding domain. 

447. The method of Paragraph 445, wherein said chemokine-binding domain is a 
CXCL9-binding domain. 

448. A method of reducing the symptoms associated with a condition resulting from the
activity of a THAP-family polypeptide in an individual comprising modulating the extent of
transcriptional repression or activation of at least one THAP-family responsive promoter in said 
individual. 

449. The method of Paragraph 448, wherein said THAP-family polypeptide comprises 
an amino acid sequence selected from the group consisting of SEQ ID NOs: 1-114.

450. The method of Paragraph 448, wherein said THAP-family polypeptide comprises 
an amino acid sequence of SEQ ID NO: 3. 

451. The method of Paragraph 448, wherein said THAP responsive promoter comprises
a THAP responsive element. 

452. The method of Paragraph 448, wherein said THAP responsive promoter does not
comprise a THAP responsive element.

453. A method of reducing the symptoms associated with a condition resulting from the 
activity of a THAP-family polypeptide in an individual, said method comprising: 

diagnosing said individual with a condition resulting from the activity of a THAP-family
polypeptide; and

administering a compound which modulates the interaction between said THAP- 
family polypeptide and a chemokine to said individual. 

454. The method of Paragraph 453, wherein said THAP-family polypeptide is selected 
from a group consisting of polypeptides having an amino acid sequence of SEQ ID NOs: 1-114. 

455. The method of Paragraph 453, wherein said THAP-family polypeptide is THAP1.

456. The method of Paragraph 453, wherein said chemokine is selected from the group 
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

457. The method of Paragraph 453, wherein said chemokine is SLC.

458. The method of Paragraph 453, wherein said chemokine is CXCL9. 

459. A method of reducing the symptoms associated with a condition resulting from the 
activity of a THAP-family polypeptide in an individual comprising: 

diagnosing said individual with a condition resulting from the activity of a THAP-family
polypeptide; and

administering a chemokine or an analog thereof to said individual. 

460. The method of Paragraph 459, wherein said THAP-family polypeptide is selected 
from a group consisting of polypeptides having an amino acid sequence of SEQ ID NOs: 1-114. 

461. The method of Paragraph 459, wherein said THAP-family polypeptide is THAP1.

462. The method of Paragraph 459, wherein said chemokine is selected from the group 
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

463. The method of Paragraph 459, wherein said chemokine is SLC. 

464. The method of Paragraph 459, wherein said chemokine is CXCL9. 

465. A method of reducing the symptoms associated with transcriptional repression or
activation mediated by a THAP-family polypeptide in an individual comprising administering a
chemokine or an analog thereof to said individual. 

466. The method of Paragraph 465, wherein said THAP-family polypeptide is selected
from a group consisting of polypeptides having an amino acid sequence of SEQ ID NOs: 1-114.

467. The method of Paragraph 465, wherein said THAP-family polypeptide is THAP1.

468. The method of Paragraph 465, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

469. The method of Paragraph 465, wherein said chemokine is SLC. 

470. The method of Paragraph 465, wherein said chemokine is CXCL9. 

471. A method of reducing the symptoms associated with the activity of a chemokine in
an individual comprising modulating the extent to which said chemokine is transported to the 
nucleus of a cell in said individual. 

472. The method of Paragraph 471, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

473. The method of Paragraph 471, wherein said cell expresses a chemokine receptor
selected from the group consisting of CCR1, CCR3, CCR5, CCR7, CCR11 and CXCR3.

474. The method of Paragraph 473, wherein said chemokine is SLC and said chemokine 
receptor is CCR7. 

475. The method of Paragraph 473, wherein said chemokine is CXCL9 and said 
chemokine receptor is CXCR3.

476. The method of Paragraph 471, wherein the extent of transport of said chemokine
into a nucleus of a cell is modulated by contacting said chemokine with a THAP-type
chemokine-binding agent.

477. The method of Paragraph 476, wherein said THAP-type chemokine-binding agent
is selected from the group consisting of a THAP1 polypeptide, a chemokine-binding domain of a
THAP1 polypeptide, a THAP1 polypeptide oligomer, an oligomer comprising a THAP1 
chemokine-binding domain, a THAP1 polypeptide-immunoglobulin fusion, a THAP1 chemokine- 
binding domain-immunoglobulin fusion and polypeptide homologs having at least 30% amino acid 
identity to any one of the aforementioned polypeptides. 

478. The method of Paragraph 477, wherein said chemokine-binding domain is an SLC- 
binding domain. 

479. The method of Paragraph 477, wherein said chemokine-binding domain is a 
CXCL9-binding domain. 

480. A method for identifying a compound which modulates the transport of a 
chemokine into the nucleus comprising comparing the extent of said chemokine transport into the 
nucleus of cells in the presence and absence of a test compound. 

481. The method of Paragraph 480, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

482. The method of Paragraph 480, wherein said cell expresses a chemokine receptor 
selected from the group consisting of CCR1, CCR3, CCR5, CCR7, CCR11 and CXCR3.

483. The method of Paragraph 482, wherein said chemokine is SLC and said chemokine 
receptor is CCR7. 

484. The method of Paragraph 482, wherein said chemokine is CXCL9 and said 
chemokine receptor is CXCR3. 

485. The method of Paragraph 480, wherein the extent of transport of said chemokine 
into a nucleus of a cell is modulated by contacting said chemokine with a THAP-type chemokine- 
binding agent. 

486. The method of Paragraph 485, wherein said THAP-type chemokine-binding agent 
is selected from the group consisting of a THAP1 polypeptide, a chemokine-binding domain of a 
THAP1 polypeptide, a THAP1 polypeptide oligomer, an oligomer comprising a THAP1 
chemokine-binding domain, a THAP1 polypeptide-immunoglobulin fusion, a THAP1 chemokine- 
binding domain-immunoglobulin fusion and polypeptide homologs having at least 30% amino acid 
identity to any one of the aforementioned polypeptides. 

487. The method of Paragraph 486, wherein said chemokine-binding domain is an SLC- 
binding domain. 

488. The method of Paragraph 486, wherein said chemokine-binding domain is a 
CXCL9-binding domain.

489. The method of Paragraph 480, wherein transport of SLC into the nucleus is
measured by immunostaining.

490. A vector comprising a THAP responsive promoter operably linked to a nucleic acid 
encoding a detectable product. 

491. The vector of Paragraph 490, wherein said THAP responsive promoter comprises a
THAP responsive element.

492. The vector of Paragraph 490, wherein said THAP responsive promoter does not 
comprise a THAP responsive element. 

493. A genetically engineered cell comprising the vector of any one of Paragraphs 490-492.

494. An in vitro transcription reaction comprising a nucleic acid comprising a THAP 
responsive promoter, ribonucleotides and an RNA polymerase. 

495. The in vitro transcription reaction of Paragraph 494, wherein said THAP 
responsive promoter comprises a THAP responsive element. 

496. An isolated mutant THAP-family polypeptide that does not bind to a chemokine.

497. The isolated mutant THAP-family polypeptide of Paragraph 496, wherein said
chemokine is selected from the group consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and
CXCL9.

498. The isolated mutant THAP-family polypeptide of Paragraph 496, wherein said 
chemokine is SLC.

499. The isolated mutant THAP-family polypeptide of Paragraph 496, wherein said 
chemokine is CXCL9. 

500. The isolated mutant THAP-family polypeptide of Paragraph 496, wherein said 
THAP-family polypeptide is THAP1. 

501. The isolated mutant THAP-family polypeptide of Paragraph 500, wherein said
polypeptide comprises an amino acid sequence of SEQ ID NO: 3.

502. The isolated mutant THAP-family polypeptide of Paragraph 501, wherein said 
amino acid sequence comprises at least one point mutation. 

503. The methods of Paragraphs 291, 310, 328, 358, 363, 388, or 401 or the
compositions of Paragraphs 371 or 382, wherein said THAP-family polypeptide comprises an
amino acid sequence selected from the group consisting of SEQ ID NOs: 1-114.

504. The methods of Paragraphs 294, 342, 358 or 363 or the compositions of Paragraphs 
349 or 353, wherein said THAP responsive element comprises a nucleic acid having a nucleotide 
sequence selected from the group consisting of SEQ ID NOs: 140-159 and 306. 

505. The methods of Paragraphs 291, 310, or 328, wherein said gene comprises a
nucleic acid having a nucleotide sequence selected from the group consisting of SEQ ID NOs: 344,
346, 348, 350, 352, 354, 356, 358, 360, 362, 364, 366, 368, 370, 372, 374, 376, 378, 380, 382, 384,
386, 388, 390, 392, 394, 396, 398, 400, 402, 404, 406, 408, 410, 412, 414, 416, 418, 420, 422, 424,
426, 428, 430, 432, 434, 436, 438, 440, 442, 444, 446, 448, 450, 452, 454, 456, 458, 460, 462, 464, 
466, 468, 470, 472, 474, 476, 478, 480, 482, 484, 486, 488, 490, 492, 494, 496, 498, 500, 502, 504, 
506, 508, 510, 512, 514, 516, 530, 532, 534 and portions thereof. 

506. The methods of Paragraphs 291, 310 or 328, wherein said gene encodes a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NOs: 
343, 345, 347, 349, 351, 353, 355, 357, 359, 361, 363, 365, 367, 369, 371, 373, 375, 377, 379, 381, 
383, 385, 387, 389, 391, 393, 395, 397, 399, 401, 403, 405, 407, 409, 411, 413, 415, 417, 419, 421, 
423, 425, 427, 429, 431, 433, 435, 437, 439, 441, 443, 445, 447, 449, 451, 453, 455, 457, 459, 461, 
463, 465, 467, 469, 471, 473, 475, 477, 479, 481, 483, 485, 487, 489, 491, 493, 495, 497, 499, 501, 
503, 505, 507, 509, 511, 513, 515, 531, 533, 535 and portions thereof.

507. The methods of Paragraphs 310, 328, 363, 388 or 401 or the composition of 
Paragraph 371, wherein said chemokine has an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 119, 271, 273, 275, 277, 289 and 323.

508. A method of ameliorating symptoms associated with inflammation, said method 
comprising modulating the expression of a THAP responsive gene or a gene responsive to a 
THAP/chemokine complex. 

509. The method of Paragraph 508, wherein said gene expression is modulated by 
modulating the interaction between a nucleic acid and a THAP-family polypeptide or a biologically 
active fragment thereof, modulating the interaction between a nucleic acid and a THAP/chemokine 
complex or modulating the interaction between a chemokine and THAP-family polypeptide or a 
biologically active fragment thereof. 

510. A method of ameliorating symptoms associated with a condition resulting from 
excessive or insufficient angiogenesis, said method comprising modulating the expression of a 
THAP responsive gene or a gene responsive to a THAP/chemokine complex. 

511. The method of Paragraph 510, wherein said gene expression is modulated by 
modulating the interaction between a nucleic acid and a THAP-family polypeptide or a biologically 
active fragment thereof, modulating the interaction between a nucleic acid and a THAP/chemokine 
complex or modulating the interaction between a chemokine and THAP-family polypeptide or a 
biologically active fragment thereof. 

512. A method of ameliorating the symptoms associated with a condition resulting from 
the proliferation of a cancer cell, said method comprising modulating the expression of a THAP 
responsive gene or a gene responsive to a THAP/chemokine complex. 

513. The method of Paragraph 512, wherein said gene expression is modulated by 
modulating the interaction between a nucleic acid and a THAP-family polypeptide or a biologically 
active fragment thereof, modulating the interaction between a nucleic acid and a THAP/chemokine
complex or modulating the interaction between a chemokine and THAP-family polypeptide or a
biologically active fragment thereof. 

It will be appreciated that THAP compositions and methods of making and using them have been
described in other copending patent applications. These patent applications include US Patent
Application No. 10/317,832, entitled NOVEL DEATH ASSOCIATED PROTEINS AND THAP1
AND PAR4 PATHWAYS IN APOPTOSIS CONTROL, filed December 10, 2002, and US Patent
Application No. 10/601,072, entitled CHEMOKINE-BINDING PROTEIN AND METHODS OF
USE, filed June 19, 2003.

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1A illustrates an amino acid sequence alignment of human THAP1 (hTHAP1) (SEQ
ID NO: 3) and mouse THAP1 (mTHAP1) (SEQ ID NO: 99) orthologous polypeptides. Identical
amino acid residues are indicated with an asterisk.

Figure 1B depicts the primary structure of the human THAP1 polypeptide. Positions of the
THAP domain, the proline-rich region (PRO) and the bipartite nuclear localization sequence (NLS)
are indicated.

Figure 2 depicts the results of a Northern blot analysis of THAP1 mRNA expression in 12
human tissues. Each lane contains 2 µg of poly A+ RNA isolated from the indicated human tissues.
The blot was hybridized, under high-stringency conditions, with a 32P-labeled THAP1 cDNA
probe, and exposed at -70°C for 72 hours.

Figure 3A illustrates the interaction between THAP1 and PAR4 in a yeast two-hybrid
system. In particular, THAP1 binds to wild-type Par4 (Par4) and the leucine zipper-containing Par4
death domain (Par4DD) (amino acids 250-342 of PAR4) but not to a Par4 deletion mutant lacking the
death domain (PAR4Δ) (amino acids 1-276 of PAR4). A (+) indicates binding whereas a (-)
indicates lack of binding.

Figure 3B shows the binding of in vitro translated, 35S-methionine-labeled THAP1 to a
GST-Par4DD polypeptide fusion. Par4DD was expressed as a GST fusion protein then purified on
an affinity matrix of glutathione sepharose. GST served as negative control. The input represents
1/10 of the material used in the binding assay.

Figure 4A illustrates the interaction between PAR4 and several THAP1 deletion mutants
both in vitro and in vivo. Each THAP1 deletion mutant was tested for binding to either PAR4 or
PAR4DD in a yeast two-hybrid system (two hybrid bait), to PAR4DD in GST pull down assays (in
vitro) and to myc-Par4DD in primary human endothelial cells (in vivo). A (+) indicates binding
whereas a (-) indicates lack of binding.

Figure 4B shows the binding of several in vitro translated, 35S-methionine-labeled THAP1
deletion mutants to a GST-Par4DD polypeptide fusion. Par4DD was expressed as a GST fusion
protein then purified on an affinity matrix of glutathione sepharose. GST served as negative
control. The input represents 1/10 of the material used in the binding assay.

Figure 5A depicts an amino acid sequence alignment of the Par4 binding domain of human
THAP1 (SEQ ID NO: 117) and mouse THAP1 (SEQ ID NO: 116) orthologues with that of mouse
ZIP kinase (SEQ ID NO: 115), another Par4 binding partner. An arginine-rich consensus Par4 
binding site (SEQ ID NO: 15), derived from this alignment, is also indicated.

Figure 5B shows the primary structure of the THAP1 wild-type polypeptide and two
THAP1 mutants (THAP1 Δ(QRCRR) and THAP1 RR/AA). THAP1 Δ(QRCRR) is a deletion
mutant having a deletion of amino acids at positions 168-172 of THAP1 (SEQ ID NO: 3) whereas
THAP1 RR/AA is a mutant having the two arginines located at amino acid positions 171 and 172 of
THAP1 (SEQ ID NO: 3) replaced with alanines. Results obtained in the yeast two-hybrid system with
Par4 and Par4DD baits (two hybrid bait), in GST pull down assays with GST-Par4DD (in vitro) and
in the in vivo interaction test with myc-Par4DD in primary human endothelial cells (in vivo) are
summarized.

Figure 6A is a graph which compares apoptosis levels in cells transfected with GFP-APSK1,
GFP-Par4 or GFP-THAP1 expression vectors. Apoptosis was quantified by DAPI staining
of apoptotic nuclei, 24 h after serum withdrawal. Values are the means of three independent
experiments.

Figure 6B is a graph which compares apoptosis levels in cells transfected with GFP-APSK1 
or GFP-THAP1 expression vectors. Apoptosis was quantified by DAPI staining of apoptotic 
nuclei, 24 h after addition of TNFα. Values are the means of three independent experiments.

Figure 7A shows the binding of in vitro translated 35S-methionine-labeled THAP1 (wt) or
THAP1ΔTHAP (Δ) to a GST-Par4DD polypeptide fusion. Par4DD was expressed as a GST fusion
protein then purified on an affinity matrix of glutathione sepharose. GST served as negative
control. The input represents 1/10 of the material used in the binding assay.

Figure 7B is a graph which compares the proapoptotic activity of THAP1 with a THAP1
mutant having its THAP domain (amino acids 1-90 of SEQ ID NO: 3) deleted. The percentage of
apoptotic cells in mouse 3T3 fibroblasts overexpressing GFP-APSK1 (control), GFP-THAP1
(THAP1) or GFP-THAP1ΔTHAP (THAP1ΔTHAP) was determined by counting apoptotic nuclei
after DAPI staining. Values are the means of three independent experiments.

Figure 8 depicts the primary structure of twelve human THAP proteins. The THAP domain
(colored grey) is located at the amino-terminus of each of the twelve human THAP proteins. The
black box in THAP1, THAP2 and THAP3 indicates a nuclear localization sequence, rich in basic
residues, that is conserved in the three proteins. The number of amino acids in each THAP protein
is indicated; (*) indicates the protein is not full length.

Figure 9A depicts an amino acid sequence alignment of the THAP domain of human
THAP1 (hTHAP1, SEQ ID NO: 123) with the DNA binding domain of Drosophila melanogaster
P-element transposase (dmTransposase, SEQ ID NO: 124). Identical residues are boxed in black and
conserved residues in grey. A THAP domain consensus sequence (SEQ ID NO: 125) is also shown.

Figure 9B depicts an amino acid sequence alignment of the THAP domains of twelve
members of the human THAP family (hTHAP1, SEQ ID NO: 126; hTHAP2, SEQ ID NO: 131;
hTHAP3, SEQ ID NO: 127; hTHAP4, SEQ ID NO: 130; hTHAP5, SEQ ID NO: 128; hTHAP6,
SEQ ID NO: 135; hTHAP7, SEQ ID NO: 133; hTHAP8, SEQ ID NO: 129; hTHAP9, SEQ ID NO:
134; hTHAP10, SEQ ID NO: 137; hTHAP11, SEQ ID NO: 136; hTHAP0, SEQ ID NO: 132) with
the DNA binding domain of Drosophila melanogaster P-element transposase (dmTransposase, SEQ
ID NO: 138). Residues conserved among at least seven of the thirteen sequences are boxed. Black
boxes indicate identical residues whereas boxes shaded in grey show similar amino acids. Dashed
lines represent gaps introduced to align sequences. A THAP domain consensus sequence (SEQ ID
NO: 139) is also shown.
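
The boxing rule described for Figure 9B (a residue shared by at least seven of the thirteen aligned sequences) can be illustrated with a minimal sketch. The function name and the toy alignment below are hypothetical and are not taken from the sequence listing; the sketch counts identical residues only and does not model the "similar amino acid" shading.

```python
from collections import Counter

def conserved_columns(alignment, min_count):
    """Return 0-based column indices where a single residue occurs in at
    least min_count of the aligned sequences (gap characters '-' ignored)."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    hits = []
    for col in range(length):
        residues = [seq[col] for seq in alignment if seq[col] != "-"]
        if residues and Counter(residues).most_common(1)[0][1] >= min_count:
            hits.append(col)
    return hits

# Hypothetical toy alignment of four short sequences (illustration only).
toy_alignment = ["MVQSC", "MVKSC", "MAQ-C", "LVQSC"]
print(conserved_columns(toy_alignment, 3))
print(conserved_columns(toy_alignment, 4))
```

For the thirteen-sequence alignment of Figure 9B, the corresponding call would use `min_count=7`.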

Figure 9C depicts an amino acid sequence alignment of 95 distinct THAP domain
sequences, including hTHAP1 through hTHAP11 and hTHAP0 (SEQ ID NOs: 3-14, listed
sequentially beginning from the top), with 83 THAP domains from other species (SEQ ID NOs: 16-
98, listed sequentially beginning at the sequence denoted sTHAP1 and ending at the sequence
denoted ceNP_498747.1), which were identified by searching GenBank genomic and EST
databases with the human THAP sequences. Residues conserved among at least 50% of the
sequences are boxed. Black boxes indicate identical residues whereas boxes shaded in grey show
similar amino acids. Dashed lines represent gaps introduced to align sequences. The species are
indicated: Homo sapiens (h); Sus scrofa (s); Bos taurus (b); Mus musculus (m); Rattus norvegicus
(r); Gallus gallus (g); Xenopus laevis (x); Danio rerio (z); Oryzias latipes (o); Drosophila
melanogaster (dm); Anopheles gambiae (a); Bombyx mori (bm); Caenorhabditis elegans (ce). A
consensus sequence (SEQ ID NO: 2) is also shown. Amino acids underlined in the consensus
sequence are residues which are conserved in all 95 THAP sequences.

Figure 10A shows an amino acid sequence alignment of the human THAP1 (SEQ ID NO:
3), THAP2 (SEQ ID NO: 4) and THAP3 (SEQ ID NO: 5) protein sequences. Residues conserved 
among at least two of the three sequences are boxed. Black boxes indicate identical residues 
whereas boxes shaded in grey show similar amino acids. Dashed lines represent gaps introduced to 
align sequences. Regions corresponding to the THAP domain, the PAR4-binding domain, and the 
nuclear localization signal (NLS) are also indicated. 

Figure 10B shows the primary structure of human THAP1, THAP2 and THAP3 and results 
of two-hybrid interactions between each THAP protein and Par4 or Par4 death domain (Par4DD) in 
the yeast two hybrid system. 

Figure 10C shows the binding of in vitro translated, 35S-methionine-labeled THAP2 and
THAP3 to a GST-Par4DD polypeptide fusion. Par4DD was expressed as a GST fusion protein then
purified on an affinity matrix of glutathione sepharose. GST served as negative control. The input
represents 1/10 of the material used in the binding assay.

Figure 11A is a graph which compares apoptosis levels in cells transfected with GFP-APSK1,
GFP-THAP2 or GFP-THAP3 expression vectors. Apoptosis was quantified by DAPI
staining of apoptotic nuclei, 24 h after serum withdrawal. Values are the means of two independent
representative experiments.

Figure 11B is a graph which compares apoptosis levels in cells transfected with GFP-APSK1,
GFP-THAP2 or GFP-THAP3 expression vectors. Apoptosis was quantified by DAPI
staining of apoptotic nuclei, 24 h after addition of TNFα. Values are the means of two
independent representative experiments.

Figure 12 illustrates the results obtained by screening several different THAP1 mutants in a
yeast two-hybrid system with SLC/CCL21 bait. The primary structure of each THAP1 deletion
mutant that was tested is shown. The 70 carboxy-terminal residues of THAP1 (amino acids
143-213) are sufficient for binding to chemokine SLC/CCL21.

Figure 13 illustrates the interaction of THAP1 with wild type SLC/CCL21 and a
SLC/CCL21 mutant deleted of the basic carboxy-terminal extension (SLC/CCL21 ΔCOOH). The
interaction was analyzed both in a yeast two-hybrid system with THAP1 bait and in vitro using
GST-pull down assays with GST-THAP1.

Figure 14 depicts micrographs of primary human endothelial cells transfected with
the GFP-THAP0, 1, 2, 3, 6, 7, 8, 10, 11 (green fluorescence) expression constructs. To reveal the
nuclear localization of the human THAP proteins, nuclei were counterstained with DAPI (blue).
The bar equals 5 µm.

Figure 15A is a threading-derived structural alignment between the THAP domain of
human THAP1 (THAP1) (amino acids 1-81 of SEQ ID NO: 3) and the thyroid receptor β DNA
binding domain (NLLB) (SEQ ID NO: 121). The color coding is identical to that described in
Figure 15D.

Figure 15B shows a model of the three-dimensional structure of the THAP domain of
human THAP1 based on its homology with the crystallographic structure of thyroid receptor
β. The color coding is identical to that described in Figure 15D.

Figure 15C shows a model of the three-dimensional structure of the DNA-binding domain
of Drosophila transposase (DmTRP) based on its homology with the crystallographic structure of
the DNA-binding domain of the glucocorticoid receptor. The color coding is identical to that
described in Figure 15D.

Figure 15D is a threading-derived structural alignment between the Drosophila
melanogaster transposase DNA binding domain (DmTRP) (SEQ ID NO: 120) and the
glucocorticoid receptor DNA binding domain (GLUA) (SEQ ID NO: 122). In accordance with the
sequences and structures in Figures 15A-15C, the color coding is the following: brown indicates
residues in α-helices; indigo indicates residues in β-strands; red denotes the eight conserved Cys
residues in NLLB and GLUA or the three Cys residues common to THAP1 and DmTRP;
magenta indicates other Cys residues in THAP1 or DmTRP; cyan denotes the residues involved in
the hydrophobic interaction networks in THAP1 or DmTRP.

Figure 16A illustrates the results obtained by screening several different THAP1 mutants in 
a yeast two-hybrid system with THAP1 bait. The primary structure of each THAP1 deletion mutant 
that was tested is shown. A (+) indicates binding whereas a (-) indicates no binding. 

Figure 16B shows the binding of several in vitro translated, 35S-methionine-labeled THAP1
deletion mutants to a GST-THAP1 polypeptide fusion. Wild-type THAP1 was expressed as a GST 
fusion protein then purified on an affinity matrix of glutathione sepharose. GST served as negative 
control. The input represents 1/10 of the material used in the binding assay. 

Figure 17A is an agarose gel showing that two distinct THAP1 cDNA fragments were obtained
by RT-PCR. The two distinct THAP1 cDNAs were ~400 and 600 nucleotides in length.

Figure 17B shows that the 400 nucleotide fragment corresponds to an alternatively spliced 
isoform of human THAP1 cDNA, lacking exon 2 (nucleotides 273-468 of SEQ ID NO: 160).
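
As a rough consistency check on the fragment sizes reported for Figures 17A and 17B, the short sketch below computes the length of exon 2 from the stated coordinates and the size expected for the shorter RT-PCR product. It assumes that the coordinates are inclusive and that the ~600 nucleotide fragment contains all of exon 2; both assumptions are illustrative and are not stated in the text.

```python
# Coordinates of exon 2 as given for Figure 17B (assumed inclusive).
EXON2_START = 273
EXON2_END = 468

# Approximate size of the longer RT-PCR fragment from Figure 17A.
FULL_FRAGMENT_NT = 600

exon2_length = EXON2_END - EXON2_START + 1
expected_short_fragment = FULL_FRAGMENT_NT - exon2_length

print(f"exon 2 length: {exon2_length} nt")
print(f"expected exon 2-skipped fragment: ~{expected_short_fragment} nt")
```

Under these assumptions the exon 2-skipped product comes out close to the ~400 nucleotide band described for Figure 17A.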

Figure 17C is a Western blot which shows that the second isoform of human THAP1
(THAP1b) encodes a truncated THAP1 protein (THAP1 C3) lacking the amino-terminal THAP
domain.

Figure 18A shows a specific DNA binding site recognized by the THAP domain of human
THAP1. The THAP domain recognizes GGGCAA or TGGCAA DNA target sequences
preferentially organized as direct repeats with 5 nucleotide spacing (DR-5). The consensus
sequence is 5'-GGGCAAnnnnnTGGCAA-3' (SEQ ID NO: 149). The DR-5 consensus was
generated by examination of 9 nucleic acids bound by THAP1 (SEQ ID NO: 140-148, beginning
sequentially from the top).

Figure 18B shows a second specific DNA binding site recognized by the THAP domain of
human THAP1. The THAP domain recognizes everted repeats with 11 nucleotide spacing (ER-11)
having a consensus sequence 5'-TTGCCAnnnnnnnnnnnGGGCAA-3' (SEQ ID NO: 159). The
ER-11 consensus was generated by examination of 9 nucleic acids bound by THAP1 (SEQ ID NO:
150-158, beginning sequentially from the top).
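
The DR-5 and ER-11 consensus sequences described for Figures 18A and 18B can be expressed as simple patterns for scanning a nucleotide sequence. The following is a minimal sketch only: the function name and the example sequence are hypothetical, and the patterns match the strict consensus, whereas actual THAP1-bound sites may deviate from it.

```python
import re

# Strict consensus patterns from Figures 18A and 18B; "n" is any nucleotide.
# A lookahead is used so that overlapping sites are also reported.
DR5_PATTERN = re.compile(r"(?=(GGGCAA[ACGT]{5}TGGCAA))")
ER11_PATTERN = re.compile(r"(?=(TTGCCA[ACGT]{11}GGGCAA))")

def find_sites(sequence, pattern):
    """Return (0-based start, matched site) for each consensus match."""
    sequence = sequence.upper()
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence)]

# Hypothetical test sequence containing one DR-5 site and one ER-11 site.
example = ("AT" + "GGGCAA" + "ACGTA" + "TGGCAA" + "CC"
           + "TTGCCA" + "ACGTACGTACG" + "GGGCAA" + "TT")

print(find_sites(example, DR5_PATTERN))
print(find_sites(example, ER11_PATTERN))
```

Only the given strand is scanned here; a fuller search would also scan the reverse complement.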

Figure 19 shows that THAP1 interacts with both CC and CXC chemokines both in vivo in a
yeast two-hybrid system with THAP1 prey and in vitro using GST-pull down assays with
immobilized GST-THAP1. The cytokine IFNγ was used as a negative control. Results are
summarized as follows: +++ indicates strong binding; ++ indicates intermediate binding; +/-
indicates some binding; - indicates no binding; and ND indicates not determined.

Figure 20A is an SDS-polyacrylamide gel showing the relative amounts of chemokine and
cytokine used in immobilized GST-THAP1 binding assays.

Figure 20B is an SDS-polyacrylamide gel showing that neither the cytokine, IFNγ, nor any
of the chemokines bound to immobilized GST alone.

Figure 20C is an SDS-polyacrylamide gel showing that the chemokines CXCL10, CXCL9 and
CCL19, but not the cytokine IFNγ, bound to immobilized GST-THAP1 fusions.

Figure 21A shows the THAP1 protein fused to the Gal4 DNA-binding domain. This fusion
was used in transcriptional assays with a Gal4-UAS-luciferase reporter plasmid.

Figure 21B shows results of assays wherein the Gal4-UAS-luciferase reporter plasmid was
co-transfected into COS7 cells with increasing amounts of the Gal4 DNA-binding domain-THAP1
fusion expression vector. This analysis revealed that, compared to the Gal4 DNA-binding domain
alone, the Gal4 DNA-binding domain-THAP1 fusion represses transcriptional activity of the
luciferase reporter. The repression effect of THAP1 was similar to that observed with the
well-characterized transcriptional repressor Suv39H1.

Figure 22A shows the role of THAP1 as a nuclear receptor for chemokine SLC/CCL21. SLC
binds to a cytoplasmic receptor such as CCR7. Once internalized, SLC/CCL21 is transported to the
nucleus wherein it interacts with a THAP-family protein, such as THAP1. The bound SLC
complex can bind DNA at certain recognition sequences so as to modulate transcription.

Figure 22B shows the role of THAP1 as a nuclear receptor for chemokines SLC/CCL21 
and MIG/CXCL9. SLC and MIG bind to cell surface receptors such as CCR7 (polypeptide 
sequence SEQ ID NO: 302, nucleotide sequence SEQ ID NO: 303) and CXCR3 (polypeptide 
sequence SEQ ID NO: 304, nucleotide sequence SEQ ID NO: 305). Once internalized SLC and 
MIG are transported into the nucleus wherein they interact with a THAP-family protein, such as 
THAP1. The bound SLC/THAP1 and MIG/THAP1 complexes can bind DNA at certain 
recognition sequences so as to modulate transcription. 

Figure 23 shows the nucleotide sequence of the human Fucosyl transferase TVII promoter
(GenBank Accession Number AB012668, nucleotides 661-1080) (SEQ ID NO: 301). The
sequence corresponding to the mRNA is underlined and the initiation codon (ATG) is indicated in 
bold. The promoter contains one GGGCAA (antisense orientation) and six GGGCAG (3 sense and 
3 antisense orientations) THAP domain recognition elements, that are indicated in bold and 
underlined. 
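
The sense and antisense element counts given for Figure 23 can be reproduced in principle by searching a promoter sequence for each hexamer and for its reverse complement. The sketch below is illustrative only: the function names and the toy sequence are hypothetical, and the toy sequence is not the Fucosyl transferase promoter (SEQ ID NO: 301).

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase or lowercase DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def count_overlapping(seq, motif):
    """Count occurrences of motif in seq, allowing overlaps."""
    width = len(motif)
    return sum(1 for i in range(len(seq) - width + 1) if seq[i:i + width] == motif)

def count_motif_by_strand(seq, motif):
    """Count a motif in the sense orientation and, via its reverse
    complement, in the antisense orientation."""
    seq = seq.upper()
    return {
        "sense": count_overlapping(seq, motif),
        "antisense": count_overlapping(seq, reverse_complement(motif)),
    }

# Hypothetical toy sequence (illustration only).
toy_promoter = "AA" + "GGGCAG" + "TT" + "CTGCCC" + "AA" + "TTGCCC" + "TT"

print(count_motif_by_strand(toy_promoter, "GGGCAA"))
print(count_motif_by_strand(toy_promoter, "GGGCAG"))
```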

Figure 24 shows a consensus sequence (THAP-responsive element, THRE) (SEQ ID NO: 
306) recognized by the THAP domain of human THAP1. The THRE consensus was generated by 
examination of 18 nucleic acids bound by THAP1 (SEQ ID NO: 140-148 and 150-158). The 
THRE was validated experimentally by using oligonucleotides mutated at each position.

Figure 25A shows the results of an EMSA assay carried out with the purified THAP 
domain from human THAP1 and oligonucleotides bearing wild type or mutant THRE sequences 
(wt, AGTAAGGGCAA (SEQ ID NO: 307); 3mut1, AGTAATTTCAA (SEQ ID NO: 308); 3mut3,
AGTAAGGTCAA (SEQ ID NO: 309); 3mut4, AGTAAGTGCAA (SEQ ID NO: 310); 3mut14,
AGTAAGGGCCA (SEQ ID NO: 311); and 3mut5, AGTAAGGGAAA (SEQ ID NO: 312)).

Figure 25B shows the results of an EMSA assay carried out with the purified THAP
domain from human THAP1 and labelled oligonucleotides bearing the wild type THRE sequence
(5'-AGCAAGTAAGGGCAAACTACTTCAT-3') (SEQ ID NO: 313) in the presence of increasing
amounts of unlabelled THRE or non-specific competitor oligonucleotides (wild-type THRE,
5'-AGCAAGTAAGGGCAAACTACTTCAT-3' (SEQ ID NO: 313); non-specific competitor,
5'-AGCAAGTAATTTCAAACTACTTCAT-3' (SEQ ID NO: 314)).
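
To make explicit which positions are altered in the mutant THRE oligonucleotides listed for Figure 25A, the following minimal sketch compares each mutant with the wild-type sequence. The sequences are those given above, keyed by SEQ ID NO; the function name and the 1-based position numbering are illustrative assumptions rather than the numbering used in the figure.

```python
# Wild-type THRE oligonucleotide core (SEQ ID NO: 307).
WILD_TYPE = "AGTAAGGGCAA"

# Mutant oligonucleotides from the Figure 25A description, keyed by SEQ ID NO.
MUTANTS = {
    "SEQ ID NO: 308": "AGTAATTTCAA",
    "SEQ ID NO: 309": "AGTAAGGTCAA",
    "SEQ ID NO: 310": "AGTAAGTGCAA",
    "SEQ ID NO: 311": "AGTAAGGGCCA",
    "SEQ ID NO: 312": "AGTAAGGGAAA",
}

def mismatches(wild_type, mutant):
    """Return (1-based position, wild-type base, mutant base) for each difference."""
    if len(wild_type) != len(mutant):
        raise ValueError("sequences must have the same length")
    return [(i + 1, w, m)
            for i, (w, m) in enumerate(zip(wild_type, mutant)) if w != m]

for seq_id, sequence in MUTANTS.items():
    print(seq_id, mismatches(WILD_TYPE, sequence))
```

All substitutions fall within the GGGCAA core or the base immediately following it, consistent with the position-by-position validation of the THRE described for Figure 24.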

Figure 26A shows the results of an EMSA assay carried out with the purified THAP 
domain from human THAP1 and labelled oligonucleotides bearing the wild type THRE sequence 
(5'-AGCAAGTAAGGGCAAACTACTTCAT-3') (SEQ ID NO: 313) in the presence of metal 
chelators EDTA (5 mM or 50 mM) or 1,10-phenanthroline (vehicle alone, 1 mM or 5 mM). 

Figure 26B shows the results of an EMSA assay carried out with the purified THAP 
domain from human THAP1 and labelled oligonucleotides bearing the wild type THRE sequence 
(5'-AGCAAGTAAGGGCAAACTACTTCAT-3') (SEQ ID NO: 313) in the presence of metal 
chelator 1,10-phenanthroline (5 mM + Phe) and increasing amounts of Zn2+ (100 µM or 500 µM) or 
Mg2+ (100 µM or 500 µM). 

Figure 27A-D depicts micrographs of human HeLa cells transfected with the GFP-SLC (A) 
and GFP-MIG (green fluorescence) (C) expression constructs. To reveal the nuclear localization of 
the chemokines SLC and MIG, nuclei were counterstained with DAPI (blue) (B and D). 

Figure 28A-D depicts micrographs of human U2OS cells transfected with the secreted 
MIG (red fluorescence) expression construct (phMIG-Flag) in the presence of a CXCR3 expression 
vector (pEF-CXCR3) (28C) or a control vector (pEF-puro) (28A). To reveal the nuclear localization 
of chemokine MIG, nuclei were counterstained with DAPI (blue) (B and D). 

Figure 29A-C depicts micrographs of human U2OS cells transfected with the secreted 
MIG expression construct (phMIG-Flag) in the presence of a CXCR3 expression vector 
(pEF-CXCR3). MIG chemokine and CXCR3 expression were detected with anti-Flag (red fluorescence) 
(A) and anti-CXCR3 antibodies (green fluorescence) (B). To reveal the nuclear localization of 
chemokine MIG, nuclei were counterstained with DAPI (blue) (C). 

Figure 30 shows the nucleotide sequence of the human Survivin promoter (GenBank 
Accession Number NT_010641.14, nucleotides 10102350-10102668) (SEQ ID NO: 315). The 
sequence corresponding to the mRNA is underlined and the initiation codon (ATG) is indicated in 
italics (nt 210-212). The promoter contains a DR5-type THAP1 responsive element in the antisense 
orientation (GGGCAAnnnnnGGGCAC) (SEQ ID NO: 316), that is indicated in bold. 
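
A DR5-type element such as the one described for Figure 30 (two core motifs separated by five arbitrary nucleotides) can be located with a regular expression. The following is a minimal sketch under stated assumptions: `n` is taken to mean any of A, C, G or T, an antisense element is detected by searching the reverse complement, and the function name and toy sequence are hypothetical (the toy sequence is not the Survivin promoter of SEQ ID NO: 315).

```python
import re

# GGGCAA, five arbitrary nucleotides, then GGGCAC (pattern of SEQ ID NO: 316).
DR5_PATTERN = re.compile(r"GGGCAA[ACGT]{5}GGGCAC")

# Translation table used to complement A/C/G/T.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def find_dr5_elements(seq: str):
    """Return sorted (0-based start, orientation) pairs for DR5-type elements.

    Start coordinates always refer to the strand as written, including
    for elements found in the antisense orientation.
    """
    seq = seq.upper()
    hits = [(m.start(), "sense") for m in DR5_PATTERN.finditer(seq)]
    rev_comp = seq.translate(_COMPLEMENT)[::-1]
    for m in DR5_PATTERN.finditer(rev_comp):
        # Map the match on the reverse complement back to the given strand.
        hits.append((len(seq) - m.end(), "antisense"))
    return sorted(hits)


# Hypothetical toy sequence (assumption) carrying one antisense element.
toy = "ACGTGCCCAAAAATTGCCCT"
dr5_hits = find_dr5_elements(toy)
print(dr5_hits)
```

Searching the reverse complement rather than writing a second pattern keeps a single definition of the element, so a change to the spacer length only has to be made in one place.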

Figure 31 shows the nucleotide sequence of the human Ubiquitin specific protease 16 
promoter (EPD database, which can be accessed by typing in the address bar of a web browser 
"http://www.epd." immediately followed by "isb-sib.ch", Accession Number EP73421, 
nucleotides -499 to +100) (SEQ ID NO: 317). The sequence corresponding to the mRNA is 
underlined. The promoter contains, near the TATA box, a consensus THAP1 responsive element 
(THRE-11nt) in the antisense orientation (AGTGTGGGCAT) (SEQ ID NO: 318), that is indicated 
in bold and underlined. 

DETAILED DESCRIPTION OF THE INVENTION 
THAP and PAR4 biological pathways 
As mentioned above, the inventors have discovered a novel class of proteins involved in 
apoptosis. The inventors have also linked a member of this novel class to another (PAR4) 
apoptosis pathway, and further linked both of these pathways to PML-NBs. Moreover, the 
inventors have also linked both of these pathways to endothelial cells, providing a range of novel 
and potentially selective therapeutic treatments. In particular, it has been discovered that THAP1 
(THanatos (death)-Associated-Protein-1) localizes to PML-NBs. Furthermore, two-hybrid 
screening of an HEVEC cDNA library with the THAP1 bait led to the identification of a unique 
interacting partner, the pro-apoptotic protein PAR4. PAR4 is also found to accumulate in 
PML-NBs. Targeting of the THAP1/PAR4 complex to PML-NBs is mediated by PML. Similarly to 
PAR4, THAP1 has a pro-apoptotic activity. This activity involves a novel motif in the 
amino-terminal part, called the THAP domain. Together these results define a novel PML-NBs pathway for 
apoptosis that involves the THAP1/PAR4 pro-apoptotic complex. 
THAP-family members, and uses thereof 

The present invention includes polynucleotides encoding a family of pro-apoptotic 
polypeptides THAP-0 to THAP11, and uses thereof for the modulation of apoptosis-related and 
other THAP-mediated activities. Included is THAP1, which forms a complex with the 
pro-apoptotic protein PAR4 and localizes in discrete subnuclear domains known as PML nuclear 
bodies. Additionally, THAP-family polypeptides can be used to alter or otherwise modulate 
bioavailability of SLC/CCL21 (SLC). 

The present invention also includes a novel protein motif, the THAP domain, which is 
found in an 89 amino acid domain in the amino-terminal part of THAP1 and which is involved in 
THAP1 pro-apoptotic activity. The THAP domain defines a novel family of proteins, the 
THAP-family, with at least twelve distinct members in the human genome (THAP-0 to THAP11), which 
all contain a THAP domain in their amino-terminal part. The present invention thus pertains to 
nucleic acid molecules, including genomic and in particular the complete cDNA sequences, 
encoding members of the THAP-family, as well as to the corresponding translation products, 
nucleic acids encoding THAP domains, homologues thereof, nucleic acids encoding at least 10, 12, 
15, 20, 25, 30, 40, 50, 100, 150 or 200 consecutive amino acids, to the extent that said span is 
consistent with the particular SEQ ID NO, of a sequence selected from the group consisting of SEQ 
ID NOs: 160-175. 

THAP1 has been identified based on its expression in HEVs, specialized postcapillary 
venules found in lymphoid tissues and nonlymphoid tissues during chronic inflammatory diseases 
that support a high level of lymphocyte extravasation from the blood. An important element in the 
cloning of the THAP1 cDNA from HEVECs was the development of protocols for obtaining 
HEVECs RNA, since HEVECs are not capable of maintaining their phenotype outside of their 
native environment for more than a few hours. A protocol was developed where total RNA was 
obtained from HEVECs freshly purified from human tonsils. Highly purified HEVECs were 
obtained by a combination of mechanical and enzymatic procedures, immunomagnetic depletion 
and positive selection. Tonsils were minced finely with scissors on a steel screen, digested with 
collagenase/dispase enzyme mix and unwanted contaminating cells were then depleted using 
immunomagnetic depletion. HEVECs were then selected by immunomagnetic positive selection 
with magnetic beads conjugated to the HEV-specific antibody MECA-79. From these HEVECs that 
were 98% MECA-79-positive, 1 µg of total RNA was used to generate full length cDNAs for 
THAP1 cDNA cloning and RT-PCR analysis. 

As used herein, the terms "nucleic acids" and "nucleic acid molecule" are intended to include 
DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs of 
the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can be 
single-stranded or double-stranded, but preferably is double-stranded DNA. Throughout the present 
specification, the expression "nucleotide sequence" may be employed to designate indifferently a 
polynucleotide or a nucleic acid. More precisely, the expression "nucleotide sequence" 
encompasses the nucleic material itself and is thus not restricted to the sequence information (i.e. 
the succession of letters chosen among the four base letters) that biochemically characterizes a 
specific DNA or RNA molecule. Also used interchangeably herein are the terms "nucleic acids", 
"oligonucleotides", and "polynucleotides". 

An "isolated" nucleic acid molecule is one which is separated from other nucleic acid 
molecules which are present in the natural source of the nucleic acid. Preferably, an "isolated" 
nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., sequences located at 
the 5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
acid is derived. For example, in various embodiments, the isolated THAP-family nucleic acid 
molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide 
sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell from which 
the nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as a cDNA 
molecule, can be substantially free of other cellular material, or culture medium when produced by 
recombinant techniques, or substantially free of chemical precursors or other chemicals when 
chemically synthesized. A nucleic acid molecule of the present invention, e.g., a nucleic acid 
molecule having the nucleotide sequence of SEQ ID NOs: 160-175, or a portion thereof, can be 
isolated using standard molecular biology techniques and the sequence information provided herein. 
Using all or a portion of the nucleic acid sequence of SEQ ID NOs: 160-175 as a hybridization 
probe, THAP-family nucleic acid molecules can be isolated using standard hybridization and 
cloning techniques (e.g., as described in Sambrook, J., Fritsch, E. F., and Maniatis, T. Molecular 
Cloning. A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989). 

Moreover, a nucleic acid molecule encompassing all or a portion of, e.g., SEQ ID NOs: 160-175 
can be isolated by the polymerase chain reaction (PCR) using synthetic oligonucleotide 
primers designed based upon the sequence of SEQ ID NOs: 160-175. 

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR 
amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector 
and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to 
THAP-family nucleotide sequences can be prepared by standard synthetic techniques, e.g., using an 
automated DNA synthesizer. 

As used herein, the term "hybridizes to" is intended to describe conditions for moderate 
stringency or high stringency hybridization, preferably where the hybridization and washing 
conditions permit nucleotide sequences at least 60% homologous to each other to remain hybridized 
to each other. Preferably, the conditions are such that sequences at least about 70%, more preferably 
at least about 80%, even more preferably at least about 85%, 90%, 95% or 98% homologous to 
each other typically remain hybridized to each other. Stringent conditions are known to those 
skilled in the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, 
N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization conditions 
is as follows: the hybridization step is realized at 65°C in the presence of 6 x SSC buffer, 5 x 
Denhardt's solution, 0.5% SDS and 100 µg/ml of salmon sperm DNA. The hybridization step is 
followed by four washing steps: 

- two washings during 5 min, preferably at 65°C in a 2 x SSC and 0.1% SDS buffer; 

- one washing during 30 min, preferably at 65°C in a 2 x SSC and 0.1% SDS buffer; 

- one washing during 10 min, preferably at 65°C in a 0.1 x SSC and 0.1% SDS buffer, 
these hybridization conditions being suitable for a nucleic acid molecule of about 20 nucleotides in 
length. It will be appreciated that the hybridization conditions described above are to be adapted 
according to the length of the desired nucleic acid, following techniques well known to the one 
skilled in the art, for example according to the teachings disclosed in Hames B.D. and 
Higgins S.J. (1985) Nucleic Acid Hybridization: A Practical Approach. Hames and Higgins Ed., 
IRL Press, Oxford; and Current Protocols in Molecular Biology (supra). Preferably, an isolated 
nucleic acid molecule of the invention that hybridizes under stringent conditions to a sequence of 
SEQ ID NOs: 160-175 corresponds to a naturally-occurring nucleic acid molecule. As used herein, 
a "naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a 
nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 

To determine the percent homology of two amino acid sequences or of two nucleic acids, 
the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in the 
sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second amino 
or nucleic acid sequence and non-homologous sequences can be disregarded for comparison 
purposes). In a preferred embodiment, the length of a reference sequence aligned for comparison 
purposes is at least 30%, preferably at least 40%, more preferably at least 50%, even more 
preferably at least 60%, and even more preferably at least 70%, 80%, 90% or 95% of the length of 
the reference sequence (e.g., when aligning a second sequence to e.g. a THAP-1 amino acid 
sequence of SEQ ID NO: 3 having 213 amino acid residues, at least 50, preferably at least 100, 
more preferably at least 200, amino acid residues are aligned or when aligning a second sequence to 
the THAP-1 cDNA sequence of SEQ ID NO: 160 having 2173 nucleotides or nucleotides 202-835 
which encode the amino acids of the THAP1 protein, preferably at least 100, preferably at least 
200, more preferably at least 300, even more preferably at least 400, and even more preferably at 
least 500, 600, at least 700, at least 800, at least 900, at least 1000, at least 1200, at least 1400, at 
least 1600, at least 1800, or at least 2000 nucleotides are aligned). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When 
a position in the first sequence is occupied by the same amino acid residue or nucleotide as the 
corresponding position in the second sequence, then the molecules are homologous at that position 
(i.e., as used herein amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic acid 
"homology"). The percent homology between the two sequences is a function of the number of 
identical positions shared by the sequences (i.e., % homology = number (#) of identical 
positions/total number (#) of positions x 100). 
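
The percent homology calculation defined above (identical positions divided by total positions, multiplied by 100) can be illustrated as follows. This is a minimal sketch, assuming that the two sequences have already been aligned to equal length with "-" marking gaps, and that "total number of positions" means the number of alignment columns; the function name is hypothetical. The example sequences are the wild type and 3mut3 oligonucleotides listed for Figure 25A (SEQ ID NOs: 307 and 309).

```python
def percent_homology(aligned_a: str, aligned_b: str) -> float:
    """Percent homology of two pre-aligned sequences of equal length.

    Assumptions: '-' marks a gap, a gap never counts as an identical
    position, and the denominator is the number of alignment columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    identical = sum(
        1 for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * identical / len(aligned_a)


wild_type = "AGTAAGGGCAA"  # SEQ ID NO: 307
mutant = "AGTAAGGTCAA"     # SEQ ID NO: 309, one substituted position
print(round(percent_homology(wild_type, mutant), 1))
```

Here 10 of the 11 positions are identical, so the value is 10/11 x 100, or about 90.9%. Other conventions for the denominator exist (for example, the length of the shorter sequence), and the choice changes the result whenever gaps are present.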

The comparison of sequences and determination of percent homology between two 
sequences can be accomplished using a mathematical algorithm. A preferred, non-limiting example 
of a mathematical algorithm utilized for the comparison of sequences is the algorithm of Karlin and 
Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-68, modified as in Karlin and Altschul (1993) 
Proc. Natl. Acad. Sci. USA 90:5873-77. Such an algorithm is incorporated into the NBLAST and 
XBLAST programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST 
nucleotide searches can be performed with the NBLAST program, score=100, wordlength=12 to 
obtain nucleotide sequences homologous to THAP-family nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score=50, wordlength=3 to 
obtain amino acid sequences homologous to THAP-family protein molecules of the invention. To 
obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et al., (1997) Nucleic Acids Research 25(17):3389-3402. When utilizing BLAST and 
Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and 
NBLAST) can be used (see, www.ncbi.nlm.nih.gov). Another preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of Myers and 
Miller, CABIOS (1989). Such an algorithm is incorporated into the ALIGN program (version 2.0) 
which is part of the GCG sequence alignment software package. When utilizing the ALIGN 
program for comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty 
of 12, and a gap penalty of 4 can be used. 

The term "polypeptide" refers to a polymer of amino acids without regard to the length of 
the polymer; thus, peptides, oligopeptides, and proteins are included within the definition of 
polypeptide. This term also does not specify or exclude post-expression modifications of 
polypeptides, for example, polypeptides which include the covalent attachment of glycosyl groups, 
acetyl groups, phosphate groups, lipid groups and the like are expressly encompassed by the term 
polypeptide. Also included within the definition are polypeptides which contain one or more 
analogs of an amino acid (including, for example, non-naturally occurring amino acids, amino acids 
which only occur naturally in an unrelated biological system, modified amino acids from 
mammalian systems etc.), polypeptides with substituted linkages, as well as other modifications 
known in the art, both naturally occurring and non-naturally occurring. 

An "isolated" or "purified" protein or biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from the cell or tissue source from which the 
THAP family or THAP domain polypeptide, or a biologically active fragment or homologue thereof, 
is derived, or substantially free from chemical precursors or other chemicals when 
chemically synthesized. The language "substantially free of cellular material" includes preparations 
of a protein according to the invention (e.g. THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof) in which the protein is separated from cellular 
components of the cells from which it is isolated or recombinantly produced. In one embodiment, 
the language "substantially free of cellular material" includes preparations of a protein according to 
the invention having less than about 30% (by dry weight) of protein other than the THAP-family 
protein (also referred to herein as a "contaminating protein"), more preferably less than about 20% 
of protein other than the protein according to the invention, still more preferably less than about 
10% of protein other than the protein according to the invention, and most preferably less than 
about 5% of protein other than the protein according to the invention. When the protein according 
to the invention or biologically active portion thereof is recombinantly produced, it is also 
preferably substantially free of culture medium, i.e., culture medium represents less than about 
20%, more preferably less than about 10%, and most preferably less than about 5% of the volume 
of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of THAP family or THAP domain polypeptide, or a biologically active fragment or 
homologue thereof in which the protein is separated from chemical precursors or other chemicals 
which are involved in the synthesis of the protein. In one embodiment, the language "substantially 
free of chemical precursors or other chemicals" includes preparations of a THAP-family protein 
having less than about 30% (by dry weight) of chemical precursors or non-THAP-family chemicals, 
more preferably less than about 20% chemical precursors or non-THAP-family or THAP-domain 
chemicals, still more preferably less than about 10% chemical precursors or non-THAP-family or 
THAP-domain chemicals, and most preferably less than about 5% chemical precursors or non- 
THAP-family or THAP-domain chemicals. 

The term "recombinant polypeptide" is used herein to refer to polypeptides that have been 
artificially designed and which comprise at least two polypeptide sequences that are not found as 
contiguous polypeptide sequences in their initial natural environment, or to refer to polypeptides 
which have been expressed from a recombinant polynucleotide. 

Accordingly, another aspect of the invention pertains to anti-THAP-family or 
THAP-domain antibodies. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site which specifically binds (immunoreacts with) an antigen, such as a THAP 
family or THAP domain polypeptide, or a biologically active fragment or homologue thereof. 
Examples of immunologically active portions of immunoglobulin molecules include F(ab) and 
F(ab')2 fragments which can be generated by treating the antibody with an enzyme such as pepsin. 
The invention provides polyclonal and monoclonal antibodies that bind a THAP family or THAP 
domain polypeptide, or a biologically active fragment or homologue thereof. The term "monoclonal 
antibody" or "monoclonal antibody composition", as used herein, refers to a population of antibody 
molecules that contain only one species of an antigen binding site capable of immunoreacting with 
a particular epitope of a THAP-family or THAP domain polypeptide. A monoclonal antibody 
composition thus typically displays a single binding affinity for a particular THAP-family or THAP 
domain protein with which it immunoreacts. 
PAR4 

As mentioned above, Prostate apoptosis response-4 (PAR4) is a 38 kDa protein initially 
identified as the product of a gene specifically upregulated in prostate tumor cells undergoing 
apoptosis (for reviews see Rangnekar, 1998; Mattson et al., 1999). For the PAR4 nucleic acid and 
amino acid sequences, see Johnstone et al., Mol. Cell. Biol. 16 (12), 6945-6956 (1996); and 
Genbank accession no. U63809 (SEQ ID NO: 118). 

As used interchangeably herein, a "PAR4 activity", "biological activity of a PAR4" or 
"functional activity of a PAR4", refers to an activity exerted by a PAR4 protein, polypeptide or 
nucleic acid molecule as determined in vivo, or in vitro, according to standard techniques. In one 
embodiment, a PAR4 activity is a direct activity, such as an association with a PAR4-target 
molecule or most preferably apoptosis induction activity, or inhibition of cell proliferation or cell 
cycle. As used herein, a "target molecule" is a molecule with which a PAR4 protein binds or 
interacts in nature, such that PAR4-mediated function is achieved. An example of a PAR4 target 
molecule is a THAP-family protein such as THAP1 or THAP2, or a PML-NBs protein. A PAR4 
target molecule can be a PAR4 protein or polypeptide or a non-PAR4 molecule. For example, a 
PAR4 target molecule can be a non-PAR4 protein molecule. Alternatively, a PAR4 activity is an 
indirect activity, such as an activity mediated by interaction of the PAR4 protein with a PAR4 
target molecule such that the target molecule modulates a downstream cellular activity (e.g., 
interaction of a PAR4 molecule with a PAR4 target molecule can modulate the activity of that 
target molecule on an intracellular signaling pathway). 

Binding or interaction with a PAR4 target molecule (such as THAP1/PAR4 described 
herein) or with other targets can be detected for example using a two hybrid-based assay in yeast to 
find drugs that disrupt interaction of the PAR4 bait with the target (e.g. PAR4) prey, or an in vitro 
interaction assay with recombinant PAR4 and target proteins (e.g. THAP1 and PAR4). 
CHEMOKINES 

Chemokines are important in medicine because they regulate the movement and biological 
activities of leukocytes in many disease situations, including, but not limited to: allergic disorders, 
autoimmune diseases, ischemia/reperfusion injury, development of atherosclerotic plaques, cancer 
(including mobilization of hematopoietic stem cells for use in chemotherapy or myeloprotection 
during chemotherapy), chronic inflammatory disorders, chronic rejection of transplanted organs or 
tissue grafts, chronic myelogenous leukemia, and infection by HIV and other pathogens. 
Antagonists of chemokines or chemokine receptors may be of benefit in many of these diseases by 
reducing excessive inflammation and immune system responses. 

The activity of chemokines is tightly regulated to prevent excessive inflammation that can 
cause disease. Inhibition of chemokines by neutralizing antibodies in animal models (Sekido et al. 
(1993) Nature 365:654-657) or disruption of mouse chemokine genes (Cook et al. (1995) Science 
269:1583-1588) have confirmed a critical role of chemokines in vivo in inflammation mediated by 
virus infection or other processes. The production of soluble versions of cytokine receptors 
containing only the extracellular binding domain represents a physiological and therapeutic 
strategy to block the activity of some cytokines (Rose-John and Heinrich (1994) Biochem J. 
300:281-290; Heaney and Golde (1996) Blood 87:847-857). However, the seven transmembrane 
domain structure of chemokine receptors makes the construction of soluble, inhibitory receptors 
difficult, and thus antagonists based on mutated chemokines, blocking peptides or antibodies are 
under evaluation as chemokine inhibitors (D'Souza & Harden (1996) Nature Medicine 2:1293-1300; 
Howard et al. (1996) Trends Biotech. 14:46-51; Baggiolini (1998) Nature 392:565-568; 
Rollins (1997) Blood 90:909-928). 

Several viral chemokine binding proteins have been described that may be useful as soluble 
chemokine inhibitors. Soluble chemokine-binding proteins have been previously detected in 
poxviruses. Firstly, the myxoma virus T7 protein, which was first identified as a soluble IFN-γ 
receptor (Upton et al. (1992) Science 258:1369-1372), binds to a range of chemokines through the 
heparin-binding domain and affects the infiltration of cells into infected tissue (Lalani et al. (1997) J 
Virol 71:4356-4363). The protein is described in U.S. Patent No. 5,834,419 and International 
Publication No. WO 96/33730, and is designated CBP-1. Secondly, it was demonstrated that VV 
strain Lister expresses a soluble 35 kDa protein that is secreted from infected cells and which binds 
many CC chemokines (Graham et al. (1997) Virology 229:12-24; Smith et al. (1997) Virology 
236:316-327; Alcami et al. (1998) J Immunol 160:624-633), but not CXC chemokines, through a 
domain distinct from the heparin-binding domain (Smith et al. (1997) Virology 236:316-327; 
Alcami et al. (1998) J Immunol 160:624-633). This protein has been called vCKBP (Alcami et al. 
(1998) J Immunol 160:624-633). The protein is also described in U.S. Patent No. 5,871,740 and 
International Publication No. WO 97/11714. One main disadvantage to the use of these viral 
proteins in a clinical setting is that antigenicity severely limits their indications. As such, there is a 
strong interest in the identification of cellular chemokine-binding proteins. 

Some aspects of the present invention relate to cellular polypeptides and homologs thereof, 
portions of cellular polypeptides and homologs thereof as well as modified cellular polypeptides 
and homologs thereof that bind to one or more chemokines. In some embodiments of the present 
invention such cellular polypeptides are THAP-family polypeptides, including THAP-1, 
chemokine-binding domains of THAP-family polypeptides (including a chemokine-binding domain 
of THAP-1), THAP-family polypeptide or THAP-family chemokine-binding domain fusions to 
immunoglobulin Fc (including THAP-1 fused to an immunoglobulin Fc region or a chemokine- 
binding domain of THAP-1 fused to an immunoglobulin Fc region), oligomers of THAP-family 
polypeptides or THAP-family chemokine-binding domains (including THAP-1 oligomers or 
oligomers of a chemokine-binding domain of THAP-1), or homologs of any of the above-listed 
compositions. Throughout this disclosure, the above-listed polypeptides are referred to as THAP- 
type chemokine-binding agents. Each of these THAP-type chemokine-binding agents is described 
in detail below. 

SLC/CCL21 (SLC) 

Biological Roles of SLC 

The signals which mediate T-cell infiltration during T-cell auto-immune diseases are poorly 
understood. SLC/CCL21 (SEQ ID NO: 119) is highly potent and highly specific for attracting 
T-cell migration. It was initially thought to be expressed only in secondary lymphoid organs, directing 
naive T-cells to areas of antigen presentation. However, using immunohistology it was found that 
expression of CCL21 was highly induced in endothelial cells of T-cell auto-immune infiltrative skin 
diseases (Christopherson et al. (2002) Blood electronic publication prior to printed publication). No 
other T-cell chemokine was consistently induced in these T cell skin diseases. The receptor for 
CCL21, CCR7, was also found to be highly expressed on the infiltrating T-cells, the majority of 
which expressed the memory CD45RO phenotype. Inflamed venule endothelial cells expressing 
SLC/CCL21 in T cell infiltrative autoimmune skin diseases may therefore play a key role in the 
regulation of T-cell migration into these tissues. 

There are a number of other autoimmune diseases where induced expression of 
SLC/CCL21 in endothelial cells may cause abnormal recruitment of T-cells from the circulation to 
sites of pathologic inflammation. For instance, chemokine SLC/CCL21 appears to be important for 
aberrant T-cell infiltration in experimental autoimmune encephalomyelitis (EAE), an animal model 
for multiple sclerosis (Alt et al. (2002) Eur J Immunol 32:2133-44). Migration of autoaggressive T 
cells across the blood-brain barrier (BBB) is critically involved in the initiation of EAE. The direct 
involvement of chemokines in this process was suggested by the observation that G-protein- 
mediated signaling is required to promote adhesion strengthening of encephalitogenic T cells on 
BBB endothelium in vivo. A search for chemokines present at the BBB, by in situ hybridizations 
and immunohistochemistry revealed expression of the lymphoid chemokines CCL19/ELC and 
CCL21/SLC in venules surrounded by inflammatory cells (Alt et al. (2002) Eur J Immunol 
32:2133-44). Their expression was paralleled by the presence of their common receptor CCR7 in 
inflammatory cells in brain and spinal cord sections of mice afflicted with EAE. Encephalitogenic T 
cells showed surface expression of CCR7 and specifically chemotaxed towards both CCL19 and 
CCL21 in a concentration dependent and pertussis toxin-sensitive manner comparable to naive 
lymphocytes in vitro. Binding assays on frozen sections of EAE brains demonstrated a functional 
involvement of CCL19 and CCL21 in adhesion strengthening of encephalitogenic T lymphocytes to 
inflamed venules in the brain (Alt et al. (2002) Eur J Immunol 32:2133-44). Taken together these 
data suggested that the lymphoid chemokines CCL19 and CCL21 besides regulating lymphocyte 
homing to secondary lymphoid tissue are involved in T lymphocyte migration into the 
immunoprivileged central nervous system during immunosurveillance and chronic inflammation. 

Other diseases where induced expression of SLC/CCL21 in venular endothelial cells has 
been observed include rheumatoid arthritis (Page et al. (2002) J Immunol 168:5333-5341) and 
experimental autoimmune diabetes (Hjelmstrom et al. (2000) Am J Path 156:1133-1138). 
Therefore, chemokine SLC/CCL21 may be an important pharmacological target in T-cell auto- 
immune diseases. Inhibitors of SLC/CCL21 may be effective agents at treating these T cell 
infiltrative diseases by interfering with the abnormal recruitment of T cells, from the circulation to 
sites of pathologic inflammation, by endothelial cells expressing SLC/CCL21. The reduction in T 
cell migration into involved tissue would reduce the T-cell inflicted damage seen in those diseases. 

Ectopic lymphoid tissue formation is a feature of many chronic inflammatory diseases, 
including rheumatoid arthritis, inflammatory bowel diseases (Crohn's disease, ulcerative colitis), 
autoimmune diabetes, chronic inflammatory skin diseases (lichen planus, psoriasis, ...), 
Hashimoto's thyroiditis, Sjogren's syndrome, gastric lymphomas and chronic inflammatory liver 
disease (Girard and Springer (1995) Immunol today 16:449-457; Takemura et al. (2001) J Immunol 
167:1072-1080; Grant et al. (2002) Am J Pathol 2002 160:1445-55; Yoneyama et al. (2001) J Exp 
Med 193:35-49). 

Ectopic expression of SLC/CCL21 has been shown to induce lymphoid neogenesis, both in 
mice and in human inflammatory diseases. In mice, transgenic expression of SLC/CCL21 in the 
pancreas (Fan et al. (2000) J Immunol 164:3955-3959; Chen et al. (2002) J Immunol 168:1001-1008; 
Luther et al. (2002) J Immunol 169:424-433), a non-lymphoid tissue, has been found to be 
sufficient for the development and organization of ectopic lymphoid tissue through differential 
recruitment of T and B lymphocytes and induction of high endothelial venules, specialized blood 
vessels for lymphocyte migration (Girard and Springer (1995) Immunol today 16:449-457). In 
humans, hepatic expression of SLC/CCL21 has been shown to promote the development of high 
endothelial venules and portal-associated lymphoid tissue in chronic inflammatory liver disease 
(Grant et al. (2002) Am J Pathol 2002 160:1445-55; Yoneyama et al. (2001) J Exp Med 193:35-49). 
The chronic inflammatory liver disease primary sclerosing cholangitis (PSC) is associated with 
portal inflammation and the development of neolymphoid tissue in the liver. More than 70% of 
patients with PSC have a history of inflammatory bowel disease and strong induction of 
SLC/CCL21 on CD34(+) vascular endothelium in portal-associated lymphoid tissue in PSC has 
been reported (Grant et al. (2002) Am J Pathol 2002 160:1445-55). In contrast, CCL21 is absent 
from LYVE-1(+) lymphatic vessel endothelium. Intrahepatic lymphocytes in PSC include a 
population of CCR7(+) T cells only half of which express CD45RA and which respond to CCL21 
in migration assays. The expression of CCL21 in association with mucosal addressin cell adhesion 
molecule-1 in portal tracts in PSC may promote the recruitment and retention of CCR7(+) mucosal 
lymphocytes leading to the establishment of chronic portal inflammation and the expanded 
portal-associated lymphoid tissue. These findings are supported by studies in an animal model of 
chronic hepatic inflammation, that have shown that anti-SLC/CCL21 antibodies prevent the 
development of high endothelial venules and portal-associated lymphoid tissue (Yoneyama et al. 
(2001) J Exp Med 193:35-49). 

Induction of chemokine SLC/CCL21 at a site of inflammation could convert the lesion 
from an acute to a chronic state with corresponding development of ectopic lymphoid tissue. 
Blocking chemokine SLC/CCL21 activity in chronic inflammatory diseases may therefore have 
significant therapeutic value. 

Chemokine SLC/CCL21 and regulation of cell proliferation and cell death 

In addition to its key role in chemotaxis and cell migration, chemokine SLC/CCL21 has 
also been shown to regulate cell proliferation and cell death. For instance, the proliferation rate of 
normal hematopoietic or leukemia progenitor cells was reduced upon stimulation with SLC/CCL21 
(Hromas et al. (1997) J Immunol 159:2554-2558; Hromas et al. (2000) Blood 95:1506-1508). In 
contrast, SLC/CCL21 stimulated proliferation of mesangial cells from human kidney (Banas et al. 
(2002) J Immunol 168:4301-4307), suggesting differential action of this chemokine on 
hematopoietic or non-hematopoietic cells. 

SLC/CCL21 has also been shown to inhibit cell death. It was found that pretreatment with 
small doses of SLC/CCL21 prevented the death of normal murine marrow progenitors from the 
toxic effects of the chemotherapeutic agent Ara-C (Hromas et al. (2002) Cancer Chemother 
Pharmacol 50:163-166). In addition, SLC/CCL21 was found to act as an anti-apoptotic factor that 
promotes mesangial cell survival in cell death assays. It is not known whether SLC/CCL21 effects 
on cell proliferation and cell death require the CCR7 chemokine receptor or are mediated by other 
cellular receptors. 

Chemokine SLC/CCL21 and regulation of endothelial cell differentiation (induction of the 
specialized high endothelial venule phenotype) 

Chemokine SLC/CCL21 has been shown to act on endothelial cells in two ways: 1) It 
exhibits angiostatic (anti-angiogenic) properties and efficiently blocks blood vessel formation in vivo 
(Soto et al. (1998) PNAS 95:8205-8210; Vicari et al. (2000) 165:1992-2000); 2) It induces 
differentiation of 'flat' endothelial cells into high endothelial venules (HEV), specialized blood 
vessels for lymphocyte migration (Girard and Springer (1995) Immunol today 16:449-457). For 
instance, in transgenic mice, ectopic expression of SLC/CCL21 in the pancreas (Fan et al. (2000) J 
Immunol 164:3955-3959; Chen et al. (2002) J Immunol 168:1001-1008; Luther et al. (2002) J 
Immunol 169:424-433), has been found to be sufficient for induction of high endothelial venules 
and associated lymphoid tissue. In humans, hepatic expression of SLC/CCL21 has been shown to 
promote the development of high endothelial venules and portal-associated lymphoid tissue in 
chronic inflammatory liver disease (Grant et al. (2002) Am J Pathol 2002 160:1445-55; Yoneyama 
et al. (2001) J Exp Med 193:35-49). A critical role for SLC/CCL21 in induction of high endothelial 
venules is supported by studies in an animal model of chronic hepatic inflammation, that have 
shown that anti-SLC/CCL21 antibodies prevent the development of high endothelial venules and 
portal-associated lymphoid tissue (Yoneyama et al. (2001) J Exp Med 193:35-49). 

Induction of chemokine SLC/CCL21 at a site of inflammation might convert the lesion 
from an acute to a chronic state with corresponding development of high endothelial venules and 
ectopic lymphoid tissue. Blocking chemokine SLC/CCL21 effects on endothelial cells in chronic 
inflammatory diseases may therefore have significant therapeutic value. Since the CCR7 chemokine 
receptor is not expressed in endothelial cells, the effects of SLC/CCL21 on endothelial cells are 
likely to be mediated by other mechanisms. There is therefore a strong interest in the identification 
of other cellular receptors for SLC/CCL21. 

CHEMOKINES MIG/CXCL9, IP10/CXCL10, I-TAC/CXCL11 

Roles of chemokines MIG/CXCL9, IP10/CXCL10, I-TAC/CXCL11 in leukocyte chemotaxis 

Chemokines monokine induced by IFN-γ (Mig/CXCL9), IFN-induced protein of 10 kDa 
(IP-10/CXCL10) and IFN-inducible T cell α-chemoattractant (I-TAC/CXCL11) are three CXC 
chemokines more closely related to each other than to any other chemokine with an amino acid 
sequence identity of about 40% (Luster and Ravetch (1987) J Exp Med 166:1084; Cole et al. (1998) 
J Exp Med 187:2009-2021; Farber (1993) BBRC 192:223-230). They share a number of features: 
i) they lack the glutamic acid - leucine - arginine (ELR) motif preceding the first conserved 
cysteine and are therefore inactive towards neutrophils; ii) they share an individual branch of the 
phylogenetic tree, have a similar gene structure, and are clustered on chromosome 4q21.2 
(O'Donovan et al. (1999) Cytogenet Cell Genet 84:39-42). Among the CXC members, CXCL9, 
CXCL10 and CXCL11 are unique in that they are all induced by IFN-γ in a wide variety of cell 
types, including endothelial cells (Luster and Ravetch (1987) J Exp Med 166:1084; Farber (1997) J 
Leuk Biol 61:246-257; Mach et al. (1999) J Clin Invest 104:1041; Cole et al. (1998) J Exp Med 
187:2009-2021; Loetscher et al. (1998) Eur J Immunol 28:3696-3705), and act through a unique 
chemokine receptor, CXCR3. CXCR3 is expressed on activated T cells, preferentially of the Th1 
phenotype, NK cells, and on a significant fraction (~20-40%) of circulating CD4+ and CD8+ T cells 
(Loetscher et al. (1996) J Exp Med 184:963-969; Loetscher et al. (1998) Eur J Immunol 
28:3696-3705). The majority of peripheral CXCR3+ T cells express CD45RO (memory T cells) as 
well as β1 integrins (Qin et al. (1998) J Clin Invest 101:746) which are implicated in the binding of 
lymphocytes to endothelial cells and the extracellular matrix. In addition, CXCR3 has been reported 
to be expressed on plasmacytoid dendritic cells, leukemic B cells, eosinophils, and dividing 
microvascular endothelial cells (Cella et al. (1999) Nat Med 5:919; Romagnani et al. (2001) J Clin 
Invest 107:53). 

CXCR3+ T cells accumulate at sites of Th1-type inflammation where IFN-γ is highly 
expressed, including atherosclerosis, sarcoidosis, inflammatory bowel diseases, and rheumatoid 
arthritis (Qin et al. (1998) J Clin Invest 101:746; Mach et al. (1999) J Clin Invest 104:1041). IP-10 
has been found to be highly expressed in a number of Th1-type inflammatory diseases, including 
psoriasis, tuberculoid leprosy, sarcoidosis, and viral meningitis. In addition, IFN-γ-stimulated 
endothelial cells and endothelium from atherosclerotic lesions are a rich source of IP-10, Mig, and 
I-TAC, suggesting an important role for these chemokines in the transendothelial migration and local 
retention of CXCR3+ T cells found in atherosclerotic lesions (Mach et al. (1999) J Clin Invest 
104:1041). In support of this hypothesis, IP-10 and Mig induce the rapid adhesion of IL-2-activated 
T cells to immobilized VCAM-1 and ICAM-1, and IP-10, Mig, and I-TAC are potent chemotactic 
agents for activated T cells. 

Roles of chemokines MIG/CXCL9, IP10/CXCL10, I-TAC/CXCL11 in angiogenesis 

CXC chemokines MIG/CXCL9, IP10/CXCL10, I-TAC/CXCL11 exhibit the selective 
property to inhibit angiogenesis (Belperio et al. (2000) J Leukoc Biol 68:1-8). These angiostatic 
chemokines induce injury to established tumor-associated vasculature and promote extensive tumor 
necrosis (Arenberg et al. (1996) J Exp Med 184:981-992; Sgadari et al. (1997) Blood 89:2635- 
2643) and thus have been proposed as useful therapeutic agents in cancer. 

The angiostatic effects of CXCL9, CXCL10, and CXCL11 on human microvascular 
endothelial cells (HMVEC) are mediated by CXCR3 (Romagnani et al. (2001) J Clin Invest 
107:53-63; Lasagni et al. (2003) J Exp Med 197:1537-1549). A distinct, previously unrecognized 
alternatively spliced variant of CXCR3 named CXCR3-B, has recently been shown to mediate the 
angiostatic activity of CXCR3 ligands (Lasagni et al. (2003) J Exp Med 197:1537-1549). Human 
microvascular endothelial cell line-1 (HMEC-1), transfected with either the known CXCR3 
(renamed CXCR3-A) or CXCR3-B, bound CXCL9, CXCL10, and CXCL11. Overexpression of 
CXCR3-A induced an increase of survival, whereas overexpression of CXCR3-B dramatically 
reduced DNA synthesis and up-regulated apoptotic HMEC-1 death through activation of distinct 
signal transduction pathways. Unlike CXCR3-A, CXCR3-B was not found to be coupled to 
G-proteins. Remarkably, primary cultures of human microvascular endothelial cells, whose growth is 
inhibited by CXCL9, CXCL10 and CXCL11, expressed CXCR3-B, but not CXCR3-A. Finally, 
monoclonal antibodies raised to selectively recognize CXCR3-B reacted with endothelial cells from 
neoplastic tissues, providing evidence that CXCR3-B is also expressed in vivo and may account for 
the angiostatic effects of CXC chemokines. 

Chemokine MIG/CXCL9 and chemokine receptor CXCR3 and regulation of endothelial cell 
differentiation (induction of the specialized high endothelial venule phenotype) 

During inflammation, chemokine MIG/CXCL9 has been shown to be induced in high 
endothelial venules (HEV, Girard and Springer (1995) Immunol today 16:449-457), specialized 
blood vessels for lymphocyte migration (Janatpour et al. (2001) J Exp Med 193:1375-1384). 
Interestingly, in many human chronic inflammatory diseases, including Crohn's disease, Graves' 
disease and glomerulonephritis, CXCR3 receptor has also been found to be upregulated on 
endothelial cells during transformation of small blood vessels into HEV-like vessels (Romagnani et 
al. (2001) J Clin Invest 107:53-63). 

Induction of chemokine MIG/CXCL9 and its receptor CXCR3 on endothelial cells at a site 
of inflammation might convert the lesion from an acute to a chronic state with corresponding 
development of high endothelial venules and ectopic lymphoid tissue. Blocking chemokine 
MIG/CXCL9 effects on CXCR3+ endothelial cells in chronic inflammatory diseases may therefore 
have significant therapeutic value. 

Role of chemokines CXCL9/Mig and CXCL10/IP-10 in vascular pericyte proliferation 

CXCL9 and CXCL10 have been implicated in the pathogenesis of proliferative 
glomerulonephritis, a common renal disease characterized by glomerular hypercellularity, because 
they induce increased survival and growth of human mesangial cells (HMC) through their receptor 
CXCR3 (Romagnani et al. (1999) J Am Soc Nephrol 10:2518-2526; Romagnani et al. (2002) J Am 
Soc Nephrol 13:53-64). High levels of expression of mRNA and protein for CXCL10 and CXCL9 
were observed, by using in situ hybridization and immunohistochemical analyses, in kidney biopsy 
specimens from patients with glomerulonephritis (GN), particularly those with 
membranoproliferative or crescentic GN, but not in normal kidneys (Romagnani et al. (2002) J Am 
Soc Nephrol 13:53-64). Double-immunostaining or combined in situ hybridization and 
immunohistochemical analyses for IP-10, Mig, and proliferating cell nuclear antigen (PCNA) or 
α-smooth muscle actin (α-SMA) revealed that IP-10 and Mig production by resident glomerular cells 
was a selective property of glomeruli in which mesangial cells demonstrated active proliferation. 
IP-10 and Mig mRNA and protein were also expressed by primary cultures of human mesangial cells. 
Moreover, high levels of CXCR3 were found in mesangial cells from patients with proliferative 
GN, and CXCR3 was also observed on the surface of cultured human mesangial cells (HMC) and 
seemed to mediate intracellular Ca2+ influx, cell chemotaxis and cell proliferation induced by 
CXCL9 and CXCL10 (Romagnani et al. (1999) J Am Soc Nephrol 10:2518-2526). Therefore, 
among patients with proliferative GN, the chemokines IP-10 and/or Mig not only may be 
responsible for the attraction of infiltrating mononuclear cells into the inflamed tissue but also may 
directly stimulate the proliferation of mesangial cells. 

As used herein, "SLC/CCL21" and "SLC" are synonymous. 
As used herein, "ELC/CCL19", "CCL19" and "ELC" are synonymous. 
As used herein, "Rantes/CCL5", "CCL5" and "Rantes" are synonymous. 
As used herein, "MIG/CXCL9", "CXCL9" and "MIG" are synonymous. 
As used herein, "IP10/CXCL10", "CXCL10" and "IP10" are synonymous. 
As used herein, "I-TAC/CXCL11", "CXCL11" and "I-TAC" are synonymous. 
As used herein, in some embodiments of the present invention, "CXCR3" includes CXCR3 
splice variant B (polypeptide encoding CXCR3 splice variant B, SEQ ID NO: 517; cDNA encoding 
CXCR3 splice variant B, Genbank Accession Number: AX805367, SEQ ID NO: 518). 

THAP-family members comprising a THAP Domain 

Based on the elucidation of a biological activity of the THAP1 protein in apoptosis as 
described herein, the inventors have identified and further characterized a novel protein motif, 
referred to herein as THAP domain. The THAP domain has been identified by the present 
inventors in several other polypeptides, as further described herein. Knowledge of the structure and 
function of the THAP domain allows the performing of screening assays that can be used in the 
preparation or screening of medicaments capable of modulating interaction with a THAP-family 
target molecule, modulating cell cycle and cell proliferation, inducing apoptosis or enhancing or 
participating in the induction of apoptosis. 

As used interchangeably herein, a THAP-family protein or polypeptide, or a THAP-family 
member refers to any polypeptide having a THAP domain as described herein. As mentioned, the 
inventors have provided several specific THAP-family members. Thus, as referred to herein, a 
THAP-family protein or polypeptide, or a THAP-family member, includes but is not limited to a 
THAP-0, THAP1, THAP-2, THAP-3, THAP-4, THAP-5, THAP-6, THAP-7, THAP-8, THAP-9, 
THAP10 or a THAP11 polypeptide. 

As used interchangeably herein, a "THAP-family activity", "biological activity of a THAP- 
family member" or "functional activity of a THAP-family member", refers to an activity exerted by 
a THAP family or THAP domain polypeptide or nucleic acid molecule, or a biologically active 
fragment or homologue thereof comprising a THAP as determined in vivo, or in vitro, according to 
standard techniques. In one embodiment, a THAP-family activity is a direct activity, such as an 
association with a THAP-family target molecule or most preferably apoptosis induction activity, or 
inhibition of cell proliferation or cell cycle. As used herein, a "THAP-family target molecule" is a 
molecule with which a THAP-family protein binds or interacts in nature, such that a THAP 
family-mediated function is achieved. For example, a THAP family target molecule can be another 
THAP-family protein or polypeptide which is substantially identical or which shares structural 
similarity (e.g. forming a dimer or multimer). In another example, a THAP family target molecule 
can be a non-THAP family comprising protein molecule, or a non-self molecule such as for 
example a Death Domain receptor. Binding or interaction with a THAP family target molecule 
(such as THAP1/PAR4 described herein) or with other targets can be detected for example using a 
two hybrid-based assay in yeast to find drugs that disrupt interaction of the THAP family bait with 
the target (e.g. PAR4) prey, or an in vitro interaction assay with recombinant THAP family and 
target proteins (e.g. THAP1 and PAR4). In yet another example, a THAP family target molecule 
can be a nucleic acid molecule. For instance, a THAP family target molecule can be DNA. 

Alternatively, a THAP-family activity may be an indirect activity, such as an activity 
mediated by interaction of the THAP-family protein with a THAP-family target molecule such that 
the target molecule modulates a downstream cellular activity (e.g., interaction of a THAP-family 
molecule with a THAP-family target molecule can modulate the activity of that target molecule on 
an intracellular signaling pathway). 

THAP-family activity is not limited to the induction of apoptotic activity, but may also 
involve enhancing apoptotic activity. As death domains may mediate protein-protein interactions, 
including interactions with other death domains, THAP-family activity may involve transducing a 
cytocidal signal. 

Assays to detect apoptosis are well known. In a preferred example, an assay is based on 
serum-withdrawal induced apoptosis in a 3T3 cell line with tetracycline-regulated expression of a 
THAP family member comprising a THAP domain. Other non-limiting examples are also 
described. 

In one example, a THAP family or THAP domain polypeptide, or a biologically active 
fragment or homologue thereof can be the minimum region of a polypeptide that is necessary and 
sufficient for the generation of cytotoxic death signals. Exemplary assays for apoptosis activity are 
further provided herein. 

In specific embodiments, PAR4 is a preferred THAP1 and/or THAP2 target molecule. In 
another aspect, a THAP1 target molecule is a PML-NB protein. 

In further aspects, a THAP domain or a THAP-family polypeptide comprises a DNA binding 
domain. 

In other aspects, a THAP-family activity is detected by assessing any of the following 
activities: (1) mediating apoptosis or cell proliferation when expressed in or introduced into a cell, 
most preferably inducing or enhancing apoptosis, and/or most preferably reducing cell proliferation; 
(2) mediating apoptosis or cell proliferation of an endothelial cell; (3) mediating apoptosis or cell 
proliferation of a hyperproliferative cell; (4) mediating apoptosis or cell proliferation of a CNS cell, 
preferably a neuronal or glial cell; (5) an activity determined in an animal selected from the group 
consisting of mediating, preferably inhibiting angiogenesis, mediating, preferably inhibiting 
inflammation, inhibition of metastatic potential of cancerous tissue, reduction of tumor burden, 
increase in sensitivity to chemotherapy or radiotherapy, killing a cancer cell, inhibition of the 
growth of a cancer cell, or induction of tumor regression; or (6) interaction with a THAP family 
target molecule or THAP domain target molecule, preferably interaction with a protein or a nucleic 
acid. Detecting THAP-family activity may also comprise detecting any suitable therapeutic 
endpoint discussed herein in the section titled "Methods of Treatment". THAP-family activity may 
be assessed either in vitro (cell or non-cell based) or in vivo depending on the assay type and 
format. 

A THAP domain has been identified in the N-terminal region of the THAP1 protein, from 
about amino acid 1 to about amino acid 89 of SEQ ID NO: 3 based on sequence analysis and 
functional assays. A THAP domain has also been identified in THAP2 to THAP0 of SEQ ID NOs: 
4-14. However, it will be appreciated that a functional THAP domain may be only a small portion 
of the protein, about 10 amino acids to about 15 amino acids, or from about 20 amino acids to about 
25 amino acids, or from about 30 amino acids to about 35 amino acids, or from about 40 amino 
acids to about 45 amino acids, or from about 50 amino acids to about 55 amino acids, or from about 
60 amino acids to about 70 amino acids, or from about 80 amino acids to about 90 amino acids, or 
about 100 amino acids in length. Alternatively, THAP domain or THAP family polypeptide 
activity, as defined above, may require a larger portion of the native protein than may be defined by 
protein-protein interaction, DNA binding, cell assays or by sequence alignment. A portion of a 
THAP domain-containing polypeptide from about 110 amino acids to about 115 amino acids, or 
from about 120 amino acids to 130 amino acids, or from about 140 amino acids to about 150 amino 
acids, or from about 160 amino acids to about 170 amino acids, or from about 180 amino acids to 
about 190 amino acids, or from about 200 amino acids to about 250 amino acids, or from about 300 
amino acids to about 350 amino acids, or from about 400 amino acids to about 450 amino acids, or 
from about 500 amino acids to about 600 amino acids, to the extent that said length is consistent 
with the SEQ ID No, or the full length protein, for example any full length protein in SEQ ID NOs: 
1-114, may be required for function. 

As discussed, the invention includes a novel protein domain, including several examples of 
THAP-family members. The invention thus encompasses a THAP-family member comprising a 
polypeptide having at least a THAP domain sequence in the protein or corresponding nucleic acid 
molecule, preferably a THAP domain sequence corresponding to SEQ ID NOs: 1-2. A THAP- 
family member may comprise an amino acid sequence of at least about 25, 30, 35, 40, 45, 50, 60, 
70, 80 to 90 amino acid residues in length, of which at least about 50-80%, preferably at least about 
60-70%, more preferably at least about 65%, 75% or 90% of the amino acid residues are identical 
or similar amino acids to the THAP consensus domain SEQ ID NOs: 1-2. 

Identity or similarity may be determined using any desired algorithm, including the 
algorithms and parameters for determining homology which are described herein. 

Optionally, a THAP-domain-containing THAP-family polypeptide comprises a nuclear 
localization sequence (NLS). As used herein, the term nuclear localization sequence refers to an 
amino acid sequence allowing the THAP-family polypeptide to be localized or transported to the cell 
nucleus. A nuclear localization sequence generally comprises at least about 10, preferably about 13, 
preferably about 16, more preferably about 19, and even more preferably about 21, 23, 25, 30, 35 or 
40 amino acid residues. Alternatively, a THAP-family polypeptide may comprise a deletion of part 
or the entire NLS or a substitution or insertion in a NLS sequence, such that the modified THAP- 
family polypeptide is not localized or transported to the cell nucleus. 

Isolated proteins of the present invention, preferably THAP family or THAP domain 
polypeptides, or biologically active fragments or homologues thereof, have an amino acid 
sequence sufficiently homologous to the consensus amino acid sequence of SEQ ID NOs: 1-2. As 
used herein, the term "sufficiently homologous" refers to a first amino acid or nucleotide sequence 
which contains a sufficient or minimum number of identical or equivalent (e.g., an amino acid 
residue which has a similar side chain) amino acid residues or nucleotides to a second amino acid or 
nucleotide sequence such that the first and second amino acid or nucleotide sequences share 
common structural domains or motifs and/or a common functional activity. For example, amino 
acid or nucleotide sequences which share common structural domains have at least about 30-40% 
identity, preferably at least about 40-50% identity, more preferably at least about 50-60%, and even 
more preferably at least about 60-70%, 70-80%, 80%, 90%, 95%, 97%, 98%, 99% or 99.8% 
identity across the amino acid sequences of the domains and contain at least one and preferably two 
structural domains or motifs, are defined herein as sufficiently homologous. Furthermore, amino 
acid or nucleotide sequences which share at least about 30%, preferably at least about 40%, more 
preferably at least about 60%, 70%, 80%, 90%, 95%, 97%, 98%, 99% or 99.8% identity and share a 
common functional activity are defined herein as sufficiently homologous. 
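
As a minimal sketch of how percent identity and percent similarity between two sequences might be
computed, the following Python example assumes that the two sequences have already been aligned
(gaps written as "-"). The sequences, the side-chain grouping in SIMILAR_GROUPS, the choice to
ignore gapped columns, and the function names are all illustrative assumptions; they are not the
THAP consensus of SEQ ID NOs: 1-2 and not the particular algorithms and parameters described
herein.

```python
# Hypothetical side-chain groups used to call two different residues "similar".
# This grouping is an assumption for illustration, not a definition from the text.
SIMILAR_GROUPS = [
    set("AVLIM"),  # aliphatic / hydrophobic
    set("FWY"),    # aromatic
    set("KRH"),    # basic
    set("DE"),     # acidic
    set("STNQ"),   # polar uncharged
]


def percent_identity_and_similarity(aligned_a, aligned_b):
    """Return (percent identity, percent similarity) for two pre-aligned sequences.

    Columns containing a gap ("-") in either sequence are skipped, so both
    percentages are taken over the columns where two residues are compared.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = identical = similar = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            identical += 1
            similar += 1  # identical residues also count as similar
        elif any(a in group and b in group for group in SIMILAR_GROUPS):
            similar += 1
    if compared == 0:
        raise ValueError("no residue pairs to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


def meets_identity_threshold(aligned_a, aligned_b, min_identity=30.0):
    """Check the identity percentage against a chosen minimum (e.g. 30, 40 or 60)."""
    identity, _ = percent_identity_and_similarity(aligned_a, aligned_b)
    return identity >= min_identity


# Toy aligned peptides (hypothetical, for illustration only).
toy_a = "MKRLE-TA"
toy_b = "MRRIEQTA"
identity, similarity = percent_identity_and_similarity(toy_a, toy_b)
print(f"identity {identity:.1f}%, similarity {similarity:.1f}%")
```

The threshold argument can be set to any of the percentages recited above; a different gap
convention or substitution scheme would give different values for the same pair of sequences.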

It will be appreciated that the invention encompasses any of the THAP-family polypeptides, as 
well as fragments thereof, nucleic acids complementary thereto and nucleic acids capable of 
hybridizing thereto under stringent conditions. 

As used herein, "THAP/chemokine complex" refers to a THAP-family polypeptide or a 
biologically active fragment thereof in association with a chemokine or a biologically active 
fragment thereof. In some embodiments, THAP/chemokine complexes include, but are not limited 
to, THAP1/SLC, THAP1/MIG, THAP1/CXCL10, THAP1/CXCL11, THAP1/CCL19 and 
THAP1/CCL5. 

THAP-0 to THAP11 

As mentioned, the inventors have identified several THAP-family members, including 
THAP-0, THAP1, THAP-2, THAP-3, THAP-4, THAP-5, THAP-6, THAP-7, THAP-8, THAP-9, 
THAP10 and THAP11. 

THAP1 Nucleic Acids 

The human THAP1 coding sequence, which is approximately 639 nucleotides in length 
shown in SEQ ID NO: 160, encodes a protein which is approximately 213 amino acid residues in 
length. One aspect of the invention pertains to purified or isolated nucleic acid molecules that 
encode THAP1 proteins or biologically active portions thereof as further described herein, as well 
as nucleic acid fragments thereof. Said nucleic acids may be used for example in therapeutic 
methods and drug screening assays as further described herein. 

The human THAP1 gene is localized at chromosomes 8, 18, 11. 

The THAP1 protein comprises a THAP domain at amino acids 1-89, the role of which in 
apoptosis is further demonstrated herein. The THAP1 protein comprises an interferon gamma 
homology motif at amino acids 136-169 of human THAP1 
(NYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQR) (SEQ ID NO: 178), exhibiting 41% 
identity in a 34 residue overlap with human interferon gamma (amino acids 98-131). PML-NBs are 
closely linked to IFN gamma, and many PML-NB components are induced by IFN gamma, with 
IFN gamma responsive elements in the promoters of the corresponding genes. The THAP1 protein 
also includes a nuclear localization sequence at amino acids 146-165 of human THAP1 
(RKRIHQLEQQVEKLRKKLKT) (SEQ ID NO: 179). This sequence is responsible for localization 
of THAP1 in the nucleus. As demonstrated in the examples provided herein, deletion mutants of 
THAP1 lacking this sequence are no longer localized in the cell nucleus. The THAP1 protein 
further comprises a PAR4 binding motif (LE(X)14QRXRRQXR(X)11QR/KE) (SEQ ID NO: 180). 
The core of this motif has been defined experimentally by site directed mutagenesis and by 
comparison with mouse ZIP/DAP-like kinase (another PAR4 binding partner); it overlaps amino 
acids 168-175 of human THAP1 but the motif may also include a few residues upstream and 
downstream. 
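
The residue coordinates given above can be cross-checked against the quoted sequences. The
following Python sketch is offered only as an illustration: it uses the two peptide sequences
and the residue numbers stated in this paragraph, while the helper names (span_length, locate,
overlap) are hypothetical and the full THAP1 sequence of SEQ ID NO: 3 is not reproduced.

```python
# Sequences and coordinates as quoted in the paragraph above.
IFN_GAMMA_HOMOLOGY = "NYTVEDTMHQRKRIHQLEQQVEKLRKKLKTAQQR"  # THAP1 residues 136-169
NLS = "RKRIHQLEQQVEKLRKKLKT"                                # THAP1 residues 146-165
PAR4_CORE_SPAN = (168, 175)                                 # core of the PAR4 binding motif


def span_length(start, end):
    """Number of residues in an inclusive, 1-based span."""
    return end - start + 1


def locate(sub, parent, parent_start):
    """Return the inclusive residue span of `sub` within `parent`.

    `parent_start` is the residue number of the first residue of `parent`.
    Returns None if `sub` does not occur in `parent`.
    """
    offset = parent.find(sub)
    if offset < 0:
        return None
    start = parent_start + offset
    return start, start + len(sub) - 1


def overlap(span_a, span_b):
    """Return the inclusive overlap of two spans, or None if they are disjoint."""
    start = max(span_a[0], span_b[0])
    end = min(span_a[1], span_b[1])
    return (start, end) if start <= end else None


# The stated spans match the lengths of the quoted sequences.
print(span_length(136, 169) == len(IFN_GAMMA_HOMOLOGY))
print(span_length(146, 165) == len(NLS))

# The nuclear localization sequence lies inside the interferon gamma homology motif.
print(locate(NLS, IFN_GAMMA_HOMOLOGY, 136))

# The PAR4 binding core shares residues with the end of the homology motif.
print(overlap((136, 169), PAR4_CORE_SPAN))
```

This check shows that the nuclear localization sequence is wholly contained in the interferon
gamma homology motif, and that the core of the PAR4 binding motif begins within the last
residues of that motif.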

ESTs corresponding to THAP1 have been identified, and may be specifically included or 
excluded from the nucleic acids of the invention. The ESTs, as indicated below by accession 
number, provide evidence for tissue distribution for THAP1 as follows: AL582975 (B cells from 
Burkitt lymphoma); BG708372 (Hypothalamus); BG563619 (liver); BG497522 (adenocarcinoma); 
BG616699 (liver); BE932253 (head_neck); AL530396 (neuroblastoma cells). 

An object of the invention is a purified, isolated, or recombinant nucleic acid comprising the 
nucleotide sequence of SEQ ID NO: 160, complementary sequences thereto, and fragments thereof. 
The invention also pertains to a purified or isolated nucleic acid comprising a polynucleotide having 
at least 95% nucleotide identity with a polynucleotide of SEQ ID NO: 160, advantageously 99% 
nucleotide identity, preferably 99.5% nucleotide identity and most preferably 99.8% nucleotide 
identity with a polynucleotide of SEQ ID NO: 160, or a sequence complementary thereto or a 
biologically active fragment thereof. Another object of the invention relates to purified, isolated or 
recombinant nucleic acids comprising a polynucleotide that hybridizes, under the stringent 
hybridization conditions defined herein, with a polynucleotide of SEQ ID NO: 160, or a sequence 
complementary thereto or a variant thereof or a biologically active fragment thereof. In further 
embodiments, nucleic acids of the invention include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 
80, 90, 100, 150, 200, 500, or 1000 nucleotides of SEQ ID NO: 160, or the complements thereof. 

Also encompassed is a purified, isolated, or recombinant nucleic acid polynucleotide 
encoding a THAP1 polypeptide of the invention, as further described herein. 

In another preferred aspect, the invention pertains to purified or isolated nucleic acid 
molecules that encode a portion or variant of a THAP1 protein, wherein the portion or variant 
displays a THAP1 activity of the invention. Preferably said portion or variant is a portion or variant 
of a naturally occurring full-length THAP1 protein. In one example, the invention provides a 
polynucleotide comprising, consisting essentially of, or consisting of a contiguous span of at least 
12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, or 1000 nucleotides of SEQ ID 
NO: 160, wherein said nucleic acid encodes a THAP1 portion or variant having a THAP1 activity 
described herein. In other embodiments, the invention relates to a polynucleotide encoding a 
THAP1 portion consisting of 8-20, 20-50, 50-70, 60-100, 100-150, 150-200, 200-205 or 205-212 
amino acids of SEQ ID NO: 3, or a variant thereof, wherein said THAP1 portion displays a THAP1 
activity described herein. 

The sequence of SEQ ID NO: 160 corresponds to the human THAP1 cDNA. This cDNA 
comprises sequences encoding the human THAP1 protein (i.e., "the coding region", from 
nucleotides 202 to 840), as well as 5' untranslated sequences (nucleotides 1-201) and 3' untranslated 
sequences (nucleotides 841 to 2173). 
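
The coordinates given for SEQ ID NO: 160 can be checked with a short arithmetic sketch. It uses only the positions stated above; the helper name is hypothetical, and the sketch makes no assumption about whether the stated coding region includes a stop codon.

```python
def span_length(first: int, last: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return last - first + 1


CDNA_LENGTH = 2173  # last nucleotide position recited for SEQ ID NO: 160

regions = {
    "5' untranslated": (1, 201),
    "coding region": (202, 840),
    "3' untranslated": (841, 2173),
}

lengths = {name: span_length(*coords) for name, coords in regions.items()}

# The three regions should tile the cDNA with no gap or overlap.
assert sum(lengths.values()) == CDNA_LENGTH

codons, remainder = divmod(lengths["coding region"], 3)
print(lengths)            # {"5' untranslated": 201, 'coding region': 639, "3' untranslated": 1333}
print(codons, remainder)  # 213 0
```

The coding region is 639 nucleotides long, a whole number of codons (213), and the three regions together account for all 2173 positions.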

Also encompassed by the THAP1 nucleic acids of the invention are nucleic acid molecules 
which are complementary to THAP1 nucleic acids described herein. Preferably, a complementary 
nucleic acid is sufficiently complementary to the nucleotide sequence shown in SEQ ID NO: 160, 
such that it can hybridize to the nucleotide sequence shown in SEQ ID NO: 160, thereby forming a 
stable duplex. 
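
As a minimal illustration of complementarity, the sketch below computes the reverse complement of a DNA strand and tests for perfect Watson-Crick pairing. The specification requires only that a nucleic acid be "sufficiently complementary" to form a stable duplex, so exact complementarity is a stricter, simplified stand-in; the function names and the toy probe are hypothetical.

```python
# Translation table for the four unambiguous DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def is_fully_complementary(strand_a: str, strand_b: str) -> bool:
    """True when strand_b pairs with every base of strand_a (antiparallel)."""
    return strand_a.upper() == reverse_complement(strand_b).upper()


probe = "ATGGTGCAGTCC"              # toy 12-mer, not taken from SEQ ID NO: 160
target = reverse_complement(probe)  # "GGACTGCACCAT"

print(target)
print(is_fully_complementary(probe, target))  # True
```

A practical hybridization check would also tolerate mismatches and take the stringency conditions into account, which this sketch deliberately ignores.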

Another object of the invention is a purified, isolated, or recombinant nucleic acid encoding a 
THAP1 polypeptide comprising, consisting essentially of, or consisting of the amino acid sequence 
of SEQ ID NO: 3, or fragments thereof, wherein the isolated nucleic acid molecule encodes one or 
more motifs selected from the group consisting of a THAP domain, a THAP1 target binding region, 
a nuclear localization signal and an interferon gamma homology motif. Preferably said THAP1 
target binding region is a PAR4 binding region or a DNA binding region. For example, the 
purified, isolated or recombinant nucleic acid may comprise a genomic DNA or fragment thereof 
which encodes the polypeptide of SEQ ID NO: 3 or a fragment thereof or a cDNA consisting of, 
consisting essentially of, or comprising the sequence of SEQ ID NO: 160 or fragments thereof, 
wherein the isolated nucleic acid molecule encodes one or more motifs selected from the group 
consisting of a THAP domain, a THAP1-target binding region, a nuclear localization signal and an 
interferon gamma homology motif. Any combination of said motifs may also be specified. 
Preferably said THAP1 target binding region is a PAR4 binding region or a DNA binding region. 
Particularly preferred nucleic acids of the invention include isolated, purified, or recombinant 
THAP1 nucleic acids comprising, consisting essentially of, or consisting of a contiguous span of at 
least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200 or 300 nucleotides of a 
sequence selected from the group consisting of nucleotide positions 607 to 708, 
637 to 696 and 703 to 747 of SEQ ID NO: 160. In preferred embodiments, a THAP1 nucleic acid 
encodes a THAP1 polypeptide comprising at least two THAP1 functional domains, such as for 
example a THAP domain and a PAR4 binding region. 
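
The nucleotide ranges recited above can be related to amino acid positions with a small sketch. It assumes that residue 1 of the encoded protein corresponds to the codon starting at nucleotide 202 of SEQ ID NO: 160 (the stated start of the coding region); the helper names are hypothetical, and the output is arithmetic under that assumption only.

```python
CDS_START = 202  # first nucleotide of the coding region, as stated for SEQ ID NO: 160


def codon_index(nt_pos: int, cds_start: int = CDS_START) -> int:
    """1-based index of the codon that contains nucleotide position nt_pos."""
    if nt_pos < cds_start:
        raise ValueError("position lies upstream of the coding region")
    return (nt_pos - cds_start) // 3 + 1


def residue_range(first_nt: int, last_nt: int) -> tuple[int, int]:
    """Codon (residue) indices covered by an inclusive nucleotide range."""
    return codon_index(first_nt), codon_index(last_nt)


for first_nt, last_nt in [(607, 708), (637, 696), (703, 747)]:
    print((first_nt, last_nt), "->", residue_range(first_nt, last_nt))
# (607, 708) -> (136, 169)
# (637, 696) -> (146, 165)
# (703, 747) -> (168, 182)
```

Under this assumption the first two nucleotide ranges map onto residues 136 to 169 and 146 to 165, the same amino acid ranges recited for SEQ ID NO: 3 in the polypeptide section.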

In further preferred embodiments, a THAP1 nucleic acid comprises a nucleotide sequence 
encoding a THAP domain having the consensus amino acid sequence of the formula of SEQ ID 
NOs: 1-2. A THAP1 nucleic acid may also encode a THAP domain wherein at least about 95%, 
90%, 85%, 50-80%, preferably at least about 60-70%, more preferably at least about 65% of the 
amino acid residues are identical or similar amino acids to the THAP domain consensus sequence 
(SEQ ID NOs: 1-2). The present invention also embodies isolated, purified, and recombinant 
polynucleotides which encode a polypeptide comprising a contiguous span of at least 6 amino 
acids, preferably at least 8 or 10 amino acids, more preferably at least 15, 25, 30, 35, 40, 45, 50, 60, 
70, 80 or 90 amino acids according to the formula of SEQ ID NOs: 1-2. 

The nucleotide sequence determined from the cloning of the THAP1 gene allows for the 
generation of probes and primers designed for use in identifying and/or cloning other THAP1 
family members (e.g. sharing the novel functional domains), as well as THAP1 homologues from 
other species. 

A nucleic acid fragment encoding a "biologically active portion of a THAP1 protein" can 
be prepared by isolating a portion of the nucleotide sequence of SEQ ID NO: 160, which encodes a 
polypeptide having a THAP1 biological activity (the biological activities of the THAP1 proteins 
described herein), expressing the encoded portion of the THAP1 protein (e.g., by recombinant 
expression in vitro or in vivo) and assessing the activity of the encoded portion of the THAP1 
protein. 

The invention further encompasses nucleic acid molecules that differ from the THAP1 
nucleotide sequences of the invention due to degeneracy of the genetic code and encode the same 
THAP1 proteins and fragments of the invention. 

In addition to the THAP1 nucleotide sequences described above, it will be appreciated by 
those skilled in the art that DNA sequence polymorphisms that lead to changes in the amino acid 
sequences of the THAP1 proteins may exist within a population (e.g., the human population). Such 
genetic polymorphism may exist among individuals within a population due to natural allelic 
variation. Such natural allelic variations can typically result in 1-5% variance in the nucleotide 
sequence of a THAP1 gene. 

Nucleic acid molecules corresponding to natural allelic variants and homologues of the 
THAP1 nucleic acids of the invention can be isolated based on their homology to the THAP1 
nucleic acids disclosed herein using the cDNAs disclosed herein, or a portion thereof, as a 
hybridization probe according to standard hybridization techniques under stringent hybridization 
conditions. 

Probes based on the THAP1 nucleotide sequences can be used to detect transcripts or 
genomic sequences encoding the same or homologous proteins. In preferred embodiments, the 
probe further comprises a label group attached thereto, e.g., the label group can be a radioisotope, a 
fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be used as a part of a 
diagnostic test kit for identifying cells or tissue which misexpress a THAP1 protein, such as by 
measuring a level of a THAP1-encoding nucleic acid in a sample of cells from a subject, e.g., 
detecting THAP1 mRNA levels or determining whether a genomic THAP1 gene has been mutated 
or deleted. 

THAP1 Polypeptides 

The term "THAP1 polypeptides" is used herein to embrace all of the proteins and 
polypeptides of the present invention. Also forming part of the invention are polypeptides encoded 
by the polynucleotides of the invention, as well as fusion polypeptides comprising such 
polypeptides. The invention embodies THAP1 proteins from humans, including isolated or purified 
THAP1 proteins consisting of, consisting essentially of, or comprising the sequence of SEQ ID NO: 
3. 

Aspects of the present invention concern the polypeptide encoded by a nucleotide sequence 
of SEQ ID NO: 160, a complementary sequence thereto or a fragment thereof. 

Another aspect of the present invention embodies isolated, purified, and recombinant 
polypeptides comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 
amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of SEQ ID NO: 
3. In other preferred embodiments the contiguous stretch of amino acids comprises the site of a 
mutation or functional mutation, including a deletion, addition, swap or truncation of the amino 
acids in the THAP1 protein sequence. The invention also concerns the polypeptide encoded by the 
THAP1 nucleotide sequences of the invention, or a complementary sequence thereof or a fragment 
thereof. 

One aspect of the invention pertains to isolated THAP1 proteins, and biologically active 
portions thereof, as well as polypeptide fragments suitable for use as immunogens to raise 
anti-THAP1 antibodies. In one embodiment, native THAP1 proteins can be isolated from cells or tissue 
sources by an appropriate purification scheme using standard protein purification techniques. In 
another embodiment, THAP1 proteins are produced by recombinant DNA techniques. As an alternative 
to recombinant expression, a THAP1 protein or polypeptide can be synthesized chemically using 
standard peptide synthesis techniques. 

Typically, biologically active portions comprise a domain or motif with at least one activity 
of the THAP1 protein. The present invention also embodies isolated, purified, and recombinant 
portions or fragments of a THAP1 polypeptide comprising a contiguous span of at least 6 amino 
acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, 100 
or 200 amino acids of SEQ ID NO: 3. Also encompassed are THAP1 polypeptides which comprise 
between 10 and 20, between 20 and 50, between 30 and 60, between 50 and 100, or between 100 
and 200 amino acids of SEQ ID NO: 3. In other preferred embodiments the contiguous stretch of 
amino acids comprises the site of a mutation or functional mutation, including a deletion, addition, 
swap or truncation of the amino acids in the THAP1 protein sequence. 

A biologically active THAP1 protein may, for example, comprise at least 1, 2, 3, 5, 10, 20 or 
30 amino acid changes from the sequence of SEQ ID NO: 3, or may comprise at least 1%, 2%, 3%, 
5%, 8%, 10% or 15% changes in amino acids from the sequence of SEQ ID NO: 3. 

In a preferred embodiment, the THAP1 protein comprises, consists essentially of, or consists 
of a THAP domain at amino acid positions 1 to 89 shown in SEQ ID NO: 3, or fragments or 
variants thereof. In other aspects, a THAP1 polypeptide comprises a THAP1-target binding region, 
a nuclear localization signal and/or an Interferon Gamma Homology Motif. Preferably a THAP1 
target binding region is a PAR4 binding region or a DNA binding region. The invention also 
concerns the polypeptide encoded by the THAP1 nucleotide sequences of the invention, or a 
complementary sequence thereof or a fragment thereof. The present invention thus also embodies 
isolated, purified, and recombinant polypeptides comprising, consisting essentially of or consisting 
of a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 
preferably at least 12, 15, 20, 25, 30, 40, 50, 70, 80, 90 or 100 amino acids of an amino acid 
sequence selected from the group consisting of positions 1 to 90, 136 to 169, 146 to 165 and 168 to 
175 of SEQ ID NO: 3. In another aspect, a THAP1 polypeptide may comprise a THAP domain 
wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 60-70%, more preferably 
at least about 65% of the amino acid residues are identical or similar amino acids to the THAP 
domain consensus sequence (SEQ ID NOs: 1-2). Also encompassed by the present invention are 
isolated, purified nucleic acids encoding a THAP1 polypeptide comprising, consisting essentially 
of, or consisting of a THAP domain at amino acid positions 1 to 90 shown in SEQ ID NO: 3, or 
fragments or variants thereof. 

In other embodiments, the THAP1 protein is substantially homologous to the sequence of 
SEQ ID NO: 3, and retains the functional activity of the THAP1 protein, yet differs in amino acid 
sequence due to natural allelic variation or mutagenesis, as described further herein. Accordingly, in 
another embodiment, the THAP1 protein is a protein which comprises an amino acid sequence that 
shares more than about 60% but less than 100% homology with the amino acid sequence of SEQ ID 
NO: 3 and retains the functional activity of the THAP1 protein of SEQ ID NO: 3. 
Preferably, the protein is at least about 30%, 40%, 50%, 60%, 70%, 80%, 85%, 90%, 92%, 95%, 
97%, 98%, 99% or 99.8% homologous to SEQ ID NO: 3, but is not identical to SEQ ID NO: 3. 
Preferably the THAP1 protein is less than identical (i.e., has less than 100% identity) to a naturally 
occurring THAP1. Percent homology can be determined as further detailed above. 

THAP-2 to THAP11 and THAP-0 Nucleic Acids 

As mentioned, the invention provides several members of the THAP-family. THAP-2, 
THAP-3, THAP-4, THAP-5, THAP-6, THAP-7, THAP-8, THAP-9, THAP10, THAP11 and 
THAP-0 are described herein. The human and mouse nucleotide sequences corresponding to the 
human cDNA sequences are listed in SEQ ID NOs: 161-171; and the human amino acid sequences 
are listed respectively in SEQ ID NOs: 4-14. Also encompassed by the invention are orthologs of 
said THAP-family sequences, including mouse, rat, pig and other orthologs, the amino acid 
sequences of which are listed in SEQ ID NOs: 16-114 and the cDNA sequences are listed in SEQ 
ID NOs: 172-175. 
THAP-2 

The human THAP-2 cDNA, which is approximately 1302 nucleotides in length and is shown in 
SEQ ID NO: 161, encodes a protein which is approximately 228 amino acid residues in length, 
shown in SEQ ID NO: 4. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP-2 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The human THAP-2 
gene is localized at chromosomes 12 and 3. The THAP-2 protein comprises a THAP domain at 
amino acids 1 to 89. Analysis of expressed sequences (accession numbers indicated, which may be 
specifically included or excluded from the nucleic acids of the invention) in databases suggests that 
THAP-2 is expressed as follows: BG677995 (squamous cell carcinoma); AV718199 
(hypothalamus); BI600215 (hypothalamus); AI208780 (Soares_testis_NHT); BE566995 
(carcinoma cell line); and AI660418 (thymus pooled). 
THAP-3 

The human THAP-3 cDNA, which is approximately 1995 nucleotides in length, is shown in 
SEQ ID NO: 162. The THAP-3 gene encodes a protein which is approximately 239 amino acid 
residues in length, shown in SEQ ID NO: 5. One aspect of the invention pertains to purified or 
isolated nucleic acid molecules that encode THAP-3 proteins or biologically active portions thereof 
as further described herein, as well as nucleic acid fragments thereof. Said nucleic acids may be 
used for example in therapeutic methods and drug screening assays as further described herein. The 
human THAP-3 gene is localized at chromosome 1. The THAP-3 protein comprises a THAP 
domain at amino acids 1 to 89. Analysis of expressed sequences (accession numbers indicated, 
which may be specifically included or excluded from the nucleic acids of the invention) in 
databases suggests that THAP-3 is expressed as follows: BG700517 (hippocampus); BI460812 
(testis); BG707197 (hypothalamus); AW960428 (-); BG437177 (large cell carcinoma); BE962820 
(adenocarcinoma); BE548411 (cervical carcinoma cell line); AL522189 (neuroblastoma cells); 
BE545497 (cervical carcinoma cell line); BE280538 (choriocarcinoma); BI086954 (cervix); 
BE744363 (adenocarcinoma cell line); and BI549151 (hippocampus). 
THAP-4 

The human THAP-4 cDNA, shown as a sequence 1999 nucleotides in length 
in SEQ ID NO: 163, encodes a protein which is approximately 577 amino acid residues in length, 
shown in SEQ ID NO: 6. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP-4 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The THAP-4 protein 
comprises a THAP domain at amino acids 1 to 90. Analysis of expressed sequences (accession 
numbers indicated, which may be specifically included or excluded from the nucleic acids of the 
invention) in databases suggests that THAP-4 is expressed as follows: AL544881 (placenta); 
BE384014 (melanotic melanoma); AL517205 (neuroblastoma cells); BG394703 (retinoblastoma); 
BG472327 (retinoblastoma); BI196071 (neuroblastoma); BE255202 (retinoblastoma); BI017349 
(lung_tumor); BF972153 (leiomyosarcoma cell line); BG116061 (duodenal adenocarcinoma cell 
line); AL530558 (neuroblastoma cells); AL520036 (neuroblastoma cells); AL559902 (B cells from 
Burkitt lymphoma); AL534539 (Fetal brain); BF686560 (leiomyosarcoma cell line); BF345413 
(anaplastic oligodendroglioma with 1p/19q loss); BG117228 (adenocarcinoma cell line); 
BG490646 (large cell carcinoma); and BF769104 (epid_tumor). 
THAP-5 

The human THAP-5 cDNA, shown as a sequence 1034 nucleotides in length 
in SEQ ID NO: 164, encodes a protein which is approximately 239 amino acid residues in length, 
shown in SEQ ID NO: 7. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP-5 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The human THAP-5 
gene is localized at chromosome 7. The THAP-5 protein comprises a THAP domain at amino acids 
1 to 90. Analysis of expressed sequences (accession numbers indicated, which may be specifically 
included or excluded from the nucleic acids of the invention) in databases suggests that THAP-5 is 
expressed as follows: BG575430 (mammary adenocarcinoma cell line); BI545812 (hippocampus); 
BI560073 (testis); BG530461 (embryonal carcinoma); BF244164 (glioblastoma); BI461364 (testis); 
AW407519 (germinal center B cells); BF103690 (embryonal carcinoma); and BF939577 (kidney). 
THAP-6 

The human THAP-6 cDNA, shown as a sequence 2291 nucleotides in length 
in SEQ ID NO: 165, encodes a protein which is approximately 222 amino acid residues in length, 
shown in SEQ ID NO: 8. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP-6 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The human THAP-6 
gene is localized at chromosome 4. The THAP-6 protein comprises a THAP domain at amino acids 
1 to 90. Analysis of expressed sequences (accession numbers indicated, which may be specifically 
included or excluded from the nucleic acids of the invention) in databases suggests that THAP-6 is 
expressed as follows: AV684783 (hepatocellular carcinoma); AV698391 (hepatocellular 
carcinoma); BI560555 (testis); AV688768 (hepatocellular carcinoma); AV692405 (hepatocellular 
carcinoma); and AV696360 (hepatocellular carcinoma). 
THAP-7 

The human THAP-7 cDNA, shown as a sequence 1242 nucleotides in length 
in SEQ ID NO: 166, encodes a protein which is approximately 309 amino acid residues in length, 
shown in SEQ ID NO: 9. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP-7 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The human THAP-7 
gene is localized at chromosome 22q11.2. The THAP-7 protein comprises a THAP domain at 
amino acids 1 to 90. Analysis of expressed sequences (accession numbers indicated, which may be 
specifically included or excluded from the nucleic acids of the invention) in databases suggests that 
THAP-7 is expressed as follows: BI193682 (epithelioid carcinoma cell line); BE253146 
(retinoblastoma); BE622113 (melanotic melanoma); BE740360 (adenocarcinoma cell line); 
BE513955 (Burkitt lymphoma); AL049117 (testis); BF952983 (nervous_normal); AW975614 (-); 
BE273270 (renal cell adenocarcinoma); BE738428 (glioblastoma); BE388215 (endometrium 
adenocarcinoma cell line); BF762401 (colon_est); and BG329264 (retinoblastoma). 
THAP-8 

The human THAP-8 cDNA, shown as a sequence 1383 nucleotides in length 
in SEQ ID NO: 167, encodes a protein which is approximately 274 amino acid residues in length, 
shown in SEQ ID NO: 10. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP-8 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The human THAP-8 
gene is localized at chromosome 19. The THAP-8 protein comprises a THAP domain at amino 
acids 1 to 92. Analysis of expressed sequences (accession numbers indicated, which may be 
specifically included or excluded from the nucleic acids of the invention) in databases suggests that 
THAP-8 is expressed as follows: BG703645 (hippocampus); BF026346 (melanotic melanoma); 
BE728495 (melanotic melanoma); BG334298 (melanotic melanoma); and BE390697 
(endometrium adenocarcinoma cell line). 
THAP-9 

The human THAP-9 cDNA, shown as a sequence 693 nucleotides in length in 
SEQ ID NO: 168, encodes a protein which is approximately 231 amino acid residues in length, 
shown in SEQ ID NO: 11. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP-9 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The THAP-9 protein 
comprises a THAP domain at amino acids 1 to 92. Analysis of expressed sequences (accession 
numbers indicated, which may be specifically included or excluded from the nucleic acids of the 
invention) in databases suggests that THAP-9 is expressed as follows: AA333595 (Embryo 8 
weeks). 

THAP10 

The human THAP10 cDNA, shown as a sequence 771 nucleotides in length in 
SEQ ID NO: 169, encodes a protein which is approximately 257 amino acid residues in length, 
shown in SEQ ID NO: 12. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP10 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The human THAP10 
gene is localized at chromosome 15. The THAP10 protein comprises a THAP domain at amino 
acids 1 to 90. Analysis of expressed sequences (accession numbers indicated, which may be 
specifically included or excluded from the nucleic acids of the invention) in databases suggests that 
THAP10 is expressed as follows: AL526710 (neuroblastoma cells); AV725499 (Hypothalamus); 
AW966404 (-); AW296810 (lung); and AL557817 (T cells from T cell leukemia). 
THAP11 

The human THAP11 cDNA, shown as a sequence 942 nucleotides in length in 
SEQ ID NO: 170, encodes a protein which is approximately 314 amino acid residues in length, 
shown in SEQ ID NO: 13. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP11 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The human THAP11 
gene is localized at chromosome 16. The THAP11 protein comprises a THAP domain at amino 
acids 1 to 90. Analysis of expressed sequences (accession numbers indicated, which may be 
specifically included or excluded from the nucleic acids of the invention) in databases suggests that 
THAP11 is expressed as follows: AU142300 (retinoblastoma); BI261822 (lymphoma cell line); 
BG423102 (renal cell adenocarcinoma); and BG423864 (kidney). 
THAP-0 

The human THAP-0 cDNA, shown as a sequence 2283 nucleotides in length 
in SEQ ID NO: 171, encodes a protein which is approximately 761 amino acid residues in length, 
shown in SEQ ID NO: 14. One aspect of the invention pertains to purified or isolated nucleic acid 
molecules that encode THAP-0 proteins or biologically active portions thereof as further described 
herein, as well as nucleic acid fragments thereof. Said nucleic acids may be used for example in 
therapeutic methods and drug screening assays as further described herein. The human THAP-0 
gene is localized at chromosome 11. The THAP-0 protein comprises a THAP domain at amino 
acids 1 to 90. Analysis of expressed sequences (accession numbers indicated, which may be 
specifically included or excluded from the nucleic acids of the invention) in databases suggests that 
THAP-0 is expressed as follows: BE713222 (head_neck); BE161184 (head_neck); AL119452 
(amygdala); AU129709 (teratocarcinoma); AW965460 (-); AW96546O0; AW958065 (-); and 
BE886885 (leiomyosarcoma). 

An object of the invention is a purified, isolated, or recombinant nucleic acid comprising 
the nucleotide sequence of SEQ ID NOs: 161-171, 173-175 or complementary sequences thereto, 
and fragments thereof. The invention also pertains to a purified or isolated nucleic acid comprising 
a polynucleotide having at least 95% nucleotide identity with a polynucleotide of SEQ ID NOs: 
161-171 or 173-175, advantageously 99% nucleotide identity, preferably 99.5% nucleotide identity 
and most preferably 99.8% nucleotide identity with a polynucleotide of SEQ ID NOs: 161-171, 
173-175 or a sequence complementary thereto or a biologically active fragment thereof. Another 
object of the invention relates to purified, isolated or recombinant nucleic acids comprising a 
polynucleotide that hybridizes, under the stringent hybridization conditions defined herein, with a 
polynucleotide of SEQ ID NOs: 161-171, 173-175 or a sequence complementary thereto or a 
variant thereof or a biologically active fragment thereof. In further embodiments, nucleic acids of 
the invention include isolated, purified, or recombinant polynucleotides comprising a contiguous 
span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, or 1000 
nucleotides of a sequence selected from the group consisting of SEQ ID NOs: 161-171, 173-175 or 
the complements thereof. 

Also encompassed is a purified, isolated, or recombinant nucleic acid polynucleotide 
encoding a THAP-2 to THAP11 or THAP-0 polypeptide of the invention, as further described 
herein. 

In another preferred aspect, the invention pertains to purified or isolated nucleic acid 
molecules that encode a portion or variant of a THAP-2 to THAP11 or THAP-0 protein, wherein 
the portion or variant displays a THAP-2 to THAP11 or THAP-0 activity of the invention. 
Preferably said portion or variant is a portion or variant of a naturally occurring full-length THAP-2 
to THAP11 or THAP-0 protein. In one example, the invention provides a polynucleotide 
comprising, consisting essentially of, or consisting of a contiguous span of at least 12, 15, 18, 20, 
25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, or 1000 nucleotides, to the extent that the 
length of said span is consistent with the length of the SEQ ID NO, of a sequence selected from the 
group consisting of SEQ ID NOs: 161-171, 173-175, wherein said nucleic acid encodes a THAP-2 
to THAP11 or THAP-0 portion or variant having a THAP-2 to THAP11 or THAP-0 activity 
described herein. In other embodiments, the invention relates to a polynucleotide encoding a 
THAP-2 to THAP11 or THAP-0 portion consisting of 8-20, 20-50, 50-70, 60-100, 100-150, 150-200, 
200-250 or 250-350 amino acids, to the extent that the length of said portion is consistent with the 
length of the SEQ ID NO, of a sequence selected from the group consisting of SEQ ID NOs: 4-14, 
17-21, 23-40, 42-56, 58-98, 100-114 or a variant thereof, wherein said THAP-2 to THAP11 or 
THAP-0 portion displays a THAP-2 to THAP11 or THAP-0 activity described herein. 

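The qualifier "to the extent that the length of said span is consistent with the length of the SEQ ID NO" can be made concrete with a short sketch. It takes the human cDNA lengths stated above for SEQ ID NOs: 161-171 and keeps only the recited minimum span lengths that fit within each sequence. SEQ ID NOs: 173-175 are omitted because their lengths are not given in this passage, and the function names are hypothetical.

```python
# Minimum contiguous span lengths recited in the text.
SPAN_LENGTHS = [12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90,
                100, 150, 200, 500, 1000]

# Human cDNA lengths (nucleotides) as stated for SEQ ID NOs: 161-171.
CDNA_LENGTHS = {
    "THAP-2": 1302, "THAP-3": 1995, "THAP-4": 1999, "THAP-5": 1034,
    "THAP-6": 2291, "THAP-7": 1242, "THAP-8": 1383, "THAP-9": 693,
    "THAP10": 771, "THAP11": 942, "THAP-0": 2283,
}


def admissible_spans(seq_len: int) -> list[int]:
    """Recited span lengths that do not exceed the sequence length."""
    return [n for n in SPAN_LENGTHS if n <= seq_len]


def window_count(seq_len: int, span: int) -> int:
    """Number of distinct start positions for a contiguous span of a given length."""
    return max(seq_len - span + 1, 0)


for name, length in CDNA_LENGTHS.items():
    longest = admissible_spans(length)[-1]
    print(f"{name}: longest recited span that fits = {longest} nt "
          f"({window_count(length, longest)} possible windows)")
```

For the three sequences shorter than 1000 nucleotides (THAP-9, THAP10 and THAP11), the 1000-nucleotide span cannot apply, so the longest recited span that fits is 500 nucleotides.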
A THAP-2 to THAP11 or THAP-0 variant nucleic acid may, for example, encode a 
biologically active THAP-2 to THAP11 or THAP-0 protein comprising at least 1, 2, 3, 5, 10, 20 or 
30 amino acid changes from the respective sequence selected from the group consisting of SEQ ID 
NOs: 4-14, 17-21, 23-40, 42-56, 58-98 and 100-114 or may encode a biologically active THAP-2 to 
THAP11 or THAP-0 protein comprising at least 1%, 2%, 3%, 5%, 8%, 10% or 15% changes in 
amino acids from the respective sequence of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 and 
100-114. 

The sequences of SEQ ID NOs: 4-14 correspond to the human THAP-2 to THAP11 and 
THAP-0 DNAs respectively. SEQ ID NOs: 17-21, 23-40, 42-56, 58-98, 100-114 correspond to 
mouse, rat, pig and other orthologs. 

Also encompassed by the THAP-2 to THAP11 and THAP-0 nucleic acids of the invention 
are nucleic acid molecules which are complementary to THAP-2 to THAP11 or THAP-0 nucleic 
acids described herein. Preferably, a complementary nucleic acid is sufficiently complementary to 
the respective nucleotide sequence shown in SEQ ID NOs: 161-171 and 173-175 such that it can 
hybridize to said nucleotide sequence shown in SEQ ID NOs: 161-171 and 173-175, thereby 
forming a stable duplex. 

Another object of the invention is a purified, isolated, or recombinant nucleic acid encoding 
a THAP-2 to THAP11 or THAP-0 polypeptide comprising, consisting essentially of, or consisting 
of an amino acid sequence selected from the group consisting of SEQ ID NOs: 4-14, 17-21, 23-40, 
42-56, 58-98, 100-114 or fragments thereof, wherein the isolated nucleic acid molecule encodes a 
THAP domain or a THAP-2 to THAP11 or THAP-0 target binding region. Preferably said target 
binding region is a protein binding region, preferably a PAR-4 binding region, or preferably said 
target binding region is a DNA binding region. For example, the purified, isolated or recombinant 
nucleic acid may comprise a genomic DNA or fragment thereof which encodes a polypeptide 
having a sequence selected from the group consisting of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 
58-98, 100-114 or a fragment thereof. The purified, isolated or recombinant nucleic acid may 
alternatively comprise a cDNA consisting of, consisting essentially of, or comprising a sequence 
selected from the group consisting of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98, 100-114 or 
fragments thereof, wherein the isolated nucleic acid molecule encodes a THAP domain or a 
THAP-2 to THAP11 or THAP-0 target binding region. In preferred embodiments, a THAP-2 to THAP11 
or THAP-0 nucleic acid encodes a THAP-2 to THAP11 or THAP-0 polypeptide comprising at least 
two THAP-2 to THAP11 or THAP-0 functional domains, such as for example a THAP domain and 
a THAP-2 to THAP11 or THAP-0 target binding region. 

Particularly preferred nucleic acids of the invention include isolated, purified, or 
recombinant THAP-2 to THAP1 1 or THAP-0 nucleic acids comprising, consisting essentially of, or 
consisting of a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 
150, 200 or 250 nucleotides of a sequence selected from the group consisting of nucleotide 
positions coding for the relevant amino acids as given in SEQ ID NOs: 161-171 and 173-175. 
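
The notion of a "contiguous span of at least N nucleotides" can be illustrated with a short sketch. The function name `contiguous_spans` and the 20-nucleotide toy sequence below are hypothetical and chosen only for illustration; the toy sequence is not taken from the Sequence Listing.

```python
def contiguous_spans(sequence, length):
    """Return every contiguous span of exactly `length` residues in `sequence`."""
    if length < 1:
        raise ValueError("length must be a positive integer")
    # A sequence of n residues contains n - length + 1 windows of the given length;
    # range() is empty when the requested length exceeds the sequence length.
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Hypothetical toy sequence (illustration only, not a THAP sequence).
toy_sequence = "ATGGCTAAAGTCCGTTACGA"
spans_of_12 = contiguous_spans(toy_sequence, 12)
print(len(spans_of_12), spans_of_12[0])
```

Under these assumptions, any element of the returned list is a contiguous span of the stated minimum length; longer spans are obtained by calling the function with a larger `length`.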

In further preferred embodiments, a THAP-2 to THAP11 or THAP-0 nucleic acid 
comprises a nucleotide sequence encoding a THAP domain having the consensus amino acid 
sequence of the formula of SEQ ID NOs: 1-2. A THAP-2 to THAP11 or THAP-0 nucleic acid may 
also encode a THAP domain wherein at least about 95%, 90%, 85%, 50-80%, preferably at least 
about 60-70%, more preferably at least about 65% of the amino acid residues are identical or 
similar amino acids to the THAP consensus domain (SEQ ID NOs: 1-2). The present invention 
also embodies isolated, purified, and recombinant polynucleotides which encode a polypeptide 
comprising a contiguous span of at least 6 amino acids, preferably at least 8 or 10 amino acids, 
more preferably at least 15, 25, 30, 35, 40, 45, 50, 60, 70, 80 or 90 amino acids of SEQ ID NOs: 1-2. 
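
As a minimal sketch of how a percentage of "identical or similar" residues might be computed, the following assumes two sequences that have already been aligned without gaps. It treats two residues as similar when they share one of the side-chain families enumerated elsewhere in this specification (basic, acidic, uncharged polar, nonpolar, beta-branched, aromatic). The function names and the 10-residue toy sequences are hypothetical; the toy sequences are not the THAP consensus of SEQ ID NOs: 1-2, and other similarity definitions (for example, substitution matrices) could be used instead.

```python
# Side-chain families in one-letter code, following the families listed in this specification.
FAMILIES = [
    set("KRH"),       # basic: lysine, arginine, histidine
    set("DE"),        # acidic: aspartic acid, glutamic acid
    set("GNQSTYC"),   # uncharged polar
    set("AVLIPFMW"),  # nonpolar
    set("TVI"),       # beta-branched
    set("YFWH"),      # aromatic
]


def is_identical_or_similar(a, b):
    """True when residues are identical or share at least one side-chain family."""
    return a == b or any(a in family and b in family for family in FAMILIES)


def percent_identical_or_similar(candidate, consensus):
    """Percentage of aligned positions that are identical or similar (ungapped)."""
    if not consensus or len(candidate) != len(consensus):
        raise ValueError("sequences must be non-empty and pre-aligned to equal length")
    matches = sum(is_identical_or_similar(a, b) for a, b in zip(candidate, consensus))
    return 100.0 * matches / len(consensus)


# Hypothetical toy sequences (illustration only).
toy_consensus = "KDSTAVLWYH"
toy_candidate = "RESTGVLFYH"
score = percent_identical_or_similar(toy_candidate, toy_consensus)
print(score)
```

In this toy comparison only the alanine-to-glycine position falls outside a shared family, so nine of the ten positions count as identical or similar.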

The nucleotide sequence determined from the cloning of the THAP-2 to THAP11 or 
THAP-0 genes allows for the generation of probes and primers designed for use in identifying 
and/or cloning other THAP family members, particularly sequences related to THAP-2 to THAP11 
or THAP-0 (e.g. sharing the novel functional domains), as well as THAP-2 to THAP11 or THAP-0 
homologues from other species. 

A nucleic acid fragment encoding a biologically active portion of a THAP-2 to THAP11 or 
THAP-0 protein can be prepared by isolating a portion of a nucleotide sequence selected from the 
group consisting of SEQ ID NOs: 161-171 and 173-175, which encodes a polypeptide having a 
THAP-2 to THAP11 or THAP-0 biological activity (the biological activities of the THAP-family 
proteins described herein), expressing the encoded portion of the THAP-2 to THAP11 or THAP-0 
protein (e.g., by recombinant expression in vitro or in vivo) and assessing the activity of the 
encoded portion of the THAP-2 to THAP11 or THAP-0 protein. 

The invention further encompasses nucleic acid molecules that differ from the THAP-2 to 
THAP11 or THAP-0 nucleotide sequences of the invention due to degeneracy of the genetic code 
and encode the same THAP-2 to THAP11 or THAP-0 protein, or fragment thereof, of the invention. 

In addition to the THAP-2 to THAP11 or THAP-0 nucleotide sequences described above, it 
will be appreciated by those skilled in the art that DNA sequence polymorphisms that lead to 
changes in the amino acid sequences of the respective THAP-2 to THAP11 or THAP-0 protein may 
exist within a population (e.g., the human population). Such genetic polymorphism may exist 
among individuals within a population due to natural allelic variation. Such natural allelic variations 
can typically result in 1-5% variance in the nucleotide sequence of a particular THAP-2 to THAP11 
or THAP-0 gene. 
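
The degeneracy of the genetic code referred to above can be illustrated with a short sketch using the standard genetic code. The two 9-nucleotide coding sequences below are hypothetical toy sequences (not THAP sequences): they differ at two nucleotide positions yet encode the same peptide.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(coding_dna):
    """Translate a coding DNA sequence (length divisible by 3) into one-letter amino acids."""
    if len(coding_dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[coding_dna[i:i + 3]] for i in range(0, len(coding_dna), 3))


# Hypothetical toy coding sequences (illustration only).
seq_a = "ATGGCTAAA"
seq_b = "ATGGCCAAG"  # differs from seq_a at the third position of codons 2 and 3

nucleotide_differences = sum(x != y for x, y in zip(seq_a, seq_b))
print(translate(seq_a), translate(seq_b), nucleotide_differences)
```

Both toy sequences translate to the same three-residue peptide, which is the sense in which nucleic acids can differ in sequence while encoding the same protein.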

Nucleic acid molecules corresponding to natural allelic variants and homologues of the 
THAP-2 to THAP11 or THAP-0 nucleic acids of the invention can be isolated based on their 
homology to the THAP-2 to THAP11 or THAP-0 nucleic acids disclosed herein using the cDNAs 
disclosed herein, or a portion thereof, as a hybridization probe according to standard hybridization 
techniques under stringent hybridization conditions. 

Probes based on the THAP-2 to THAP11 or THAP-0 nucleotide sequences can be used to 
detect transcripts or genomic sequences encoding the same or homologous proteins. In preferred 
embodiments, the probe further comprises a label group attached thereto, e.g., the label group can 
be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be 
used as a part of a diagnostic test kit for identifying cells or tissue which misexpress a THAP-2 to 
THAP11 or THAP-0 protein, such as by measuring a level of a THAP-2 to THAP11 or 
THAP-0-encoding nucleic acid in a sample of cells from a subject, e.g., detecting THAP-2 to THAP11 or 
THAP-0 mRNA levels or determining whether a genomic THAP-2 to THAP11 or THAP-0 gene 
has been mutated or deleted. 

THAP-2 to THAP11 and THAP-0 Polypeptides 

The term "THAP-2 to THAP11 or THAP-0 polypeptides" is used herein to embrace all of 
the proteins and polypeptides of the present invention relating to THAP-2, THAP-3, THAP-4, 
THAP-5, THAP-6, THAP-7, THAP-8, THAP-9, THAP10, THAP11 and THAP-0. Also forming 
part of the invention are polypeptides encoded by the polynucleotides of the invention, as well as 
fusion polypeptides comprising such polypeptides. The invention embodies THAP-2 to THAP11 
or THAP-0 proteins from humans, including isolated or purified THAP-2 to THAP11 or THAP-0 
proteins consisting of, consisting essentially of, or comprising a sequence selected from the group 
consisting of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 and 100-114. 

The invention concerns the polypeptide encoded by a nucleotide sequence selected from the 
group consisting of SEQ ID NOs: 161-171, 172-175 and a complementary sequence thereof and a 
fragment thereof. 

The present invention embodies isolated, purified, and recombinant polypeptides 
comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, 
more preferably at least 12, 15, 20, 25, 30, 40, 50, 100, 150, 200, 300 or 500 amino acids, to the 
extent that said span is consistent with the particular SEQ ID NO, of a sequence selected from the 
group consisting of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 and 100-114. In other preferred 
embodiments the contiguous stretch of amino acids comprises the site of a mutation or functional 
mutation, including a deletion, addition, swap or truncation of the amino acids in the THAP-2 to 
THAP11 or THAP-0 protein sequence. 

One aspect of the invention pertains to isolated THAP-2 to THAP11 and THAP-0 proteins, 
and biologically active portions thereof, as well as polypeptide fragments suitable for use as 
immunogens to raise anti-THAP-2 to THAP11 or THAP-0 antibodies. In one embodiment, native 
THAP-2 to THAP11 or THAP-0 proteins can be isolated from cells or tissue sources by an 
appropriate purification scheme using standard protein purification techniques. In another 
embodiment, THAP-2 to THAP11 or THAP-0 proteins are produced by recombinant DNA 
techniques. As an alternative to recombinant expression, a THAP-2 to THAP11 or THAP-0 protein or 
polypeptide can be synthesized chemically using standard peptide synthesis techniques. 

Biologically active portions of a THAP-2 to THAP11 or THAP-0 protein include peptides 
comprising amino acid sequences sufficiently homologous to or derived from the amino acid 
sequence of the THAP-2 to THAP11 or THAP-0 protein, e.g., an amino acid sequence shown in 
SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 or 100-114, which include fewer amino acids than the 
respective full length THAP-2 to THAP11 or THAP-0 protein, and exhibit at least one activity of 
the THAP-2 to THAP11 or THAP-0 protein. The present invention also embodies isolated, 
purified, and recombinant portions or fragments of a THAP-2 to THAP11 or THAP-0 polypeptide 
comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, 
more preferably at least 12, 15, 20, 25, 30, 40, 50, 100, 150, 200, 300 or 500 amino acids, to the 
extent that said span is consistent with the particular SEQ ID NO, of a sequence selected from the 
group consisting of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 and 100-114. Also 
encompassed are THAP-2 to THAP11 or THAP-0 polypeptides which comprise between 10 and 
20, between 20 and 50, between 30 and 60, between 50 and 100, or between 100 and 200 amino 
acids of a sequence selected from the group consisting of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 
58-98 and 100-114. In other preferred embodiments the contiguous stretch of amino acids 
comprises the site of a mutation or functional mutation, including a deletion, addition, swap or 
truncation of the amino acids in the THAP-2 to THAP11 or THAP-0 protein sequence. 

A biologically active THAP-2 to THAP11 or THAP-0 protein may, for example, comprise at 
least 1, 2, 3, 5, 10, 20 or 30 amino acid changes from the sequence of SEQ ID NOs: 4-14, 17-21, 
23-40, 42-56, 58-98 or 100-114, or may encode a biologically active THAP-2 to THAP11 or 
THAP-0 protein comprising at least 1%, 2%, 3%, 5%, 8%, 10% or 15% changes in amino acids 
from the sequence of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 or 100-114. 

In a preferred embodiment, the THAP-2 protein comprises, consists essentially of, or 
consists of a THAP-2 THAP domain, preferably having the amino acid sequence of amino acid 
positions 1 to 89 shown in SEQ ID NO: 4, or fragments or variants thereof. The invention also 
concerns the polypeptide encoded by the THAP-2 nucleotide sequences of the invention, or a 
complementary sequence thereof or a fragment thereof. The present invention thus also embodies 
isolated, purified, and recombinant polypeptides comprising, consisting essentially of or consisting 
of a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 
preferably at least 12, 15, 20, 25, 30, 40, 50, 70, 80 or 89 amino acids of a sequence comprising 
amino acid positions 1 to 89 of SEQ ID NO: 4. In another aspect, a THAP-2 polypeptide may 
comprise a THAP domain wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 
60-70%, more preferably at least about 65% of the amino acid residues are identical or similar 
amino acids to the THAP domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the 
present invention are isolated, purified nucleic acids encoding a THAP-2 polypeptide comprising, 
consisting essentially of, or consisting of a THAP domain at amino acid positions 1 to 89 shown in 
SEQ ID NO: 4, or fragments or variants thereof. Preferably, said THAP-2 polypeptide comprises a 
PAR-4 binding domain and/or a DNA binding domain. 

In a preferred embodiment, the THAP-3 protein comprises, consists essentially of, or consists 
of a THAP-3 THAP domain, preferably having the amino acid sequence of amino acid positions 1 
to 89 shown in SEQ ID NO: 5, or fragments or variants thereof. The invention also concerns the 
polypeptide encoded by the THAP-3 nucleotide sequences of the invention, or a complementary 
sequence thereof or a fragment thereof. The present invention thus also embodies isolated, purified, 
and recombinant polypeptides comprising, consisting essentially of or consisting of a contiguous 
span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 
15, 20, 25, 30, 40, 50, 70, 80 or 89 amino acids of a sequence comprising amino acid positions 1 to 
89 of SEQ ID NO: 5. In another aspect, a THAP-3 polypeptide may comprise a THAP domain 
wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 60-70%, more preferably 
at least about 65% of the amino acid residues are identical or similar amino acids to the THAP 
domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the present invention are 
isolated, purified nucleic acids encoding a THAP-3 polypeptide comprising, consisting essentially 
of, or consisting of a THAP domain at amino acid positions 1 to 89 shown in SEQ ID NO: 5, or 
fragments or variants thereof. Preferably, said THAP-3 polypeptide comprises a PAR-4 binding 
domain and/or a DNA binding domain. 

In a preferred embodiment, the THAP-4 protein comprises, consists essentially of, or consists 
of a THAP-4 THAP domain, preferably having the amino acid sequence of amino acid positions 1 
to 90 shown in SEQ ID NO: 6, or fragments or variants thereof. The invention also concerns the 
polypeptide encoded by the THAP-4 nucleotide sequences of the invention, or a complementary 
sequence thereof or a fragment thereof. The present invention thus also embodies isolated, purified, 
and recombinant polypeptides comprising, consisting essentially of or consisting of a contiguous 
span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 
15, 20, 25, 30, 40, 50, 70, 80 or 90 amino acids of a sequence comprising amino acid positions 1 to 
90 of SEQ ID NO: 6. In another aspect, a THAP-4 polypeptide may comprise a THAP domain 
wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 60-70%, more preferably 
at least about 65% of the amino acid residues are identical or similar amino acids to the THAP 
domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the present invention are 
isolated, purified nucleic acids encoding a THAP-4 polypeptide comprising, consisting essentially 
of, or consisting of a THAP domain at amino acid positions 1 to 90 shown in SEQ ID NO: 6, or 
fragments or variants thereof. 

In a preferred embodiment, the THAP-5 protein comprises, consists essentially of, or consists 
of a THAP-5 THAP domain, preferably having the amino acid sequence of amino acid positions 1 
to 90 shown in SEQ ID NO: 7, or fragments or variants thereof. The invention also concerns the 
polypeptide encoded by the THAP-5 nucleotide sequences of the invention, or a complementary 
sequence thereof or a fragment thereof. The present invention thus also embodies isolated, purified, 
and recombinant polypeptides comprising, consisting essentially of or consisting of a contiguous 
span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 
15, 20, 25, 30, 40, 50, 70, 80 or 90 amino acids of a sequence comprising amino acid positions 1 to 
90 of SEQ ID NO: 7. In another aspect, a THAP-5 polypeptide may comprise a THAP domain 
wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 60-70%, more preferably 
at least about 65% of the amino acid residues are identical or similar amino acids to the THAP 
domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the present invention are 
isolated, purified nucleic acids encoding a THAP-5 polypeptide comprising, consisting essentially 
of, or consisting of a THAP domain at amino acid positions 1 to 90 shown in SEQ ID NO: 7, or 
fragments or variants thereof. 

In a preferred embodiment, the THAP-6 protein comprises, consists essentially of, or consists 
of a THAP-6 THAP domain, preferably having the amino acid sequence of amino acid positions 1 
to 90 shown in SEQ ID NO: 8, or fragments or variants thereof. The invention also concerns the 
polypeptide encoded by the THAP-6 nucleotide sequences of the invention, or a complementary 
sequence thereof or a fragment thereof. The present invention thus also embodies isolated, purified, 
and recombinant polypeptides comprising, consisting essentially of or consisting of a contiguous 
span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 
15, 20, 25, 30, 40, 50, 70, 80 or 90 amino acids of a sequence comprising amino acid positions 1 to 
90 of SEQ ID NO: 8. In another aspect, a THAP-6 polypeptide may comprise a THAP domain 
wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 60-70%, more preferably 
at least about 65% of the amino acid residues are identical or similar amino acids to the THAP 
domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the present invention are 
isolated, purified nucleic acids encoding a THAP-6 polypeptide comprising, consisting essentially 
of, or consisting of a THAP domain at amino acid positions 1 to 90 shown in SEQ ID NO: 8, or 
fragments or variants thereof. 

In a preferred embodiment, the THAP-7 protein comprises, consists essentially of, or consists 
of a THAP-7 THAP domain, preferably having the amino acid sequence of amino acid positions 1 
to 90 shown in SEQ ID NO: 9, or fragments or variants thereof. The invention also concerns the 
polypeptide encoded by the THAP-7 nucleotide sequences of the invention, or a complementary 
sequence thereof or a fragment thereof. The present invention thus also embodies isolated, purified, 
and recombinant polypeptides comprising, consisting essentially of or consisting of a contiguous 
span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 
15, 20, 25, 30, 40, 50, 70, 80 or 90 amino acids of a sequence comprising amino acid positions 1 to 
90 of SEQ ID NO: 9. In another aspect, a THAP-7 polypeptide may comprise a THAP domain 
wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 60-70%, more preferably 
at least about 65% of the amino acid residues are identical or similar amino acids to the THAP 
domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the present invention are 
isolated, purified nucleic acids encoding a THAP-7 polypeptide comprising, consisting essentially 
of, or consisting of a THAP domain at amino acid positions 1 to 90 shown in SEQ ID NO: 9, or 
fragments or variants thereof. 

In a preferred embodiment, the THAP-8 protein comprises, consists essentially of, or consists 
of a THAP-8 THAP domain, preferably having the amino acid sequence of amino acid positions 1 
to 92 shown in SEQ ID NO: 10, or fragments or variants thereof. The invention also concerns the 
polypeptide encoded by the THAP-8 nucleotide sequences of the invention, or a complementary 
sequence thereof or a fragment thereof. The present invention thus also embodies isolated, purified, 
and recombinant polypeptides comprising, consisting essentially of or consisting of a contiguous 
span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 
15, 20, 25, 30, 40, 50, 70, 80 or 90 amino acids of a sequence comprising amino acid positions 1 to 
92 of SEQ ID NO: 10. In another aspect, a THAP-8 polypeptide may comprise a THAP domain 
wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 60-70%, more preferably 
at least about 65% of the amino acid residues are identical or similar amino acids to the THAP 
domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the present invention are 
isolated, purified nucleic acids encoding a THAP-8 polypeptide comprising, consisting essentially 
of, or consisting of a THAP domain at amino acid positions 1 to 92 shown in SEQ ID NO: 10, or 
fragments or variants thereof. 

In a preferred embodiment, the THAP-9 protein comprises, consists essentially of, or 
consists of a THAP-9 THAP domain, preferably having the amino acid sequence of amino acid 
positions 1 to 92 shown in SEQ ID NO: 11, or fragments or variants thereof. The invention also 
concerns the polypeptide encoded by the THAP-9 nucleotide sequences of the invention, or a 
complementary sequence thereof or a fragment thereof. The present invention thus also embodies 
isolated, purified, and recombinant polypeptides comprising, consisting essentially of or consisting 
of a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 
preferably at least 12, 15, 20, 25, 30, 40, 50, 70, 80 or 90 amino acids of a sequence comprising 
amino acid positions 1 to 92 of SEQ ID NO: 11. In another aspect, a THAP-9 polypeptide may 
comprise a THAP domain wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 
60-70%, more preferably at least about 65% of the amino acid residues are identical or similar 
amino acids to the THAP domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the 
present invention are isolated, purified nucleic acids encoding a THAP-9 polypeptide comprising, 
consisting essentially of, or consisting of a THAP domain at amino acid positions 1 to 92 shown in 
SEQ ID NO: 11, or fragments or variants thereof. 

In a preferred embodiment, the THAP10 protein comprises, consists essentially of, or 
consists of a THAP10 THAP domain, preferably having the amino acid sequence of amino acid 
positions 1 to 90 shown in SEQ ID NO: 12, or fragments or variants thereof. The invention also 
concerns the polypeptide encoded by the THAP10 nucleotide sequences of the invention, or a 
complementary sequence thereof or a fragment thereof. The present invention thus also embodies 
isolated, purified, and recombinant polypeptides comprising, consisting essentially of or consisting 
of a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 
preferably at least 12, 15, 20, 25, 30, 40, 50, 70, 80 or 90 amino acids of a sequence comprising 
amino acid positions 1 to 90 of SEQ ID NO: 12. In another aspect, a THAP10 polypeptide may 
comprise a THAP domain wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 
60-70%, more preferably at least about 65% of the amino acid residues are identical or similar 
amino acids to the THAP domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the 
present invention are isolated, purified nucleic acids encoding a THAP10 polypeptide comprising, 
consisting essentially of, or consisting of a THAP domain at amino acid positions 1 to 90 shown in 
SEQ ID NO: 12, or fragments or variants thereof. 

In a preferred embodiment, the THAP11 protein comprises, consists essentially of, or 
consists of a THAP11 THAP domain, preferably having the amino acid sequence of amino acid 
positions 1 to 90 shown in SEQ ID NO: 13, or fragments or variants thereof. The invention also 
concerns the polypeptide encoded by the THAP11 nucleotide sequences of the invention, or a 
complementary sequence thereof or a fragment thereof. The present invention thus also embodies 
isolated, purified, and recombinant polypeptides comprising, consisting essentially of or consisting 
of a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 
preferably at least 12, 15, 20, 25, 30, 40, 50, 70, 80 or 90 amino acids of a sequence comprising 
amino acid positions 1 to 90 of SEQ ID NO: 13. In another aspect, a THAP11 polypeptide may 
comprise a THAP domain wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 
60-70%, more preferably at least about 65% of the amino acid residues are identical or similar 
amino acids to the THAP domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the 
present invention are isolated, purified nucleic acids encoding a THAP11 polypeptide comprising, 
consisting essentially of, or consisting of a THAP domain at amino acid positions 1 to 90 shown in 
SEQ ID NO: 13, or fragments or variants thereof. 

In a preferred embodiment, the THAP-0 protein comprises, consists essentially of, or 
consists of a THAP-0 THAP domain, preferably having the amino acid sequence of amino acid 
positions 1 to 90 shown in SEQ ID NO: 14, or fragments or variants thereof. The invention also 
concerns the polypeptide encoded by the THAP-0 nucleotide sequences of the invention, or a 
complementary sequence thereof or a fragment thereof. The present invention thus also embodies 
isolated, purified, and recombinant polypeptides comprising, consisting essentially of or consisting 
of a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 
preferably at least 12, 15, 20, 25, 30, 40, 50, 70, 80 or 90 amino acids of a sequence comprising 
amino acid positions 1 to 90 of SEQ ID NO: 14. In another aspect, a THAP-0 polypeptide may 
comprise a THAP domain wherein at least about 95%, 90%, 85%, 50-80%, preferably at least about 
60-70%, more preferably at least about 65% of the amino acid residues are identical or similar 
amino acids to the THAP domain consensus domain (SEQ ID NOs: 1-2). Also encompassed by the 
present invention are isolated, purified nucleic acids encoding a THAP-0 polypeptide comprising, 
consisting essentially of, or consisting of a THAP domain at amino acid positions 1 to 90 shown in 
SEQ ID NO: 14, or fragments or variants thereof. 

In other embodiments, the THAP-2 to THAP11 or THAP-0 protein is substantially 
homologous to the sequences of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 or 100-114 and 
retains the functional activity of the THAP-2 to THAP11 or THAP-0 protein, yet differs in amino 
acid sequence due to natural allelic variation or mutagenesis, as described further herein. 
Accordingly, in another embodiment, the THAP-2 to THAP11 or THAP-0 protein is a protein 
which comprises an amino acid sequence that shares more than about 60% but less than 100% 
homology with the amino acid sequence of SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 or 
100-114 and retains the functional activity of the THAP-2 to THAP11 or THAP-0 proteins of SEQ ID 
NOs: 4-14, 17-21, 23-40, 42-56, 58-98 or 100-114, respectively. Preferably, the protein is at least 
about 30%, 40%, 50%, 60%, 70%, 80%, 85%, 90%, 92%, 95%, 97%, 98%, 99% or 99.8% 
homologous to SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 or 100-114, but is not identical to 
SEQ ID NOs: 4-14, 17-21, 23-40, 42-56, 58-98 or 100-114. Preferably the THAP-2 to THAP11 or 
THAP-0 is less than identical (i.e., has less than 100% identity) to a naturally occurring THAP-2 to 
THAP11 or THAP-0. Percent homology can be determined as further detailed above. 

Assessing polypeptides, methods for obtaining variant nucleic acids and polypeptides 

It will be appreciated that by characterizing the function of THAP-family polypeptides, the 
invention further provides methods of testing the activity of, or obtaining, functional fragments and 
variants of THAP-family and THAP domain nucleotide sequences involving providing a variant or 
modified THAP-family or THAP domain nucleic acid and assessing whether a polypeptide encoded 
thereby displays a THAP-family activity of the invention. Encompassed is thus a method of 
assessing the function of a THAP-family or THAP domain polypeptide comprising: (a) providing a 
THAP family or THAP domain polypeptide, or a biologically active fragment or homologue 
thereof; and (b) testing said THAP family or THAP domain polypeptide, or a biologically active 
fragment or homologue thereof for a THAP-family activity. Any suitable format may be used, 
including cell free, cell-based and in vivo formats. For example, said assay may comprise 
expressing a THAP-family or THAP domain nucleic acid in a host cell, and observing THAP-family 
activity in said cell. In another example, a THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof is introduced to a cell, and a THAP-family 
activity is observed. THAP-family activity may be any activity as described herein, including: (1) 
mediating apoptosis or cell proliferation when expressed or introduced into a cell, most preferably 
inducing or enhancing apoptosis, and/or most preferably reducing cell proliferation; (2) mediating 
apoptosis or cell proliferation of an endothelial cell; (3) mediating apoptosis or cell proliferation of 
a hyperproliferative cell; (4) mediating apoptosis or cell proliferation of a CNS cell, preferably a 
neuronal or glial cell; or (5) an activity determined in an animal selected from the group consisting 
of mediating, preferably inhibiting angiogenesis, mediating, preferably inhibiting inflammation, 
inhibition of metastatic potential of cancerous tissue, reduction of tumor burden, increase in 
sensitivity to chemotherapy or radiotherapy, killing a cancer cell, inhibition of the growth of a 
cancer cell, or induction of tumor regression. 

In addition to naturally-occurring allelic variants of the THAP-family or THAP domain 
sequences that may exist in the population, the skilled artisan will appreciate that changes can be 
introduced by mutation into the nucleotide sequences of SEQ ID NOs: 160-171, thereby leading to 
changes in the amino acid sequence of the encoded THAP-family or THAP domain proteins, with 
or without altering the functional ability of the THAP-family or THAP domain proteins. 
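
How a single nucleotide change may or may not alter the encoded amino acid sequence can be sketched as follows, using the standard genetic code. The function name `classify_substitution`, the labels "silent", "missense" and "nonsense" (conventional terms not used elsewhere in this text), and the 9-nucleotide toy coding sequence are assumptions made for illustration only. The sketch classifies the effect on the encoded residue and says nothing about functional activity, which must be assessed experimentally.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_substitution(coding_dna, position, new_base):
    """Classify a single-base substitution at 0-based `position` in a coding sequence."""
    start = position - position % 3          # first base of the affected codon
    offset = position - start                # position of the change within that codon
    old_codon = coding_dna[start:start + 3]
    new_codon = old_codon[:offset] + new_base + old_codon[offset + 1:]
    old_aa, new_aa = CODON_TABLE[old_codon], CODON_TABLE[new_codon]
    if old_aa == new_aa:
        return "silent"      # encoded amino acid unchanged
    if new_aa == "*":
        return "nonsense"    # codon converted to a stop codon
    return "missense"        # encoded amino acid changed


# Hypothetical toy coding sequence (illustration only): ATG GCT AAA
toy_cds = "ATGGCTAAA"
print(classify_substitution(toy_cds, 5, "C"))  # GCT -> GCC
print(classify_substitution(toy_cds, 3, "C"))  # GCT -> CCT
print(classify_substitution(toy_cds, 6, "T"))  # AAA -> TAA
```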

Several types of variants are contemplated including 1) one in which one or more of the 
amino acid residues are substituted with a conserved or non-conserved amino acid residue and such 
substituted amino acid residue may or may not be one encoded by the genetic code, or 2) one in 
which one or more of the amino acid residues includes a substituent group, or 3) one in which the 
mutated THAP-family or THAP domain polypeptide is fused with another compound, such as a 
compound to increase the half-life of the polypeptide (for example, polyethylene glycol), or 4) one 
in which additional amino acids are fused to the mutated THAP-family or THAP domain 
polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
purification of the mutated THAP-family or THAP domain polypeptide or a preprotein sequence. 
Such variants are deemed to be within the scope of those skilled in the art. 

For example, nucleotide substitutions leading to amino acid substitutions can be made in 
the sequences of SEQ ID NOs: 160-175 that do not substantially change the biological activity of 
the protein. An amino acid residue can be altered from the wild-type sequence encoding a THAP 
family or THAP domain polypeptide, or a biologically active fragment or homologue thereof 
without altering the biological activity. In general, amino acid residues that are conserved among 
the THAP-family of THAP domain-containing proteins of the present invention are predicted to be 
less amenable to alteration. Furthermore, additional conserved amino acid residues may be amino 
acids that are conserved between the THAP-family proteins of the present invention. 

In one aspect, the invention pertains to nucleic acid molecules encoding THAP family or 
THAP domain polypeptides, or biologically active fragments or homologues thereof that contain 
changes in amino acid residues that are not essential for activity. Such THAP-family proteins differ 
in amino acid sequence from SEQ ID NOs: 1-114 yet retain biological activity. In one 
embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding a 
protein, wherein the protein comprises an amino acid sequence at least about 60% homologous to 
an amino acid sequence selected from the group consisting of SEQ ID NOs: 1-114. Preferably, the 
protein encoded by the nucleic acid molecule is at least about 65-70% homologous to an amino acid 
sequence selected from the group consisting of SEQ ID NOs: 1-114, more preferably sharing at 
least about 75-80% identity with an amino acid sequence selected from the group consisting of SEQ 
ID NOs: 1-114, even more preferably sharing at least about 85%, 90%, 92%, 95%, 97%, 98%, 99% 
or 99.8% identity with an amino acid sequence selected from the group consisting of SEQ ID NOs: 
1-114. 

In another aspect, the invention pertains to nucleic acid molecules encoding THAP-family 
proteins that contain changes in amino acid residues that result in increased biological activity, or a 
modified biological activity. In another aspect, the invention pertains to nucleic acid molecules 
encoding THAP-family proteins that contain changes in amino acid residues that are essential for a 
THAP-family activity. Such THAP-family proteins differ in amino acid sequence from SEQ ID 
NOs: 1-114 and display reduced or essentially lack one or more THAP-family biological activities. 
The invention also encompasses a THAP family or THAP domain polypeptide, or a biologically 
active fragment or homologue thereof which may be useful as a dominant negative mutant of a 
THAP family or THAP domain polypeptide. 

An isolated nucleic acid molecule encoding a THAP family or THAP domain polypeptide, 
or a biologically active fragment or homologue thereof homologous to a protein of any one of SEQ 
ID NOs: 1-114 can be created by introducing one or more nucleotide substitutions, additions or 
deletions into the nucleotide sequence of SEQ ID NOs: 1-114 such that one or more amino acid 
substitutions, additions or deletions are introduced into the encoded protein. Mutations can be 
introduced into any of SEQ ID NOs: 1-114 by standard techniques, such as site-directed 
mutagenesis and PCR-mediated mutagenesis. For example, conservative amino acid substitutions 
may be made at one or more predicted non-essential amino acid residues. A "conservative amino 
acid substitution" is one in which the amino acid residue is replaced with an amino acid residue 
having a similar side chain. Families of amino acid residues having similar side chains have been 
defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, 
histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., 
glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched 
side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 
phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue in a THAP 
family or THAP domain polypeptide, or a biologically active fragment or homologue thereof may 
be replaced with another amino acid residue from the same side chain family. Alternatively, in 
another embodiment, mutations can be introduced randomly along all or part of a THAP-family or 
THAP domain coding sequence, such as by saturation mutagenesis, and the resultant mutants can 
be screened for THAP-family biological activity to identify mutants that retain activity. Following 
mutagenesis of one of SEQ ID NOs: 1-114, the encoded protein can be expressed recombinantly 
and the activity of the protein can be determined. 
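
By way of non-limiting illustration, the side chain families listed above can be encoded as a lookup table and used to decide whether a proposed substitution is conservative in the sense defined herein. This is a sketch only: the dictionary and function names are hypothetical, residues are given in one-letter code, and a substitution is treated as conservative when the original and replacement residues share at least one of the listed families.

```python
# Side chain families as enumerated in the text above (one-letter codes).
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),            # lysine, arginine, histidine
    "acidic": set("DE"),            # aspartic acid, glutamic acid
    "uncharged_polar": set("GNQSTYC"),
    "nonpolar": set("AVLIPFMW"),
    "beta_branched": set("TVI"),    # threonine, valine, isoleucine
    "aromatic": set("YFWH"),        # tyrosine, phenylalanine, tryptophan, histidine
}


def shared_families(original: str, replacement: str) -> list:
    """Names of the families that contain both residues, sorted alphabetically."""
    return sorted(
        name
        for name, members in SIDE_CHAIN_FAMILIES.items()
        if original in members and replacement in members
    )


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two residues differ but belong to a common side chain family."""
    return original != replacement and bool(shared_families(original, replacement))
```

Under this sketch, lysine to arginine is conservative (both basic), whereas aspartic acid to lysine is not. Because some residues appear in more than one family, a pair such as tyrosine and phenylalanine qualifies through the aromatic family even though the two residues fall in different polarity families.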

In a preferred embodiment, a mutant THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof, encoded by a THAP family or THAP domain 
nucleic acid of the invention can be assayed for a THAP-family activity in any suitable assay, 
examples of which are provided herein. 

The invention also provides THAP-family or THAP domain chimeric or fusion proteins. As 
used herein, a THAP-family or THAP domain "chimeric protein" or "fusion protein" comprises a 
THAP-family or THAP domain polypeptide of the invention operatively linked, preferably fused in 
frame, to a non-THAP-family or non-THAP domain polypeptide. In a preferred embodiment, a 
THAP-family or THAP domain fusion protein comprises at least one biologically active portion of 
a THAP-family or THAP domain protein. In another preferred embodiment, a THAP-family fusion 
protein comprises at least two biologically active portions of a THAP-family protein. For example, 
in one embodiment, the fusion protein is a GST-THAP-family fusion protein in which the 
THAP-family sequences are fused to the C-terminus of the GST sequences. Such fusion proteins can 
facilitate the purification of recombinant THAP-family polypeptides. In another embodiment, the 
fusion protein is a THAP-family protein containing a heterologous signal sequence at its 
N-terminus, for example to allow for a desired cellular localization in a certain host cell. 

The THAP-family or THAP domain fusion proteins of the invention can be incorporated 
into pharmaceutical compositions and administered to a subject in vivo. Moreover, the 
THAP-family or THAP domain fusion proteins of the invention can be used as immunogens to produce 
anti-THAP-family or anti-THAP domain antibodies in a subject, to purify THAP-family or 
THAP domain ligands and in screening assays to identify molecules which inhibit the interaction of 
THAP-family or THAP domain with a THAP-family or THAP domain target molecule. 

Furthermore, isolated peptidyl portions of the subject THAP-family or THAP domain 
proteins can also be obtained by screening peptides recombinantly produced from the 
corresponding fragment of the nucleic acid encoding such peptides. In addition, fragments can be 
chemically synthesized using techniques known in the art such as conventional Merrifield solid 
phase f-Moc or t-Boc chemistry. For example, a THAP-family or THAP domain protein of the 
present invention may be arbitrarily divided into fragments of desired length with no overlap of the 
fragments, or preferably divided into overlapping fragments of a desired length. The fragments can 
be produced (recombinantly or by chemical synthesis) and tested to identify those peptidyl 
fragments which can function as either agonists or antagonists of a THAP-family protein activity, 
such as by microinjection assays or in vitro protein binding assays. In an illustrative embodiment, 
peptidyl portions of a THAP-family protein, such as a THAP domain or a THAP-family target 
binding region (e.g. PAR4 in the case of THAP1, THAP-2 and THAP-3), can be tested for 
THAP-family activity by expression as thioredoxin fusion proteins, each of which contains a discrete 
fragment of the THAP-family protein (see, for example, U.S. Patents 5,270,181 and 5,292,646; and 
PCT publication WO 94/02502). 
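
By way of non-limiting illustration, the division of a protein into non-overlapping or overlapping fragments of a desired length, as described above, can be sketched as follows. The function name, the `step` parameter and the toy peptide used in the usage note are hypothetical. Coordinates are reported as 1-based, inclusive residue positions, and a final fragment anchored at the C-terminus is added when the chosen step would otherwise leave the last residues uncovered.

```python
def tile_fragments(sequence: str, length: int, step: int) -> list:
    """Divide `sequence` into fragments of `length` residues.

    step == length gives non-overlapping fragments; step < length gives
    overlapping fragments. Returns (start, end, fragment) tuples using
    1-based, inclusive coordinates.
    """
    n = len(sequence)
    if not 0 < length <= n:
        raise ValueError("fragment length must be between 1 and the sequence length")
    if step <= 0:
        raise ValueError("step must be positive")
    starts = list(range(0, n - length + 1, step))
    if starts[-1] != n - length:
        starts.append(n - length)  # make sure the C-terminal residues are covered
    return [(s + 1, s + length, sequence[s:s + length]) for s in starts]
```

For a hypothetical ten-residue peptide `MKRCSAYGCK`, `tile_fragments(seq, 4, 2)` yields the overlapping fragments at positions 1-4, 3-6, 5-8 and 7-10, while `tile_fragments(seq, 5, 5)` yields the two non-overlapping halves.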

The present invention also pertains to variants of the THAP-family or THAP domain 
proteins which function as either THAP-family or THAP domain mimetics or as THAP-family or 
THAP domain inhibitors. Variants of the THAP-family or THAP domain proteins can be generated 
by mutagenesis, e.g., discrete point mutation or truncation of a THAP-family or THAP domain 
protein. An agonist of a THAP-family or THAP domain protein can retain substantially the same, 
or a subset, of the biological activities of the naturally occurring form of a THAP-family or THAP 
domain protein. An antagonist of a THAP-family or THAP domain protein can inhibit one or more 
of the activities of the naturally occurring form of the THAP-family or THAP domain protein by, 
for example, competitively inhibiting the association of a THAP-family or THAP domain protein 
with a THAP-family target molecule. Thus, specific biological effects can be elicited by treatment 
with a variant of limited function. In one embodiment, variants of a THAP-family or THAP domain 
protein which function as either THAP-family or THAP domain agonists (mimetics) or as 
THAP-family or THAP domain antagonists can be identified by screening combinatorial libraries of 
mutants, e.g., truncation mutants, of a THAP-family or THAP domain protein for THAP-family or 
THAP domain protein agonist or antagonist activity. In one embodiment, a variegated library of 
THAP-family variants is generated by combinatorial mutagenesis at the nucleic acid level and is 
encoded by a variegated gene library. A variegated library of THAP-family variants can be 
produced by, for example, enzymatically ligating a mixture of synthetic oligonucleotides into gene 
sequences such that a degenerate set of potential THAP-family sequences is expressible as 
individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage display) 
containing the set of THAP-family sequences therein. There are a variety of methods which can be 
used to produce libraries of potential THAP-family variants from a degenerate oligonucleotide 
sequence. Chemical synthesis of a degenerate gene sequence can be performed in an automatic 
DNA synthesizer, and the synthetic gene then ligated into an appropriate expression vector. Use of 
a degenerate set of genes allows for the provision, in one mixture, of all of the sequences encoding 
the desired set of potential THAP-family sequences. 

In addition, libraries of fragments of a THAP-family or THAP domain protein coding 
sequence can be used to generate a variegated population of THAP-family or THAP domain 
fragments for screening and subsequent selection of variants of a THAP-family or THAP domain 
protein. In one embodiment, a library of coding sequence fragments can be generated by treating a 
double stranded PCR fragment of a THAP-family coding sequence with a nuclease under 
conditions wherein nicking occurs only about once per molecule, denaturing the double stranded 
DNA, renaturing the DNA to form double stranded DNA which can include sense/antisense pairs 
from different nicked products, removing single stranded portions from reformed duplexes by 
treatment with S1 nuclease, and ligating the resulting fragment library into an expression vector. 
By this method, an expression library can be derived which encodes N-terminal, C-terminal and 
internal fragments of various sizes of the THAP-family protein. 

Modified THAP-family or THAP domain proteins can be used for such purposes as 
enhancing therapeutic or prophylactic efficacy, or stability (e.g., ex vivo shelf life and resistance to 
proteolytic degradation in vivo). Such modified peptides, when designed to retain at least one 
activity of the naturally occurring form of the protein, are considered functional equivalents of the 
THAP-family or THAP domain protein described in more detail herein. Such modified peptides can 
be produced, for instance, by amino acid substitution, deletion, or addition. 

Whether a change in the amino acid sequence of a peptide results in a functional 
THAP-family or THAP domain homolog (e.g. functional in the sense that it acts to mimic or antagonize 
the wild-type form) can be readily determined by assessing the ability of the variant peptide to 
produce a response in cells in a fashion similar to the wild-type THAP-family or THAP domain 
protein or competitively inhibit such a response. Peptides in which more than one replacement has 
taken place can readily be tested in the same manner. 

This invention further contemplates a method of generating sets of combinatorial mutants 
of the presently disclosed THAP-family or THAP domain proteins, as well as truncation and 
fragmentation mutants, and is especially useful for identifying potential variant sequences which 
are functional in binding to a THAP-family or THAP domain target protein but differ from a 
wild-type form of the protein by, for example, efficacy, potency and/or intracellular half-life. One 
purpose for screening such combinatorial libraries is, for example, to isolate novel THAP-family or 
THAP domain homologs which function as either an agonist or an antagonist of the biological 
activities of the wild-type protein, or alternatively, possess novel activities altogether. For 
example, mutagenesis can give rise to THAP-family homologs which have intracellular half-lives 
dramatically different than the corresponding wild-type protein. The altered protein can be 
rendered either more stable or less stable to proteolytic degradation or other cellular processes which 
result in destruction of, or otherwise inactivation of, a THAP-family protein. Such THAP-family 
homologs, and the genes which encode them, can be utilized to alter the envelope of expression for 
a particular recombinant THAP-family protein by modulating the half-life of the recombinant 
protein. For instance, a short half-life can give rise to more transient biological effects associated 
with a particular recombinant THAP-family protein and, when part of an inducible expression 
system, can allow tighter control of recombinant protein levels within a cell. As above, such 
proteins, and particularly their recombinant nucleic acid constructs, can be used in gene therapy 
protocols. 

In an illustrative embodiment of this method, the amino acid sequences for a population of 
THAP-family homologs or other related proteins are aligned, preferably to promote the highest 
homology possible. Such a population of variants can include, for example, THAP-family 
homologs from one or more species, or THAP-family homologs from the same species but which 
differ due to mutation. Amino acids which appear at each position of the aligned sequences are 
selected to create a degenerate set of combinatorial sequences. There are many ways by which the 
library of potential THAP-family homologs can be generated from a degenerate oligonucleotide 
sequence. Chemical synthesis of a degenerate gene sequence can be carried out in an automatic 
DNA synthesizer, and the synthetic genes then ligated into an appropriate gene for expression. 
The purpose of a degenerate set of genes is to provide, in one mixture, all of the sequences 
encoding the desired set of potential THAP-family sequences. The synthesis of degenerate 
oligonucleotides is well known in the art (see, for example, Narang, SA (1983) Tetrahedron 39:3; 
Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG 
Walton, Amsterdam: Elsevier pp. 273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; 
Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477). Such techniques 
have been employed in the directed evolution of other proteins (see, for example, Scott et al. (1990) 
Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 
404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Patents Nos 5,223,409, 
5,198,346, and 5,096,815). 
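
By way of non-limiting illustration, the selection of the amino acids appearing at each position of a set of aligned homologs, and the resulting degenerate set of combinatorial sequences, can be sketched as follows. The function names and the toy alignment in the usage note are hypothetical. The sketch assumes a gap-free alignment of equal-length sequences and operates at the protein level only; it does not design the degenerate oligonucleotides themselves.

```python
from itertools import product
from math import prod


def position_sets(aligned: list) -> list:
    """For each alignment column, the sorted set of residues observed there."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    return [sorted({s[i] for s in aligned}) for i in range(len(aligned[0]))]


def library_size(sets: list) -> int:
    """Number of distinct sequences encoded by the degenerate set."""
    return prod(len(s) for s in sets)


def enumerate_library(sets: list) -> list:
    """Every combination of one observed residue per position."""
    return ["".join(combo) for combo in product(*sets)]
```

For the hypothetical alignment `["MKRC", "MRRC", "MKHC"]`, two residues are observed at each of positions 2 and 3, so the degenerate set encodes four sequences, including `MRHC`, which is not present among the input homologs. Because library size is the product of the per-position counts, it grows rapidly with the number of variable positions; computing `library_size` before enumerating may therefore be prudent.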

Alternatively, other forms of mutagenesis can be utilized to generate a combinatorial 
library, particularly where no other naturally occurring homologs have yet been sequenced. For 
example, THAP-family homologs (both agonist and antagonist forms) can be generated and 
isolated from a library by screening using, for example, alanine scanning mutagenesis and the like 
(Ruf et al. (1994) Biochemistry 33:1565-1572; Wang et al. (1994) J Biol. Chem. 269:3095-3099; 
Balint et al. (1993) Gene 137:109-118; Grodberg et al. (1993) Eur. J Biochem. 218:597-601; 
Nagashima et al. (1993) J Biol. Chem. 268:2888-2892; Lowman et al. (1991) Biochemistry 
30:10832-10838; and Cunningham et al. (1989) Science 244:1081-1085), by linker scanning 
mutagenesis (Gustin et al. (1993) Virology 193:653-660; Brown et al. (1992) Mol. Cell Biol. 
12:2644-2652; McKnight et al. (1982) Science 232:316); by saturation mutagenesis (Meyers et al. 
(1986) Science 232:613); by PCR mutagenesis (Leung et al. (1989) Method Cell Mol Biol 1: 1-19); 
or by random mutagenesis (Miller et al. (1992) A Short Course in Bacterial Genetics, CSHL Press, 
Cold Spring Harbor, NY; and Greener et al. (1994) Strategies in Mol Biol 7:32-34). 
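
By way of non-limiting illustration, the set of single-residue variants used in alanine scanning mutagenesis can be enumerated as follows. The function name and the variant labels (wild-type residue, 1-based position, "A") are conventions assumed for this example; positions that are already alanine are skipped because substituting them would not change the sequence.

```python
def alanine_scan(sequence: str) -> list:
    """Return (label, variant) pairs, one per non-alanine position.

    Each variant differs from `sequence` at exactly one position, where the
    wild-type residue has been replaced by alanine ('A').
    """
    variants = []
    for i, residue in enumerate(sequence):
        if residue == "A":
            continue  # already alanine; nothing to substitute
        label = f"{residue}{i + 1}A"
        variants.append((label, sequence[:i] + "A" + sequence[i + 1:]))
    return variants
```

Each variant can then be expressed and assayed; a loss of binding or activity for a given variant suggests that the substituted residue contributes to that function.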

A wide range of techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations, as well as for screening cDNA libraries for gene 
products having a certain property. Such techniques will be generally adaptable for rapid screening 
of the gene libraries generated by the combinatorial mutagenesis of THAP-family proteins. The 
most widely used techniques for screening large gene libraries typically comprise cloning the gene 
library into replicable expression vectors, transforming appropriate cells with the resulting library 
of vectors, and expressing the combinatorial genes under conditions in which detection of a desired 
activity facilitates relatively easy isolation of the vector encoding the gene whose product was 
detected. 

Each of the illustrative assays described below is amenable to high throughput analysis as 
necessary to screen large numbers of degenerate THAP-family or THAP domain sequences created 
by combinatorial mutagenesis techniques. In one screening assay, the candidate gene products are 
displayed on the surface of a cell or viral particle, and the ability of particular cells or viral particles 
to bind a THAP-family target molecule (protein or DNA) via this gene product is detected in a 
"panning assay". For instance, the gene library can be cloned into the gene for a surface membrane 
protein of a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO 
88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 
18:136-140). In a similar fashion, fluorescently labeled THAP-family target can be used to score for 
potentially functional THAP-family homologs. Cells can be visually inspected and separated under 
a fluorescence microscope, or, where the morphology of the cell permits, separated by a 
fluorescence-activated cell sorter. 

In an alternate embodiment, the gene library is expressed as a fusion protein on the surface 
of a viral particle. For instance, in the filamentous phage system, foreign peptide sequences can be 
expressed on the surface of infectious phage, thereby conferring two significant benefits. First, 
since these phage can be applied to affinity matrices at very high concentrations, a large number of 
phage can be screened at one time. Second, since each infectious phage displays the combinatorial 
gene product on its surface, if a particular phage is recovered from an affinity matrix in low yield, 
the phage can be amplified by another round of infection. The group of almost identical E. coli 
filamentous phages M13, fd, and f1 are most often used in phage display libraries, as either of the 
phage gIII or gVIII coat proteins can be used to generate fusion proteins without disrupting the 
ultimate packaging of the viral particle (Ladner et al. PCT publication WO 90/02909; Garrard et al., 
PCT publication WO 92/09690; Marks et al. (1992) J Biol. Chem. 267:16007-16010; Griffiths et al. 
(1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) 
PNAS 89:4457-4461). In an illustrative embodiment, the recombinant phage antibody system 
(RPAS, Pharmacia Catalog number 27-9400-01) can be easily modified for use in expressing 
THAP-family combinatorial libraries, and the THAP-family phage library can be panned on 
immobilized THAP family target molecule (glutathione immobilized THAP-family target-GST 
fusion proteins or immobilized DNA). Successive rounds of phage amplification and panning can 
greatly enrich for THAP-family homologs which retain an ability to bind a THAP-family target and 
which can subsequently be screened further for biological activities in automated assays, in order to 
distinguish between agonists and antagonists. 

The invention also provides for identification and reduction to functional minimal size of 
the THAP-family domains, particularly a THAP domain of the subject THAP-family to generate 
mimetics, e.g. peptide or non-peptide agents, which are able to disrupt binding of a polypeptide of 
the present invention with a THAP-family target molecule (protein or DNA). Thus, such mutagenic 
techniques as described above are also useful to map the determinants of THAP-family proteins 
which participate in protein-protein or protein-DNA interactions involved in, for example, binding 
to a THAP-family or THAP domain target protein or DNA. To illustrate, the critical residues of a 
THAP-family protein which are involved in molecular recognition of the THAP-family target can 
be determined and used to generate THAP-family target- 13P-derived peptidomimetics that 
competitively inhibit binding of the THAP-family protein to the THAP-family target. By 
employing, for example, scanning mutagenesis to map the amino acid residues of a particular 
THAP-family protein involved in binding a THAP-family target, peptidomimetic compounds can 
be generated which mimic those residues in binding to a THAP-family target, and which, by 
inhibiting binding of the THAP-family protein to the THAP-family target molecule, can interfere 
with the function of a THAP-family protein in transcriptional regulation of one or more genes. For 
instance, nonhydrolyzable peptide analogs of such residues can be generated using retro-inverse 
peptides (e.g., see U.S. Patents 5,116,947 and 5,219,089; and Pallai et al. (1983) Int J Pept Protein 
Res 21:84-92), benzodiazepine (e.g., see Freidinger et al. in Peptides: Chemistry and Biology, G.R. 
Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffman et al. in 
Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 
1988), substituted gamma lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. 
Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides 
(Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and Function 
(Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 1985), 
β-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem 
Soc Perkin Trans 1:1231), and β-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res 
Commun 126:419; and Dann et al. (1986) Biochem Biophys Res Commun 134:71). 

An isolated THAP-family or THAP domain protein, or a portion or fragment thereof, can 
be used as an immunogen to generate antibodies that bind THAP-family or THAP domain proteins 
using standard techniques for polyclonal and monoclonal antibody preparation. A full-length 
THAP-family protein can be used or, alternatively, the invention provides antigenic peptide 
fragments of THAP-family or THAP domain proteins for use as immunogens. Any fragment of the 
THAP-family or THAP domain protein which contains at least one antigenic determinant may be 
used to generate antibodies. The antigenic peptide of a THAP-family or THAP domain protein 
comprises at least 8 amino acid residues of an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 1-114 and encompasses an epitope of a THAP-family or THAP 
domain protein such that an antibody raised against the peptide forms a specific immune complex 
with a THAP-family or THAP domain protein. Preferably, the antigenic peptide comprises at least 
10 amino acid residues, more preferably at least 15 amino acid residues, even more preferably at 
least 20 amino acid residues, and most preferably at least 30 amino acid residues. 

Preferred epitopes encompassed by the antigenic peptide are regions of a THAP-family or 
THAP domain protein that are located on the surface of the protein, e.g., hydrophilic regions. 
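
By way of non-limiting illustration, candidate hydrophilic regions can be located by scanning a sequence with a sliding window and averaging a per-residue hydropathy value. The present description does not prescribe any particular scale or algorithm; the sketch below assumes the Kyte-Doolittle hydropathy scale (in which more negative values are more hydrophilic) and a default window of 8 residues, matching the minimum antigenic peptide length recited above. The function name is hypothetical, and a low hydropathy score is only a rough proxy for surface exposure.

```python
# Kyte-Doolittle hydropathy values; more negative means more hydrophilic.
KYTE_DOOLITTLE = {
    "I": 4.5, "V": 4.2, "L": 3.8, "F": 2.8, "C": 2.5,
    "M": 1.9, "A": 1.8, "G": -0.4, "T": -0.7, "S": -0.8,
    "W": -0.9, "Y": -1.3, "P": -1.6, "H": -3.2, "E": -3.5,
    "Q": -3.5, "D": -3.5, "N": -3.5, "K": -3.9, "R": -4.5,
}


def most_hydrophilic_window(sequence: str, window: int = 8) -> tuple:
    """Return (mean_score, start, segment) for the most hydrophilic window.

    `start` is a 1-based residue position. Ties are resolved in favour of
    the window closest to the N-terminus.
    """
    if not 0 < window <= len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    best = None
    for start in range(len(sequence) - window + 1):
        segment = sequence[start:start + window]
        score = sum(KYTE_DOOLITTLE[r] for r in segment) / window
        if best is None or score < best[0]:
            best = (score, start + 1, segment)
    return best
```

Applied to a hypothetical sequence such as `LLLLKDEKLLLL` with a window of 4, the sketch selects the charged segment `KDEK` beginning at position 5.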

A THAP-family or THAP domain protein immunogen typically is used to prepare 
antibodies by immunizing a suitable subject (e.g., rabbit, goat, mouse or other mammal) with the 
immunogen. An appropriate immunogenic preparation can contain, for example, recombinantly 
expressed THAP-family or THAP domain protein or a chemically synthesized THAP-family or 
THAP domain polypeptide. The preparation can further include an adjuvant, such as Freund's 
complete or incomplete adjuvant, or similar immunostimulatory agent. Immunization of a suitable 
subject with an immunogenic THAP-family or THAP domain protein preparation induces a 
polyclonal anti-THAP-family or THAP domain protein antibody response. 

The invention concerns antibody compositions, either polyclonal or monoclonal, capable of 
selectively binding, or which selectively bind, to an epitope-containing polypeptide comprising a 
contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at 
least 12, 15, 20, 25, 30, 40, 50, 100, or more than 100 amino acids of an amino acid sequence 
selected from the group consisting of amino acid positions 1 to approximately 90 of SEQ ID NOs: 
1-114. The invention also concerns a purified or isolated antibody capable of specifically binding to 
a mutated THAP-family or THAP domain protein or to a fragment or variant thereof comprising an 
epitope of the mutated THAP-family or THAP domain protein. 

Oligomeric Forms of THAP1 

Certain embodiments of the present invention encompass THAP1 polypeptides in the form 
of oligomers, such as dimers, trimers, or higher oligomers. Oligomers may be formed by disulfide 
bonds between cysteine residues on different THAP1 polypeptides, for example. In other 
embodiments, oligomers comprise from two to four THAP1 polypeptides joined by covalent or 
non-covalent interactions between peptide moieties fused to the THAP1 polypeptides. Such peptide 
moieties may be peptide linkers (spacers), or peptides that have the property of promoting 
oligomerization. Leucine zippers and certain polypeptides derived from antibodies are among the 
peptides that can promote oligomerization of THAP1 polypeptides attached thereto. DNA 
sequences encoding THAP1 oligomers, or fusion proteins that are components of such oligomers, 
are provided herein. 

In one embodiment of the invention, oligomeric THAP1 may comprise two or more 
THAP1 polypeptides joined through peptide linkers. Examples include those peptide linkers 
described in U.S. Patent No. 5,073,627. Fusion proteins comprising multiple THAP1 polypeptides 
separated by peptide linkers may be produced using conventional recombinant DNA technology. 

Another method for preparing THAP1 oligomers involves use of a leucine zipper. Leucine 
zipper domains are peptides that promote oligomerization of the proteins in which they are found. 
Leucine zippers were originally identified in several DNA-binding proteins (Landschulz et al., 
Science 240:1759, 1988), and have since been found in a variety of different proteins. Among the 
known leucine zippers are naturally occurring peptides and derivatives thereof that dimerize or 
trimerize. Examples of leucine zipper domains suitable for producing THAP1 oligomers are those 
described in International Publication WO 94/10308. Recombinant fusion proteins comprising a 
THAP1 polypeptide fused to a peptide that dimerizes or trimerizes in solution are expressed in 
suitable host cells, and the resulting soluble oligomeric THAP1 is recovered from the culture 
supernatant. 

In some embodiments of the invention, a THAP1 or a THAP-family member dimer is 
created by fusing THAP1 or a THAP-family member to an Fc region polypeptide derived from an 
antibody, in a manner that does not substantially affect the binding of THAP1 or a THAP-family 
member to a chemokine, such as SLC/CCL21. Preparation of fusion proteins comprising 
heterologous polypeptides fused to various portions of antibody-derived polypeptides (including Fc 
region) has been described, e.g., by Ashkenazi et al. (1991) PNAS 88:10535, Byrn et al. (1990) 
Nature 344:667, and Hollenbaugh and Aruffo "Construction of Immunoglobulin Fusion Proteins", 
in Current Protocols in Immunology, Supp. 4, pages 10.19.1 - 10.19.11, 1992. The 
THAP-family/Fc fusion proteins are allowed to assemble much like antibody molecules, whereupon 
interchain disulfide bonds form between Fc polypeptides, yielding divalent THAP. Similar fusion 
proteins of TNF receptors and Fc (see for example Moreland et al. (1997) N. Engl. J. Med. 
337(3):141-147; van der Poll et al. (1997) Blood 89(10):3727-3734; and Ammann et al. (1997) J. 
Clin. Invest. 99(7):1699-1703) have been used successfully for treating rheumatoid arthritis. 
Soluble derivatives have also been made of cell surface glycoproteins in the immunoglobulin gene 
superfamily consisting of an extracellular domain of the cell surface glycoprotein fused to an 
immunoglobulin constant (Fc) region (see e.g., Capon, D. J. et al. (1989) Nature 337:525-531 and 
Capon U.S. Patent Nos. 5,116,964 and 5,428,130 [CD4-IgG1 constructs]; Linsley, P. S. et al. 
(1991) J. Exp. Med. 173:721-730 [a CD28-IgG1 construct and a B7-1-IgG1 construct]; and Linsley, 
P. S. et al. (1991) J. Exp. Med. 174:561-569 and U.S. Patent No. 5,434,131 [a CTLA4-IgG1]). Such 
fusion proteins have proven useful for modulating receptor-ligand interactions. 

Some embodiments relate to THAP-immunoglobulin fusion proteins and THAP 
chemokine-binding domain fusions with immunoglobulin molecules or fragments thereof. Such 
fusions can be produced using standard methods, for example, by creating an expression vector 
encoding the SLC/CCL21 chemokine-binding protein THAP1 fused to the antibody polypeptide 
and inserting the vector into a suitable host cell. One suitable Fc polypeptide is the native Fc region 
polypeptide derived from a human IgG1, which is described in International Publication WO 
93/10151. Another useful Fc polypeptide is the Fc mutein described in U.S. Patent No. 5,457,035. 
The amino acid sequence of the mutein is identical to that of the native Fc sequence presented in 
International Publication WO 93/10151, except that amino acid 19 has been changed from Leu to 
Ala, amino acid 20 has been changed from Leu to Glu, and amino acid 22 has been changed from 
Gly to Ala. This mutein Fc exhibits reduced affinity for immunoglobulin receptors. 
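
By way of non-limiting illustration, the three substitutions that distinguish the Fc mutein from the native Fc sequence (Leu19 to Ala, Leu20 to Glu, Gly22 to Ala) can be applied to a sequence in silico as sketched below. The function and constant names are hypothetical. The native Fc sequence of WO 93/10151 is not reproduced here, so the placeholder sequence in the usage note is an invented stand-in that merely carries the stated wild-type residues at positions 19, 20 and 22. Positions are 1-based, and each substitution is applied only after confirming the expected wild-type residue.

```python
# (position, wild-type residue, replacement) as stated in the text, one-letter codes.
FC_MUTEIN_CHANGES = [(19, "L", "A"), (20, "L", "E"), (22, "G", "A")]


def apply_substitutions(sequence: str, substitutions: list) -> str:
    """Apply point substitutions, verifying each expected wild-type residue."""
    residues = list(sequence)
    for position, wild_type, replacement in substitutions:
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        found = residues[position - 1]
        if found != wild_type:
            raise ValueError(
                f"expected {wild_type} at position {position}, found {found}"
            )
        residues[position - 1] = replacement
    return "".join(residues)
```

For instance, with the placeholder `"S" * 18 + "LLGG" + "PS"` (not the real Fc sequence), the call `apply_substitutions(placeholder, FC_MUTEIN_CHANGES)` returns `"S" * 18 + "AEGA" + "PS"`. The wild-type check guards against off-by-one numbering errors when a different reference numbering is in use.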

SLC/chemokine-binding fragments of human THAP1 or THAP-family polypeptides, rather 
than the full protein, can also be employed in methods of the invention. Fragments may be less 
immunogenic than the corresponding full-length proteins. The ability of a fragment to bind 
chemokines, such as SLC, can be determined using a standard assay. Fragments can be prepared by 
any of a number of conventional methods. For example, a desired DNA sequence can be 
synthesized chemically or produced by restriction endonuclease digestion of a full length cloned 
DNA sequence and isolated by electrophoresis on agarose gels. Linkers containing restriction 
endonuclease cleavage sites can be employed to insert the desired DNA fragment into an expression 
vector, or the fragment can be digested at naturally-present cleavage sites. The polymerase chain 
reaction (PCR) can also be employed to isolate a DNA sequence encoding a desired protein 
fragment. Oligonucleotides that define the termini of the desired fragment are used as 5' and 3' 
primers in the PCR procedure. Additionally, known mutagenesis techniques can be used to insert a 
stop codon at a desired point, e.g., immediately downstream of the codon for the last amino acid of 
the desired fragment. 
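
By way of non-limiting illustration, the oligonucleotides that define the termini of a desired fragment, together with a stop codon placed immediately downstream of the codon for the last amino acid, can be derived from a coding sequence as sketched below. The function names, the default primer length, the choice of TAA as the stop codon and the toy coding sequence in the usage note are assumptions for this example. The sketch ignores practical primer design considerations such as melting temperature, added restriction sites, and an initiating ATG for internal fragments.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of an uppercase A/C/G/T sequence."""
    return dna.translate(_COMPLEMENT)[::-1]


def fragment_primers(cds: str, first_aa: int, last_aa: int,
                     primer_len: int = 18, stop_codon: str = "TAA") -> tuple:
    """Return (forward, reverse) primers, 5' to 3', for residues first_aa..last_aa.

    Residue positions are 1-based and inclusive; codon 1 is cds[0:3].
    The reverse primer encodes a stop codon directly after the last codon.
    """
    start = (first_aa - 1) * 3
    end = last_aa * 3
    if first_aa < 1 or end > len(cds) or start >= end:
        raise ValueError("residue range is outside the coding sequence")
    fragment = cds[start:end]
    if len(fragment) < primer_len:
        raise ValueError("fragment is shorter than the requested primer length")
    forward = fragment[:primer_len]
    reverse = reverse_complement(fragment[-primer_len:] + stop_codon)
    return forward, reverse
```

With the hypothetical 30-nucleotide coding sequence `ATGAAACGTTGCTCTGCATATGGTTGTAAA`, the call `fragment_primers(cds, 2, 9, primer_len=9)` returns the forward primer `AAACGTTGC` and the reverse primer `TTAACAACCATA`, the latter beginning with the reverse complement of the TAA stop codon.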

In other embodiments, a THAP-family polypeptide or a biologically active fragment 
thereof, for example, an SLC-binding domain of THAP1, may be substituted for the variable portion 
of an antibody heavy or light chain. If fusion proteins are made with both heavy and light chains of 
an antibody, it is possible to form a THAP-family polypeptide oligomer with at least two, at least 
three, at least four, at least five, at least six, at least seven, at least eight, at least nine, or more than 
nine THAP-family polypeptides. 

In some embodiments of the present invention, THAP-chemokine binding can be provided 
to decrease the biological availability of a chemokine or otherwise disrupt the activity of a 
chemokine. For example, THAP-family polypeptides, SLC-binding domains of THAP-family 
polypeptides, THAP oligomers, and SLC-binding domain-THAP1-immunoglobulin fusion proteins 
of the invention can be used to interact with SLC thereby preventing it from performing its normal 
biological role. In some embodiments, the entire THAP1 polypeptide (SEQ ID NO: 3) can be used 
to bind to SLC. In other embodiments, fragments of THAP1, such as the SLC-binding domain of 
THAP1 (amino acids 143-213 of SEQ ID NO: 3), can be used to bind to SLC. Such fragments can 
be from at least 8, at least 10, at least 12, at least 15, at least 20, at least 25, at least 30, at least 35, at 
least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 80, at least 90, 
at least 100, at least 110, at least 120, at least 130, at least 140, at least 150, at least 160, at least 
170, at least 180, at least 190, at least 200, at least 210 or at least 213 consecutive amino acids of 
SEQ ID NO: 3. In some embodiments, fragments can be from at least 8, at least 10, at least 12, at 
least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, 
at least 60, at least 65 or at least 70 consecutive amino acids of amino acids 143-213 of SEQ ID 
NO: 3. THAP-family polypeptides that may be capable of binding SLC, for example THAP2-11 
and THAP0, or biologically active fragments thereof can also be used to bind to SLC so as to 
decrease its biological availability or otherwise disrupt the activity of this chemokine. 

In some embodiments, a plurality of THAP-family proteins, such as a fusion of two or more 
THAP1 proteins or fragments thereof which comprise an SLC-binding domain (amino acids 143- 
213 of SEQ ID NO: 3) can be used to bind SLC. For example, oligomers comprising THAP1 
fragments of a size of at least 8, at least 10, at least 12, at least 15, at least 20, at least 25, at least 30, 
at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65 or at least 70 
consecutive amino acids of SEQ ID NO: 3 (amino acids 143-213) can be generated. Amino acid 
fragments which make up the THAP oligomer may be of the same or different lengths. In some 
embodiments, the entire THAP1 protein or biologically active portions thereof may be fused 
together to form an oligomer capable of binding to SLC. THAP-family polypeptides that may be 
capable of binding SLC, for example THAP2-11 and THAP0, the THAP-family polypeptides of 
SEQ ID NOs: 1-114 or biologically active fragments thereof can also be used to create oligomers 
which bind to SLC so as to decrease its biological availability or otherwise disrupt the activity of 
this chemokine. 

According to another embodiment of the present invention, THAP-family proteins, such as 
THAP1 or a portion of THAP1 which comprises an SLC-binding domain (amino acids 143-213 of 
SEQ ID NO: 3), may be fused to an immunoglobulin or portion thereof. The fusion partner may be an 
entire immunoglobulin, such as IgG, IgM, IgA or IgE. Additionally, portions of immunoglobulins, 
such as an Fc domain of the immunoglobulin, can be fused to a THAP-family polypeptide, such as 
THAP1, fragments thereof or oligomers thereof. Fragments of THAP1 can be, for example, at 
least 8, at least 10, at least 12, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, 
at least 45, at least 50, at least 55, at least 60, at least 65 or at least 70 consecutive amino acids of 
SEQ ID NO: 3 (amino acids 143-213). In some embodiments, THAP-family polypeptides that may 
be capable of binding SLC, for example THAP2-11 and THAP0, the THAP-family polypeptides of 
SEQ ID NOs: 1-114 or biologically active fragments thereof can also be used to form 
immunoglobulin fusions that bind to SLC so as to decrease its biological availability or otherwise 
disrupt the activity of this chemokine. 

Some aspects of the present invention relate to THAP-family polypeptides, chemokine- 
binding domains of THAP-family polypeptides, THAP oligomers, and chemokine-binding domain- 
THAP-immunoglobulin fusion proteins such as those described above which bind to chemokines 
other than SLC. For example, THAP-family polypeptides, chemokine-binding domains of THAP- 
family polypeptides, THAP oligomers, and chemokine-binding domain-THAP-immunoglobulin 
fusion proteins can be used to bind to or otherwise interact with chemokines from many families 
such as C chemokines, CC chemokines, C-X-C chemokines, C-X3-C chemokines, XC chemokines 
or CCK chemokines. In particular, THAP-family polypeptides, chemokine-binding domains of 
THAP-family polypeptides, THAP oligomers, and chemokine-binding domain-THAP- 
immunoglobulin fusion proteins may interact with chemokines such as XCL1, XCL2, CCL1, 
CCL2, CCL3, CCL3L1, SCYA3L2, CCL4, CCL4L, CCL5, CCL6, CCL7, CCL8, SCYA9, 
SCYA10, CCL11, SCYA12, CCL13, CCL14, CCL15, CCL16, CCL17, CCL18, CCL19, CCL20, 
CCL21, CCL22, CCL23, CCL24, CCL25, CCL26, CCL27, CCL28, clone 391, CARP CC-1, 
CCL1, CK-1, regakine-1, K203, CXCL1, CXCL1P, CXCL2, CXCL3, PF4, PF4V1, CXCL5, 
CXCL6, PPBP, SPBPBP, IL8, CXCL9, CXCL10, CXCL11, CXCL12, CXCL14, CXCL15, 
CXCL16, NAP-4, LFCA-1, Scyba, JSC, VHSV-induced protein, CX3CL1 and fCLl. 

In some embodiments of the present invention, THAP-family polypeptides, chemokine- 
binding domains of THAP-family polypeptides, THAP oligomers, and chemokine-binding domain- 
THAP-immunoglobulin fusion proteins can bind to a chemokine extracellularly. For example, the 
THAP1 polypeptide, a biologically active fragment thereof (such as the SLC-binding domain of 
THAP1 (amino acids 143-213 of SEQ ID NO: 3)), an oligomer thereof, or an immunoglobulin 
fusion thereof can bind to a chemokine extracellularly. In other examples, chemokine-binding 
domains of other THAP-family members such as THAP2, THAP3, THAP4, THAP5, THAP6, 
THAP7, THAP8, THAP9, THAP10, THAP11 or THAP0, biologically active fragments thereof, 
oligomers thereof, or immunoglobulin fusions thereof can be used to bind to chemokines 
extracellularly. Binding of the THAP-family polypeptides, chemokine-binding domains of THAP- 
family polypeptides, THAP oligomers, and chemokine-binding domain-THAP-immunoglobulin 
fusion proteins may either decrease or increase the affinity of the chemokine for its extracellular 
receptor. In cases where binding of the chemokine to its extracellular receptor is inhibited, the 
normal biological effect of the chemokine can be inhibited. Such inhibition can prevent the 
occurrence of chemokine-mediated cellular responses, such as the modulation of cell proliferation, 
the modulation of angiogenesis, the modulation of an inflammation response, the modulation of 
apoptosis, and the modulation of cell differentiation. In some embodiments, inhibition of the binding of 
a chemokine to its extracellular receptor can result in transcriptional modulation. Alternatively, in 
cases where binding of the chemokine to its extracellular receptor is activated, the normal 
biological effect of the chemokine can be enhanced. Such enhancement can increase the occurrence 
of chemokine-mediated cellular responses, such as the modulation of cell proliferation, the 
modulation of angiogenesis, the modulation of an inflammation response, the modulation of 
apoptosis, and the modulation of cell differentiation. In some embodiments, enhancement of the 
binding of a chemokine to its extracellular receptor can result in transcriptional modulation. 

In some embodiments of the present invention, THAP-family polypeptides, chemokine- 
binding domains of THAP-family polypeptides, THAP oligomers, and chemokine-binding domain- 
THAP-immunoglobulin fusion proteins can bind to a chemokine intracellularly. In some 
embodiments, the THAP-family protein acts as a nuclear receptor for the chemokine. For example, 
the THAP1 polypeptide, a biologically active fragment thereof (such as the SLC-binding domain of 
THAP1 (amino acids 143-213 of SEQ ID NO: 3)), an oligomer thereof, or an immunoglobulin 
fusion thereof can bind to a chemokine intracellularly. In other examples, chemokine-binding 
domains of other THAP-family members such as THAP2, THAP3, THAP4, THAP5, THAP6, 
THAP7, THAP8, THAP9, THAP10, THAP11 or THAP0, biologically active fragments thereof, 
oligomers thereof, or immunoglobulin fusions thereof can be used to bind to chemokines 
intracellularly. Binding of the THAP-family polypeptides, chemokine-binding domains of THAP- 
family polypeptides, THAP oligomers, and chemokine-binding domain-THAP-immunoglobulin 
fusion proteins may either decrease or increase the affinity of the chemokine for its intracellular 
receptor. In other embodiments, the THAP-family polypeptides, chemokine-binding domains of 
THAP-family polypeptides, THAP oligomers, and chemokine-binding domain-THAP- 
immunoglobulin fusion proteins are the intracellular receptor for the chemokine. In cases where 
binding of the chemokine to its intracellular receptor is inhibited, the normal biological effect of the 
chemokine can be inhibited. Such inhibition can prevent the occurrence of chemokine-mediated 
cellular responses, such as the modulation of cell proliferation, the modulation of angiogenesis, the 
modulation of an inflammation response, the modulation of apoptosis, and the modulation of cell 
differentiation. In some embodiments, inhibition of the binding of a chemokine to its intracellular 
receptor can result in transcriptional modulation. Alternatively, in cases where binding of the 
chemokine to its intracellular receptor is activated, the normal biological effect of the chemokine 
can be enhanced. Such enhancement can increase the occurrence of chemokine-mediated cellular 
responses, such as the modulation of cell proliferation, the modulation of angiogenesis, the 
modulation of an inflammation response, the modulation of apoptosis, and the modulation of cell 
differentiation. In some embodiments, enhancement of the binding of a chemokine to its 
intracellular receptor can result in transcriptional modulation. 

In accordance with another aspect of the invention, THAP-family polypeptides, chemokine- 
binding domains of THAP-family polypeptides, THAP oligomers, and chemokine-binding domain- 
THAP-immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions. Such pharmaceutical compositions can be used to decrease or increase the 
bioavailability and functionality of a chemokine. For example, THAP-family polypeptides, SLC- 
binding domains of THAP-family polypeptides, THAP oligomers, and SLC-binding domain- 
THAP1-immunoglobulin fusion proteins of the present invention can be administered to a subject 
to inhibit an interaction between SLC and its receptor, such as CCR7, on the surface of cells, to 
thereby suppress SLC-mediated responses. The inhibition of chemokine SLC may be useful 
therapeutically for both the treatment of inflammatory or proliferative disorders, as well as 
modulating (e.g., promoting or inhibiting) cell differentiation, cell proliferation, and/or cell death. 

In an additional embodiment of the present invention, the THAP-family polypeptides, 
chemokine-binding domains of THAP-family polypeptides, THAP oligomers, and chemokine- 
binding domain-THAP-immunoglobulin fusion proteins of the present invention can be used to 
detect the presence of a chemokine in a biological sample and in screening assays to identify 
molecules which inhibit the interaction of a THAP-family polypeptide with a chemokine. For 
example, the THAP-family polypeptides, SLC-binding domains of THAP-family polypeptides, 
THAP oligomers, and SLC-binding domain-THAP1-immunoglobulin fusion proteins of the present 
invention can be used to detect the presence of SLC in a biological sample and in screening assays 
to identify molecules which inhibit the interaction of a THAP-family polypeptide with SLC. Such 
screening assays are similar to those described below for PAR4-THAP interactions. 

Certain aspects of the present invention relate to a method of identifying a test compound 
that modulates THAP-mediated activities. In some cases the THAP-mediated activity is SLC- 
binding. Test compounds which affect THAP-SLC binding can be identified using a screening 
method wherein a THAP-family polypeptide or a biologically active fragment thereof is contacted 
with a test compound. In some embodiments, the THAP-family polypeptide comprises an amino 
acid sequence having at least 30% amino acid identity to an amino acid sequence of SEQ ID NO: 1 
or SEQ ID NO: 2. Whether the test compound modulates the binding of SLC with a THAP-family 
polypeptide, such as THAP1 (SEQ ID NO: 3), is determined by determining whether the test 
compound modulates the activity of the THAP-family polypeptide or biologically active fragment 
thereof. Biologically active fragments of a THAP-family polypeptide may be at least 5, at least 8, at 
least 10, at least 12, at least 15, at least 18, at least 20, at least 25, at least 30, at least 35, at least 40, 
at least 45, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 110, at 
least 120, at least 130, at least 140, at least 150, at least 160, at least 170, at least 180, at least 190, 
at least 200, at least 210, at least 220 or more than 220 amino acids in length. A 
determination that the test compound modulates the activity of said polypeptide indicates that the 
test compound is a candidate modulator of THAP-mediated activities. 

Although THAP-family polypeptides, chemokine-binding domains of THAP-family 
polypeptides, THAP oligomers, and chemokine-binding domain-THAP-immunoglobulin fusion 
proteins can be used for the above-mentioned chemokine interactions, it will be appreciated that 
homologs of THAP-family polypeptides, chemokine-binding domains of THAP-family 
polypeptides, THAP oligomers, and chemokine-binding domain-THAP-immunoglobulin fusion 
proteins can be used in place of THAP-family polypeptides, chemokine-binding domains of THAP- 
family polypeptides, THAP oligomers, and chemokine-binding domain-THAP-immunoglobulin 
fusion proteins. For example, homologs having at least about 30-40% identity, preferably at least 
about 40-50% identity, more preferably at least about 50-60%, and even more preferably at least 
about 60-70%, 70-80%, 80%, 90%, 95%, 97%, 98%, 99% or 99.8% identity across the amino acid 
sequences of SEQ ID NOs: 1-114 or portions thereof can be used. 

Although this section, entitled "Oligomeric Forms of THAP-1," primarily describes THAP- 
family polypeptides, SLC-binding domains of THAP-family polypeptides, THAP oligomers, SLC- 
binding domain-THAP-immunoglobulin fusion proteins and homologs of these polypeptides as 
well as methods of using such polypeptides, it will be appreciated that such polypeptides are 
included in the class of THAP-type chemokine-binding agents. Accordingly, the above description 
also applies to THAP-type chemokine-binding agents. It will be appreciated that THAP-type 
chemokine-binding agents will be used for applications which include, but are not limited to, 
chemokine binding, inhibiting or enhancing chemokine activity, chemokine detection, reducing the 
symptoms associated with a chemokine influenced or mediated condition, and reducing or 
preventing inflammation or other chemokine mediated conditions. THAP-type chemokine-binding 
agents can also be used in the kits, devices, compositions, and procedures described elsewhere 
herein. 

In some embodiments of the present invention, THAP-type chemokine-binding agents bind 
to or otherwise modulate the activity of one or more chemokines selected from the group consisting 
of XCL1, XCL2, CCL1, CCL2, CCL3, CCL3L1, SCYA3L2, CCL4, CCL4L, CCL5, CCL6, CCL7, 
CCL8, SCYA9, SCYA10, CCL11, SCYA12, CCL13, CCL14, CCL15, CCL16, CCL17, CCL18, 
CCL19, CCL20, CCL21, CCL22, CCL23, CCL24, CCL25, CCL26, CCL27, CCL28, clone 391, 
CARP CC-1, CCL1, CK-1, regakine-1, K203, CXCL1, CXCL1P, CXCL2, CXCL3, PF4, PF4V1, 
CXCL5, CXCL6, PPBP, SPBPBP, IL8, CXCL9, CXCL10, CXCL11, CXCL12, CXCL14, 
CXCL15, CXCL16, NAP-4, LFCA-1, Scyba, JSC, VHSV-induced protein, CX3CL1, and fCLl. 
Chemokine Binding Domains 

In some embodiments of the present invention a chemokine-binding domain that consists 
essentially of the chemokine binding portion of a THAP-family polypeptide is contemplated. In 
some embodiments, the THAP-family polypeptide is THAP-1 (SEQ ID NO: 3) or a homolog 
thereof. Chemokines that are capable of binding to any particular THAP-family member can be 
determined as described in Examples 16, 32 and 33, which set out both in vitro and in vivo assays 
for determining the binding affinity of several different chemokines to THAP-1. The portion of the 
THAP-family protein that binds to the chemokine can readily be determined through the analysis of 
deletion and point mutants of any of the THAP-family members capable of chemokine-binding. 
Such analyses of deletion and point mutants were used to determine the specific region of THAP-1 
that permits SLC-binding (see Example 15). Additionally, deletion and point mutation studies were 
used to determine portions of THAP-family proteins as well as specific amino acid residues that 
interact with PAR-4 (Examples 4-7 and 13). It will be appreciated that the methods described in 
these Examples can be used to precisely identify the chemokine-binding portion of any THAP- 
family member using any chemokine. 

By "chemokine-binding domain" or "portion that binds to a chemokine" is meant a 
fragment which comprises 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 
29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 
55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 
81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 
105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 
125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 
145, 146, 147, 148, 149, 150, 160, 170, 180, 190, 200, 210 or greater than 210 consecutive amino 
acids of a THAP-family polypeptide but less than the total number of amino acids present in the 
THAP-family polypeptide. In some embodiments, the THAP-family polypeptide is THAP-1 (SEQ 
ID NO: 3). 

The complete amino acid sequence of each human THAP-family polypeptide is described 
in the Sequence Listing. In particular, THAP-1 is (SEQ ID NO: 3), THAP-2 is (SEQ ID NO: 4), 
THAP-3 is (SEQ ID NO: 5), THAP-4 is (SEQ ID NO: 6), THAP-5 is (SEQ ID NO: 7), THAP-6 is 
(SEQ ID NO: 8), THAP-7 is (SEQ ID NO: 9), THAP-8 is (SEQ ID NO: 10), THAP-9 is (SEQ ID 
NO: 11), THAP-10 is (SEQ ID NO: 12), THAP-11 is (SEQ ID NO: 13), and THAP-0 is (SEQ ID NO: 
14). The complete amino acid sequences of additional THAP-family polypeptides from other 
species are also listed in the Sequence Listing as SEQ ID NOs: 16-98. As such, the chemokine- 
binding portion of any of these THAP-family polypeptide sequences that are listed in the Sequence 
Listing is explicitly described. In particular, in some embodiments, the chemokine-binding domain 
is a fragment of a THAP-family chemokine-binding agent described by the formula: 

for each THAP-family polypeptide, N = the number of amino acids in the full-length 
polypeptide; B = a number between 1 and N - 1; and E = a number between 1 and N. 

For any THAP-family polypeptide, a chemokine-binding domain is specified by any 
consecutive sequence of amino acids beginning at an amino acid position B and ending at amino 
acid position E, wherein E > B. 
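
The B and E convention above can be made concrete with a short Python sketch. It assumes 1-based, inclusive positions, as the definitions of B and E suggest; the function names and the toy sequence are hypothetical and are not THAP sequences.

```python
# Hypothetical helpers illustrating the B/E fragment convention.
def chemokine_binding_fragment(sequence: str, b: int, e: int) -> str:
    """Return the fragment from position B to position E (1-based, inclusive)."""
    n = len(sequence)
    if not (1 <= b <= n - 1 and 1 <= e <= n and e > b):
        raise ValueError("require 1 <= B <= N - 1, 1 <= E <= N and E > B")
    return sequence[b - 1:e]


def count_fragments(n: int) -> int:
    """Number of (B, E) pairs with 1 <= B < E <= N, i.e. N * (N - 1) / 2."""
    return n * (n - 1) // 2


toy = "ACDEFGHIKL"  # toy 10-residue sequence, not a THAP sequence
print(chemokine_binding_fragment(toy, 3, 6))  # residues 3 through 6
print(count_fragments(213))  # pairs available for a 213-residue polypeptide
```

Note that the pair B = 1, E = N corresponds to the full-length polypeptide, which the definition of "chemokine-binding domain" given earlier excludes; the count above includes that one pair.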

Methods Of Complex Formation Between A Chemokine And A THAP-Type Chemokine- 
Binding Agent 

Some aspects of the present invention relate to methods for forming a complex between a 
chemokine and a THAP-type chemokine-binding agent. These methods include the step of 
contacting one or more chemokines with one or more THAP-type chemokine-binding agents 
described herein such that a complex comprising one or more chemokines and one or more THAP- 
type chemokine-binding agents is formed. In some embodiments, a plurality of different 
chemokines are contacted with one or a plurality of different THAP-type chemokine-binding agents 
so as to form one or more complexes. Alternatively, a plurality of different THAP-type chemokine- 
binding agents are contacted with one or more chemokines so as to form one or more complexes. 

A number of different chemokines can be used in the above-described complex formation 
methods. Such chemokines include, but are not limited to, XCL1, XCL2, CCL1, CCL2, CCL3, 
CCL3L1, SCYA3L2, CCL4, CCL4L, CCL5, CCL6, CCL7, CCL8, SCYA9, SCYA10, CCL11, 
SCYA12, CCL13, CCL14, CCL15, CCL16, CCL17, CCL18, CCL19, CCL20, CCL21, CCL22, 
CCL23, CCL24, CCL25, CCL26, CCL27, CCL28, clone 391, CARP CC-1, CCL1, CK-1, 
regakine-1, K203, CXCL1, CXCL1P, CXCL2, CXCL3, PF4, PF4V1, CXCL5, CXCL6, PPBP, SPBPBP, 
IL8, CXCL9, CXCL10, CXCL11, CXCL12, CXCL14, CXCL15, CXCL16, NAP-4, LFCA-1, 
Scyba, JSC, VHSV-induced protein, CX3CL1 and fCLl. 

Methods of forming a complex between a THAP-type chemokine-binding agent and a 
chemokine can be used both in vitro and in vivo. For example, in vitro uses can include the 
detection of a chemokine in a solution or a biological sample that has been removed or withdrawn 
from a subject. Such samples may include, but are not limited to, tissue samples, blood samples, 
and other fluid or solid samples of biological material. In vivo uses can include, but are not limited 
to, the detection or localization of chemokines in a subject, reducing or inhibiting the activity of one 
or more chemokines throughout or in certain areas of a subject's body, and reducing the symptoms 
associated with a chemokine influenced or mediated condition. 
Modulation of Transcription 

In some embodiments of the present invention THAP-family polypeptides, THAP DNA- 
binding domains (THAP domains), homologs of THAP-family proteins or homologs of THAP 
domains are used to modulate transcription. In other embodiments, THAP-family polypeptides, 
THAP domains, homologs of THAP-family proteins or homologs of THAP domains interact with a 
chemokine to modulate transcription. In either of the above-mentioned embodiments, a THAP- 
family polypeptide, THAP domain, THAP-chemokine complex or homologs thereof recognize a 
THAP responsive element. Recognition of the THAP responsive element by a THAP-family 
polypeptide, THAP domain, THAP-chemokine complex or homologs thereof results in the 
modulation of one or more THAP responsive promoters. 

As used herein, "THAP responsive promoter" means a promoter comprising one or more 
THAP responsive elements. THAP responsive promoters also include promoters that are indirectly 
regulated by THAP. For example, a THAP responsive element may be present as an upstream 
enhancer sequence, the presence of which activates transcription at the downstream promoter. In 
another nonlimiting example, a first promoter may be modulated by a polypeptide that is encoded 
by a gene under the control of a second promoter having a THAP responsive element; however, the 
first promoter does not comprise a THAP responsive element. In such a case, the activity of the 
first promoter is indirectly responsive to THAP because transcription is modulated by the 
polypeptide encoded by the second promoter which is responsive to THAP. 

As used herein, "THAP responsive elements" include, but are not limited to, nucleic acids 
which comprise one or more of the following nucleotide consensus sequences. The first THAP 
responsive element consensus sequence comprises the nucleotide sequences GGGCAA or 
TGGCAA organized as direct repeats with approximately a 5 nucleotide spacing (DR-5 motifs). 
For example, one consensus sequence is GGGCAAnnnnnTGGCAA (SEQ ID NO: 149). Although 
GGGCAA and TGGCAA sequences constitute a typical THAP domain DNA binding site (THAP 
responsive element), GGGCAT, GGGCAG and TGGCAG sequences are also DNA target 
sequences recognized by the THAP DNA-binding domain. Additionally, a second THAP 
responsive element consensus sequence comprises the nucleotide sequences TTGCCA or 
GGGCAA organized as everted repeats with 11 nucleotide spacing (ER-11 motifs). For example, 
one consensus sequence is TTGCCAnnnnnnnnnnnGGGCAA (SEQ ID NO: 159). Although 
TTGCCA and GGGCAA sequences constitute a typical THAP responsive element, CTGCCA is 
also recognized. 
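
As an illustration of how the DR-5 and ER-11 consensus sequences could be located in a nucleotide sequence, the following Python sketch scans one strand with regular expressions. It is a simplified sketch under stated assumptions: the spacing is treated as exactly 5 or 11 nucleotides (the text says "approximately" for DR-5), only the typical half-sites are matched, the alternative sites GGGCAT, GGGCAG, TGGCAG and CTGCCA are omitted, and the names `MOTIFS` and `find_thap_motifs` are hypothetical.

```python
import re

# Assumed patterns built from the consensus sequences in the text.
HALF_SITE = "(?:GGGCAA|TGGCAA)"
MOTIFS = {
    # Direct repeats of the half-site separated by 5 nucleotides.
    "DR-5": re.compile(f"(?=({HALF_SITE}[ACGT]{{5}}{HALF_SITE}))"),
    # Everted repeat: TTGCCA, 11 nucleotides, GGGCAA.
    "ER-11": re.compile(r"(?=(TTGCCA[ACGT]{11}GGGCAA))"),
}


def find_thap_motifs(sequence: str):
    """Return (motif name, 0-based start, matched text) for each hit.

    Lookahead patterns are used so that overlapping hits are reported.
    Only the given strand is scanned.
    """
    seq = sequence.upper()
    hits = []
    for name, pattern in MOTIFS.items():
        for match in pattern.finditer(seq):
            hits.append((name, match.start(), match.group(1)))
    return sorted(hits, key=lambda hit: hit[1])


# Toy sequences constructed for illustration only.
print(find_thap_motifs("AAAGGGCAAACGTATGGCAATTT"))
print(find_thap_motifs("CCTTGCCAACGTACGTACGGGGCAAGG"))
```

A fuller search would also scan the reverse complement and allow the alternative half-sites named in the text.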

Another THAP responsive element is the THRE consensus sequence which is illustrated in 
Figure 24 (SEQ ID NO: 306). In some embodiments of the present invention, THRE is a 
preferential recognition motif for monomeric THAP-family polypeptides or biologically active 
fragments thereof. In some embodiments, THRE is preferentially recognized by the THAP1 
monomer. Alternatively, in some embodiments, the DR-5 and/or the ER-11 motif is preferentially 
recognized by a dimer or a multimer of a THAP-family polypeptide or biologically active 
fragments thereof. In some embodiments, the THAP dimers or multimers comprise THAP1. 

A THAP responsive element can comprise either a single type of consensus nucleotide 
sequence or multiple types of consensus sequences. For example, a THAP responsive element can 
comprise one, two, three, four, five or more than five DR-5 consensus sequences. Similarly, a 
THAP responsive element can comprise one, two, three, four, five or more than five ER-11 
consensus sequences. In another example, a THAP responsive element can comprise one, two, 
three, four, five or more than five THRE consensus sequences. In addition, a THAP responsive 
element can comprise a mixture of two, three, four, five or more than five DR-5, ER-11 and THRE 
consensus sequences. Furthermore, any of the aforementioned THAP responsive elements can 
comprise one or more variants of DR-5, ER-11 or THRE consensus sequences or variants of some 
or all of DR-5, ER-11 or THRE consensus sequences. 

It will be appreciated that other minor nucleotide sequence variations can occur in THAP 
responsive element consensus sequences which do not substantially affect the binding of the THAP 
domain to the THAP responsive element. For example, a THAP responsive element can comprise a 
nucleic acid having at least 99%, at least 98%, at least 97%, at least 96%, at least 95%, at least 94%, 
at least 93%, at least 92%, at least 91%, at least 90%, at least 89%, at least 88%, at least 87%, at least 
86%, at least 85%, at least 84%, at least 83%, at least 82%, at least 81%, at least 80%, at least 75%, at 
least 70%, at least 65%, at least 60%, at least 55%, or at least 50% nucleotide sequence identity to a 
consensus sequence for DR-5, ER-11 or THRE. 
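
The specification does not state how nucleotide sequence identity to a consensus sequence is computed. As one possible reading, the Python sketch below compares a candidate of equal length to a consensus position by position without gaps, and scores only the defined consensus positions (the spacer positions written as n are skipped). The function name and this scoring convention are assumptions made for illustration.

```python
# Assumed convention: ungapped, equal-length comparison; 'n' positions unscored.
def percent_identity(consensus: str, candidate: str) -> float:
    """Percent identity of candidate to consensus over defined positions."""
    if len(consensus) != len(candidate):
        raise ValueError("sequences must have the same length")
    scored = [(c.upper(), x.upper())
              for c, x in zip(consensus, candidate)
              if c not in "nN"]
    if not scored:
        raise ValueError("consensus has no defined positions")
    matches = sum(c == x for c, x in scored)
    return 100.0 * matches / len(scored)


DR5_CONSENSUS = "GGGCAAnnnnnTGGCAA"  # consensus given in the text

# Toy candidate with TGGCAG as the second half-site (one mismatch in twelve).
print(round(percent_identity(DR5_CONSENSUS, "GGGCAAACGTATGGCAG"), 1))
```

Under this convention, a candidate differing at one of the twelve defined DR-5 positions has 11/12, or about 91.7%, identity, which would satisfy the "at least 90%" threshold but not the "at least 92%" threshold.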

In some embodiments of the present invention, the THAP-family polypeptide, THAP 
domain, THAP-chemokine complex or homologs thereof recognize a THAP responsive element in 
the promoter of the gene or genes whose transcription is modulated. Alternatively, in other 
embodiments, the THAP-family polypeptide, THAP domain, THAP-chemokine complex or 
homologs thereof recognize a THAP responsive element at locations other than the promoter of the 
gene or genes whose transcription is modulated. 

Upon binding of the THAP responsive element by a THAP-family polypeptide, THAP 
domain, THAP-chemokine complex or homolog thereof, transcription can be modulated. Such 
modulation may include repression or activation of transcription. Whether transcription is 
repressed or activated, as well as the extent of repression or activation, can be influenced by many 
factors, including but not limited to, the number and position of THAP responsive elements, the 
THAP-family member or homolog that is bound and, in the case of THAP-chemokine complexes, 
the type of chemokine that forms the THAP-chemokine complex. 

In some embodiments, chemokine analogs can be used to bind to THAP-family 
polypeptides or biologically active fragments thereof. For example, a chemokine can be modified 
so as to retain its THAP-binding or THAP interaction activity but alter other of its physiological 
effects. Such chemokine analogs can be used to modulate transcription by allowing recognition and 
binding of THAP to a THAP responsive element without mediating other of its physiological 
effects. As used herein, "chemokine analogs" are chemokine homologs having at least 99%, at 
least 97%, at least 95%, at least 93%, at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at 
least 65%, at least 60%, at least 50%, at least 40% or at least 30% amino acid identity to a specific 
chemokine. For example, analogs of SLC comprise polypeptide homologs of SLC having at least 
99%, at least 97%, at least 95%, at least 93%, at least 90%, at least 85%, at least 80%, at least 75%, at least 
70%, at least 65%, at least 60%, at least 50%, at least 40% or at least 30% amino acid identity to 
SLC. As another example, analogs of CXCL9 comprise polypeptide homologs of CXCL9 having 
at least 99%, at least 97%, at least 95%, at least 93%, at least 90%, at least 85%, at least 80%, at least 75%, 
at least 70%, at least 65%, at least 60%, at least 50%, at least 40% or at least 30% amino acid 
identity to CXCL9. Chemokine analogs can also include chemically modified chemokines. 

Some embodiments of the present invention relate to the screening of a test compound to 
determine whether it is capable of modulating transcription of a nucleic acid under control of a 
THAP responsive element. A number of constructs can be generated wherein a nucleic acid is 
placed under control of at least one THAP responsive element. In some embodiments, the construct 
is introduced into a cell which is responsive to a chemokine. For example, in some embodiments, 
the construct is introduced into a cell which is responsive to SLC, such as a cell expressing the 
CCR7 receptor. In another example, in some embodiments, the construct is introduced into a cell 
which is responsive to CXCL9, such as a cell expressing the CXCR3 receptor. For example, a 
nucleic acid can be operably linked to a promoter comprising one or more THAP responsive 
elements. The nucleic acid can be a nucleic acid which results in a transcript that is capable of 
detection. The transcript may be detected and quantified by any method known in the art. In some 
embodiments, the nucleic acid will encode a reporter protein, including but not limited to, GFP, 
luciferase, β-galactosidase, and GUS. The activity of such a reporter protein can be used to measure 
the amount of transcription that occurs from the promoter containing the THAP responsive 
elements. 

In some embodiments, a THAP-family protein is allowed to contact the construct 
comprising the nucleic acid that is under control of the THAP responsive element. The THAP- 
family protein may modulate transcription in the absence of the test compound. Alternatively, the 
THAP-family protein may only modulate transcription in the presence of a test compound. In 
either case, the effect of the test compound on the modulation of transcription can be determined by 
determining the increase or decrease in transcription that is caused by the test compound when 
compared to the base level of transcription that occurs in the presence of THAP-family protein prior 
to the addition of test compound. Determining whether the presence of test compound increases or 
decreases the level of transcription at the THAP responsive element when compared to the level of 
transcription in the absence of test compound permits the determination of whether the compound 
modulates transcription of a nucleic acid under the control of a THAP responsive element. 
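
By way of non-limiting illustration, the comparison described above can be sketched in Python. The sketch assumes that reporter activity has already been measured as replicate signal values before and after addition of the test compound. The function names and the 1.5-fold cutoff are hypothetical choices made for this illustration and are not taken from the present specification.

```python
from statistics import mean


def fold_change(baseline_signals, treated_signals):
    """Return the mean treated reporter signal divided by the mean baseline signal.

    baseline_signals: reporter readings with THAP-family protein, before test compound.
    treated_signals:  reporter readings after addition of the test compound.
    """
    baseline = mean(baseline_signals)
    if baseline <= 0:
        raise ValueError("baseline reporter signal must be positive")
    return mean(treated_signals) / baseline


def classify_modulation(fold, threshold=1.5):
    """Label the effect of a test compound using a hypothetical fold-change cutoff."""
    if fold >= threshold:
        return "increases transcription"
    if fold <= 1 / threshold:
        return "decreases transcription"
    return "no clear effect"


if __name__ == "__main__":
    fc = fold_change([100, 110, 90], [310, 290, 300])
    print(fc, classify_modulation(fc))
```

In practice the cutoff and any statistical test would be chosen to suit the particular reporter and assay format.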

Certain aspects of the present invention also relate to the use of THAP-family polypeptide- 
chemokine transcription modulators in the treatment or amelioration of conditions resulting from 
an excess of or a deficiency in the transcription of certain genes. Modulation of the interaction of a 
chemokine with a THAP-family polypeptide can be used in the treatment of an individual suffering 
from one or more specific conditions. For example, the interaction between chemokines and 
THAP-family members, such as the polypeptides of SEQ ID NOs: 1-114, can be used to modulate 
transcription of certain genes thereby resulting in suppression of tumorigenesis and/or metastasis, 
inhibition or stimulation of apoptosis of endothelial cells in angiogenesis-dependent diseases 
including but not limited to cancer, cardiovascular diseases, inflammatory diseases, and inhibition 
of apoptosis of neurons in acute and chronic neurodegenerative disorders, including but not limited 
to Alzheimer's, Parkinson's and Huntington's diseases, amyotrophic lateral sclerosis, HIV 
encephalitis, stroke, epileptic seizures and malignant tumors. 

In some embodiments chemokine analogs can be used to interact with THAP-family 
polypeptides so as to treat or otherwise ameliorate the symptoms associated with the above- 
mentioned conditions. 

It will be appreciated that THAP-type chemokine-binding agents can also be used to 
modulate transcription as described above. Some embodiments of such modulation of transcription 
are set out below. 
Transcription Factor Decoys 

Some embodiments of the present invention relate to transcription factor decoys and 
methods of their use. In some embodiments of the present invention, a transcription factor decoy is 
any molecule that functions to inhibit or otherwise modulate the effect of a THAP/chemokine 
complex or a THAP-family polypeptide or a biologically active fragment thereof on gene 
transcription. In some embodiments, a transcription factor decoy is a molecule that acts as an 
inhibitor of the interaction between a THAP-family polypeptide or a biologically active fragment 
thereof and a nucleic acid. Alternatively, a transcription factor decoy can inhibit the interaction 
between a THAP/chemokine complex and a nucleic acid. For example, the nucleic acid can be a 
THAP responsive promoter or any other nucleic acid sequence which is involved in the modulation 
of the expression of a THAP responsive gene or a gene responsive to a THAP/chemokine complex. 

In some embodiments of the present invention, the transcription factor decoy functions to 
inhibit, lessen or negate the effect of a THAP/chemokine complex or a THAP-family polypeptide 
or a biologically active fragment thereof on the expression of certain genes. For example, some 
transcription factor decoys function as competitive inhibitors of the interaction between a nucleic 
acid and a THAP/chemokine complex or a nucleic acid and a THAP-family polypeptide or a 
biologically active fragment thereof. In other embodiments, the transcription factor decoy functions 
as a nonreversible or suicide inhibitor. In yet other embodiments, the transcription factor decoy 
acts as a reversible inhibitor. 

Some embodiments of the present invention contemplate transcription factor decoys which 
comprise one or more nucleic acids which comprise or consist essentially of a THAP responsive 
element. THAP responsive elements that are useful for the construction of transcription factor 
decoys include, but are not necessarily limited to, DR-5 elements, ER-11 elements and THRE 
elements. In some embodiments, the transcription factor decoys comprise one or more nucleic 
acids having a nucleotide sequence selected from the group consisting of SEQ ID NOs: 140-159 
and 306. In some embodiments of the present invention, transcription factor decoys comprise a 
plurality of nucleic acids which comprise one or more THAP responsive elements. In such 
embodiments, the sequence of the THAP responsive elements may be the same or different. 
Some embodiments of the present invention also contemplate pharmaceutical 
compositions which comprise one or more transcription factor decoys in a pharmaceutically acceptable 
carrier. As described above, the pharmaceutical compositions can comprise transcription factor 
decoys comprising one or more nucleic acid sequences which comprise one or more THAP 
responsive elements. 

Additional embodiments of the present invention contemplate methods of using 
transcription factor decoys to inhibit, lessen or otherwise modulate the expression of one or more 
genes that are responsive to a THAP/chemokine complex or one or more genes that are responsive 
to a THAP-family polypeptide or a fragment thereof. 

Effect of Interactions Between Chemokines and THAP-Type Chemokine-Binding Agents 

Some embodiments of the present invention relate to methods of modulating chemokine 
interactions with cellular receptors. Such receptors can be extracellular or can be molecules that are 
present within the cell. For example, chemokines SLC and ELC can bind to extracellular 
chemokine receptors CCR7 and CCR11. The chemokine CCL5 binds to extracellular chemokine 
receptors CCR1, CCR3 and CCR5. The CXCL-family chemokines, CXCL9 and CXCL10, bind to 
the extracellular chemokine receptor, CXCR3. Other chemokine interactions with receptors are 
also known in the art and are included in Ransohoff, R. M. and Karpus, W. J. (2001). Roles of 
Chemokines and Their Receptors in the Induction and Regulation of Autoimmune Disease, in 
Contemporary Clinical Neuroscience: Cytokines and Autoimmune Diseases, V. K. Kuchroo, et al., 
eds. Humana Press, Totowa, N.J., pages 157-191. 

In some embodiments of the present invention the interaction of chemokines with 
extracellular receptors is enhanced or inhibited by providing to a cell, which expresses one or 
more extracellular chemokine receptors, a THAP-type chemokine-binding agent. Such 
extracellular receptors can include, but are not limited to, CCR1, CCR2, CCR3, CCR4, CCR5, 
CCR6, CCR7, CCR8, CCR9, CCR10, CCR11, CXCR1, CXCR2, CXCR3, CXCR4 and CXCR5. 
In some embodiments of the present invention, a THAP-type chemokine-binding agent binds to or 
otherwise interacts with a chemokine thereby forming a complex which binds to the extracellular 
receptor with more or less affinity. In some embodiments, chemokine interaction with one or more 
extracellular receptors is modulated by providing one or more THAP-type chemokine-binding 
agents. 

Other aspects of the present invention relate to modulating the movement of a chemokine 
from the outside of a cell to the inside of the cell. For example, modulation of chemokine 
interaction with one or more extracellular receptors can increase or decrease the uptake of 
chemokines into the cell. In some embodiments of the present invention, chemokine uptake into a 
cell is modulated by providing a THAP-type chemokine-binding agent either in vitro or in vivo in the 
proximity of a cell which expresses one or more chemokine receptors. The THAP-type chemokine- 
binding agent binds to or otherwise interacts with one or more chemokines including, but not 
limited to, XCL1, XCL2, CCL1, CCL2, CCL3, CCL3L1, SCYA3L2, CCL4, CCL4L, CCL5, 
CCL6, CCL7, CCL8, SCYA9, SCYA10, CCL11, SCYA12, CCL13, CCL14, CCL15, CCL16, 
CCL17, CCL18, CCL19, CCL20, CCL21, CCL22, CCL23, CCL24, CCL25, CCL26, CCL27, 
CCL28, clone 391, CARP CC-1, CCL1, CK-1, regakine-1, K203, CXCL1, CXCL1P, CXCL2, 
CXCL3, PF4, PF4V1, CXCL5, CXCL6, PPBP, SPBPBP, IL8, CXCL9, CXCL10, CXCL11, 
CXCL12, CXCL14, CXCL15, CXCL16, NAP-4, LFCA-1, Scyba, JSC, VHSV-induced protein, 
CX3CL1 and fCL1 thereby modulating the uptake of the chemokine into the cell. 

In some embodiments, THAP-type chemokine-binding agents form a complex with one or 
more chemokines inside the cell nucleus. In such embodiments, a THAP-type chemokine-binding 
agent is provided to a cell such that the THAP-type chemokine-binding agent binds to or otherwise 
interacts with one or more chemokines. The THAP-type chemokine-binding agent can be provided 
to cells both in vitro and in vivo. In some embodiments, the THAP-type chemokine-binding agent 
is provided extracellularly wherein it is taken up by the cell either prior to or after binding to a 
chemokine. In other embodiments, the THAP-type chemokine-binding agent is provided inside 
the cell. For example, a nucleic acid encoding a THAP-type chemokine-binding agent is introduced 
into a cell such that the THAP-type chemokine-binding agent is expressed inside the cell. Methods 
of introducing expressible recombinant nucleic acids into a cell are well known in the art. In some 
embodiments of the present invention, the nucleic acid encoding the THAP-type chemokine- 
binding agent is placed under the control of a constitutive promoter. In other embodiments, the 
promoter which controls expression of the THAP-type chemokine-binding agent is regulatable. 
Chemokines which contact or enter the nucleus are bound by the THAP-type chemokine-binding agent 
which has been introduced into the cell. For example, a nucleic acid encoding a full-length THAP1 
polypeptide can be placed under control of a regulatable promoter such that, upon induction, the 
polypeptide is expressed then localized to the nucleus. The THAP1 that is present in the nucleus 
binds to SLC which has been transported to the nucleus thereby forming a THAP1/SLC complex. 
It will be appreciated that other methods can also be used to introduce THAP-type chemokine- 
binding agents into a cell. Additionally, it will be appreciated that more than one type of THAP- 
type chemokine-binding agent can be introduced into a cell. 

In some embodiments, THAP-type chemokine-binding agents can be introduced into the 
cytoplasm of the cell. In such embodiments, the THAP-type chemokine-binding agents that are 
present in the cytoplasm of the cell can be used in the formation of complexes with one or more 
chemokines. The formation of such complexes modulates the transport of chemokine into the 
nucleus. 

In some embodiments of the present invention, chemokines or complexes comprising 
chemokines and THAP-type chemokine-binding agents that are present within the nucleus of the 
cell modulate gene expression. In such embodiments, the expression of one or more genes which 
are under the control of a THAP responsive promoter is modulated. In some embodiments, a 
THAP responsive promoter includes one or more THAP responsive elements. In other 
embodiments, a THAP responsive promoter need not comprise a THAP responsive element, but 
rather, the promoter is responsive to a gene product that is produced by a gene that is under the 
control of a promoter containing one or more THAP responsive elements. Such THAP responsive 
promoters have been described in detail above. 

The THAP-type chemokine-binding agent that is used to modulate transcription of a THAP 
responsive promoter can be any THAP-type chemokine-binding agent; however, some preferred 
agents include THAP1 and polypeptides comprising an amino acid sequence having at least 99%, at 
least 98%, at least 97%, at least 96%, at least 95%, at least 94%, at least 93%, at least 92%, at least 
91%, at least 90%, at least 89%, at least 88%, at least 87%, at least 86%, at least 85%, at least 84%, 
at least 83%, at least 82%, at least 81%, at least 80%, at least 75%, at least 70%, at least 65%, at 
least 60%, at least 55%, at least 50%, at least 45%, at least 40%, at least 35%, or at least 30% amino 
acid sequence identity with the amino acid sequence of SEQ ID NO: 3. In other embodiments, the THAP- 
type chemokine-binding agent is a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NOs: 1-114 or homologs thereof. 
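
As a non-limiting sketch, percent amino acid sequence identity between two already-aligned sequences can be computed as shown below in Python. It is assumed here, for illustration only, that identity is counted over all alignment columns in which at least one sequence has a residue; other conventions for the denominator exist and alignment programs differ on this point. The function names and the example strings are hypothetical and do not correspond to any SEQ ID NO of the present specification.

```python
# Identity levels recited in the text: 99% down to 80% in steps of 1, then 75% down to 30% in steps of 5.
THRESHOLDS = tuple(range(99, 79, -1)) + tuple(range(75, 25, -5))


def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two aligned sequences of equal length.

    Columns where both sequences carry a gap are ignored; a column counts
    as a match only when both sequences carry the same residue.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def highest_threshold_met(identity, thresholds=THRESHOLDS):
    """Return the highest recited identity level satisfied, or None if below 30%."""
    for level in sorted(thresholds, reverse=True):
        if identity >= level:
            return level
    return None


if __name__ == "__main__":
    pid = percent_identity("ACDE-GH", "ACDFQGH")  # toy aligned strings
    print(pid, highest_threshold_met(pid))
```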

Chemokines which are useful in the modulation of transcription can be any chemokine 
which binds to or otherwise interacts with a THAP-type chemokine-binding agent. Such 
chemokines include, but are not limited to, XCL1, XCL2, CCL1, CCL2, CCL3, CCL3L1, 
SCYA3L2, CCL4, CCL4L, CCL5, CCL6, CCL7, CCL8, SCYA9, SCYA10, CCL11, SCYA12, 
CCL13, CCL14, CCL15, CCL16, CCL17, CCL18, CCL19, CCL20, CCL21, CCL22, CCL23, 
CCL24, CCL25, CCL26, CCL27, CCL28, clone 391, CARP CC-1, CCL1, CK-1, regakine-1, K203, 
CXCL1, CXCL1P, CXCL2, CXCL3, PF4, PF4V1, CXCL5, CXCL6, PPBP, SPBPBP, IL8, 
CXCL9, CXCL10, CXCL11, CXCL12, CXCL14, CXCL15, CXCL16, NAP-4, LFCA-1, Scyba, 
JSC, VHSV-induced protein, CX3CL1 and fCL1. In some embodiments, polypeptides that are 
homologous to one or more of the above-described chemokines can form a complex with a THAP- 
type chemokine-binding agent thereby modulating transcription at a THAP responsive promoter. 
Such homologs can include polypeptides comprising an amino acid sequence having at least 99%, 
at least 98%, at least 97%, at least 96%, at least 95%, at least 94%, at least 93%, at least 92%, at 
least 91%, at least 90%, at least 89%, at least 88%, at least 87%, at least 86%, at least 85%, at least 
84%, at least 83%, at least 82%, at least 81%, at least 80%, at least 75%, at least 70%, at least 65%, 
at least 60%, at least 55%, at least 50%, at least 45%, at least 40%, at least 35%, or at least 30% 
amino acid sequence identity with the amino acid sequence of any of the above-described 
chemokines. In some preferred embodiments of the present invention, one or more chemokines 
having an amino acid sequence selected from the group consisting of SEQ ID NOs: 271, 273, 275, 
277 and 289 form a complex with one or more THAP-type chemokine-binding agents thereby 
modulating transcription at a THAP responsive promoter. In other embodiments, chemokines 
comprising an amino acid sequence having at least 99%, at least 98%, at least 97%, at least 96%, at 
least 95%, at least 94%, at least 93%, at least 92%, at least 91%, at least 90%, at least 89%, at least 
88%, at least 87%, at least 86%, at least 85%, at least 84%, at least 83%, at least 82%, at least 81%, 
at least 80%, at least 75%, at least 70%, at least 65%, at least 60%, at least 55%, at least 50%, at 
least 45%, at least 40%, at least 35%, or at least 30% amino acid sequence identity with the amino 
acid sequence of a chemokine selected from the group consisting of SEQ ID NOs: 271, 273, 275, 
277 and 289 form a complex with one or more THAP-type chemokine-binding agents thereby 
modulating transcription at a THAP responsive promoter. 
Primers and probes 

Primers and probes of the invention can be prepared by any suitable method, including, for 
example, cloning and restriction of appropriate sequences and direct chemical synthesis by a 
method such as the phosphodiester method of Narang SA et al (Methods Enzymol 1979;68:90-98), 
the phosphodiester method of Brown EL et al (Methods Enzymol 1979;68:109-151), the 
diethylphosphoramidite method of Beaucage et al (Tetrahedron Lett 1981, 22: 1859-1862) and the 
solid support method described in EP 0 707 592. 

Detection probes are generally nucleic acid sequences or uncharged nucleic acid analogs 
such as, for example, peptide nucleic acids, which are disclosed in International Patent Application 
WO 92/20702, and morpholino analogs, which are described in U.S. Patents Numbered 5,185,444; 
5,034,506 and 5,142,047. If desired, the probe may be rendered "non-extendable" in that additional 
dNTPs cannot be added to the probe. In and of themselves analogs usually are non-extendable and 
nucleic acid probes can be rendered non-extendable by modifying the 3' end of the probe such that 
the hydroxyl group is no longer capable of participating in elongation. For example, the 3' end of 
the probe can be functionalized with the capture or detection label to thereby consume or otherwise 
block the hydroxyl group. 

Any of the polynucleotides of the present invention can be labeled, if desired, by 
incorporating any label known in the art to be detectable by spectroscopic, photochemical, 
biochemical, immunochemical, or chemical means. For example, useful labels include radioactive 
substances (including 32P, 35S, 3H, 125I), fluorescent dyes (including 5-bromodesoxyuridin, 
fluorescein, acetylaminofluorene, digoxigenin) or biotin. Preferably, polynucleotides are labeled at 
their 3' and 5' ends. Examples of non-radioactive labeling of nucleic acid fragments are described 
in Urdea et al. (Nucleic Acids Research. 11:4937-4957, 1988) or Sanchez-Pescador et al. (J. Clin. 
Microbiol. 26(10):1934-1938, 1988). In addition, the probes according to the present invention 
may have structural characteristics such that they allow the signal amplification, such structural 
characteristics being, for example, branched DNA probes as those described by Urdea et al (Nucleic 
Acids Symp. Ser. 24:197-200, 1991) or in the European patent No. EP 0 225 807 (Chiron). 

A label can also be used to capture the primer, so as to facilitate the immobilization of 
either the primer or a primer extension product, such as amplified DNA, on a solid support. A 
capture label is attached to the primers or probes and can be a specific binding member which 
forms a binding pair with the solid phase reagent's specific binding member (e.g. biotin and 
streptavidin). Therefore depending upon the type of label carried by a polynucleotide or a probe, it 
may be employed to capture or to detect the target DNA. Further, it will be understood that the 
polynucleotides, primers or probes provided herein, may, themselves, serve as the capture label. 
For example, in the case where a solid phase reagent's binding member is a nucleic acid sequence, 
it may be selected such that it binds a complementary portion of a primer or probe to thereby 
immobilize the primer or probe to the solid phase. In cases where a polynucleotide probe itself 
serves as the binding member, those skilled in the art will recognize that the probe will contain a 
sequence or "tail" that is not complementary to the target. In the case where a polynucleotide 
primer itself serves as the capture label, at least a portion of the primer will be free to hybridize with 
a nucleic acid on a solid phase. DNA labeling techniques are well known to the skilled technician. 

The probes of the present invention are useful for a number of purposes. They can 
notably be used in Southern hybridization to genomic DNA. The probes can also be used to detect 
PCR amplification products. They may also be used to detect mismatches in a THAP-family gene 
or mRNA using other techniques. 

Any of the nucleic acids, polynucleotides, primers and probes of the present invention can 
be conveniently immobilized on a solid support. Solid supports are known to those skilled in the art 
and include the walls of wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, 
nitrocellulose strips, membranes, microparticles such as latex particles, sheep (or other animal) red 
blood cells, duracytes and others. The solid support is not critical and can be selected by one 
skilled in the art. Thus, latex particles, microparticles, magnetic or non-magnetic beads, 
membranes, plastic tubes, walls of microtiter wells, glass or silicon chips, sheep (or other suitable 
animal's) red blood cells and duracytes are all suitable examples. Suitable methods for 
immobilizing nucleic acids on solid phases include ionic, hydrophobic, covalent interactions and 
the like. A solid support, as used herein, refers to any material which is insoluble, or can be made 
insoluble by a subsequent reaction. The solid support can be chosen for its intrinsic ability to attract 
and immobilize the capture reagent. Alternatively, the solid phase can retain an additional receptor 
which has the ability to attract and immobilize the capture reagent. The additional receptor can 
include a charged substance that is oppositely charged with respect to the capture reagent itself or to 
a charged substance conjugated to the capture reagent. As yet another alternative, the receptor 
molecule can be any specific binding member which is immobilized upon (attached to) the solid 
support and which has the ability to immobilize the capture reagent through a specific binding 
reaction. The receptor molecule enables the indirect binding of the capture reagent to a solid 
support material before the performance of the assay or during the performance of the assay. The 
solid phase thus can be a plastic, derivatized plastic, magnetic or non-magnetic metal, glass or 
silicon surface of a test tube, microtiter well, sheet, bead, microparticle, chip, sheep (or other 
suitable animal's) red blood cells, duracytes and other configurations known to those of ordinary 
skill in the art. The nucleic acids, polynucleotides, primers and probes of the invention can be 
attached to or immobilized on a solid support individually or in groups of at least 2, 5, 8, 10, 12, 15, 
20, or 25 distinct polynucleotides of the invention to a single solid support. In addition, 
polynucleotides other than those of the invention may be attached to the same solid support as one 
or more polynucleotides of the invention. 

Any polynucleotide provided herein may be attached in overlapping areas or at random 
locations on a solid support. Alternatively the polynucleotides of the invention may be attached in 
an ordered array wherein each polynucleotide is attached to a distinct region of the solid support 
which does not overlap with the attachment site of any other polynucleotide. Preferably, such an 
ordered array of polynucleotides is designed to be "addressable" where the distinct locations are 
recorded and can be accessed as part of an assay procedure. Addressable polynucleotide arrays 
typically comprise a plurality of different oligonucleotide probes that are coupled to a surface of a 
substrate in different known locations. Knowledge of the precise location of each 
polynucleotide makes these "addressable" arrays particularly useful in hybridization 
assays. Any addressable array technology known in the art can be employed with the 
polynucleotides of the invention. One particular embodiment of these polynucleotide arrays is 
known as the Genechips, and has been generally described in US Patent 5,143,854 and PCT 
publications WO 90/15070 and WO 92/10092. 
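
Purely as an illustrative sketch, an addressable array can be modelled in Python as a mapping from a distinct (row, column) location to the polynucleotide attached there, so that a hybridization signal recorded at a location can be traced back to a probe. The function names, the row-major layout and the probe labels below are hypothetical and are not drawn from any particular array technology.

```python
def build_addressable_array(probes, n_cols):
    """Assign each probe a distinct, non-overlapping (row, col) address in row-major order."""
    if n_cols < 1:
        raise ValueError("n_cols must be at least 1")
    return {divmod(index, n_cols): probe for index, probe in enumerate(probes)}


def probes_at(array, hit_addresses):
    """Return the probes recorded at the addresses where signal was detected."""
    return [array[address] for address in hit_addresses]


if __name__ == "__main__":
    # Hypothetical probe labels; real probes would be oligonucleotide sequences.
    array = build_addressable_array(["probe_A", "probe_B", "probe_C"], n_cols=2)
    print(array)
    print(probes_at(array, [(1, 0)]))
```
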
Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding a THAP family or THAP domain polypeptide, or a biologically 
active fragment or homologue thereof. 

Vectors may have particular use in the preparation of a recombinant protein of the 
invention, or for use in gene therapy. Gene therapy presents a means to deliver a THAP family or 
THAP domain polypeptide, or a biologically active fragment or homologue thereof to a subject in 
order to regulate apoptosis for treatment of a disorder. 

As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting 
another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a 
circular double stranded DNA loop into which additional DNA segments can be ligated. Another 
type of vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into which they are 
introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a 
host cell upon introduction into the host cell, and thereby are replicated along with the host genome. 
Moreover, certain vectors are capable of directing the expression of genes to which they are 
operatively linked. Such vectors are referred to herein as "expression vectors". In general, 
expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In 
the present specification, "plasmid" and "vector" can be used interchangeably as the plasmid is the 
most commonly used form of vector. However, the invention is intended to include such other 
forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a THAP-family or THAP 
domain nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host 
cell, which means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which are operatively 
linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, 
"operably linked" is intended to mean that the nucleotide sequence of interest is linked to the 
regulatory sequence(s) in a manner which allows for expression of the nucleotide sequence (for 
example, in an in vitro transcription/translation system or in a host cell when the vector is 
introduced into the host cell). The term "regulatory sequence" is intended to include promoters, 
enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory 
sequences are described, for example, in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include those 
which direct constitutive expression of a nucleotide sequence in many types of host cell and those 
which direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific 
regulatory sequences). It will be appreciated by those skilled in the art that the design of the 
expression vector can depend on such factors as the choice of the host cell to be transformed, the 
level of expression of protein desired, etc. The expression vectors of the invention can be 
introduced into host cells to thereby produce proteins or peptides, including fusion proteins or 
peptides, encoded by nucleic acids as described herein (e.g., THAP-family proteins, mutant forms 
of THAP-family proteins, fusion proteins, or fragments of any of the preceding proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of a 
THAP family or THAP domain polypeptide, or a biologically active fragment or homologue thereof 
in prokaryotic or eukaryotic cells. For example, THAP-family or THAP domain proteins can be 
expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression vectors), yeast 
cells, or mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). 
Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for 
example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with vectors 
containing constitutive or inducible promoters directing the expression of either fusion or non- 
fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, usually to 
the amino terminus of the recombinant protein. Such fusion vectors typically serve three purposes: 
1) to increase expression of recombinant protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the 
junction of the fusion moiety and the recombinant protein to enable separation of the recombinant 
protein from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 
their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion 
expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D. B. and Johnson, K. S. (1988) 
Gene 67:31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, 
Piscataway, N.J.), which fuse glutathione S-transferase (GST), maltose E binding protein, or protein 
A, respectively, to the target recombinant protein. 

Purified fusion proteins can be utilized in THAP-family activity assays (for example, direct 
assays or competitive assays described in detail below), or to generate antibodies specific for 
THAP-family or THAP domain proteins, for example. In a preferred embodiment, a THAP-family 
or THAP domain fusion protein expressed in a retroviral expression vector of the present invention 
can be utilized to infect bone marrow cells which are subsequently transplanted into irradiated 
recipients. The pathology of the subject recipient is then examined after sufficient time has passed 
(for example, six (6) weeks). 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amann 
et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene Expression Technology: Methods 
in Enzymology 185, Academic Press, San Diego, Calif. (1990) 60-89). Target gene expression from 
the pTrc vector relies on host RNA polymerase transcription from a hybrid trp-lac fusion promoter. 
Target gene expression from the pET 11d vector relies on transcription from a T7 gn10-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase (T7 gn1). This viral polymerase is 
supplied by host strains BL21(DE3) or HMS174(DE3) from a resident prophage harboring a T7 
gn1 gene under the transcriptional control of the lacUV5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express the protein 
in host bacteria with an impaired capacity to proteolytically cleave the recombinant protein 
(Gottesman, S., Gene Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, Calif. (1990) 119-128). Another strategy is to alter the nucleic acid sequence of the nucleic 
acid to be inserted into an expression vector so that the individual codons for each amino acid are 
those preferentially utilized in E. coli (Wada et al., (1992) Nucleic Acids Res. 20:2111-2118). Such 
alteration of nucleic acid sequences of the invention can be carried out by standard DNA synthesis 
techniques. 
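
The codon alteration strategy described above can be illustrated, without limitation, by the following Python sketch. It recodes an in-frame coding sequence by replacing each codon with a designated synonymous codon while leaving the encoded amino acid sequence unchanged. The function names are hypothetical, and the small PREFERRED table is an illustrative placeholder covering only a few amino acids; it is an assumption made for this example and is not a codon usage table taken from Wada et al. or from the present specification.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Illustrative placeholder: one designated codon for a few amino acids only.
PREFERRED = {"L": "CTG", "R": "CGT", "A": "GCG", "G": "GGC", "K": "AAA"}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode(cds, preferred=PREFERRED):
    """Replace each codon by the designated synonymous codon, where one is listed."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    recoded = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        recoded.append(preferred.get(amino_acid, codon))
    return "".join(recoded)


if __name__ == "__main__":
    original = "ATGTTAAGAGCA"  # toy sequence encoding Met-Leu-Arg-Ala
    optimized = recode(original)
    print(optimized, translate(original) == translate(optimized))
```

A fuller treatment would use measured codon frequencies for the chosen host and would typically also check features such as restriction sites and secondary structure, which this sketch does not attempt.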

In another embodiment, the THAP-family expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerevisiae include pYepSec1 (Baldari, et al., (1987) 
EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et 
al., (1987) Gene 54:113-123), pYES2 (Invitrogen Corporation, San Diego, Calif.), and picZ 
(InVitrogen Corp, San Diego, Calif.). 
Alternatively, THAP-family or THAP domain proteins can be expressed in insect cells 
using baculovirus expression vectors. Baculovirus vectors available for expression of proteins in 
cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al. (1983) Mol. Cell Biol. 
3:2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170:31-39). In 
5 particularly preferred embodiments, THAP-family proteins are expressed according to Karniski et 
al, Am. J. Physiol. (1998) 275: F79-87. 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian cells 
using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 
(Seed, B. (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987) EMBO J. 6:187-195). 

When used in mammalian cells, the expression vector's control functions are often provided by viral 
regulatory elements. For example, commonly used promoters are derived from polyoma, 
Adenovirus 2, cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, J., Fritsch, E. F., and Maniatis, 
T. Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. In another embodiment, the 
recombinant mammalian expression vector is capable of directing expression of the nucleic acid 
preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express 
the nucleic acid). Tissue-specific regulatory elements are known in the art, and are further described 
below. 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That is, the 
DNA molecule is operatively linked to a regulatory sequence in a manner which allows for 
expression (by transcription of the DNA molecule) of an RNA molecule which is antisense to 
THAP-family mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the 
antisense orientation can be chosen which direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue specific or cell type specific expression of 
antisense RNA. The antisense expression vector can be in the form of a recombinant plasmid, 
phagemid or attenuated virus in which antisense nucleic acids are produced under the control of a 
high efficiency regulatory region, the activity of which can be determined by the cell type into 
which the vector is introduced. For a discussion of the regulation of gene expression using antisense 
genes see Weintraub, H. et al., Antisense RNA as a molecular tool for genetic analysis, 
Reviews-Trends in Genetics, Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant expression 
vector of the invention has been introduced. The terms "host cell" and "recombinant host cell" are 
used interchangeably herein. It is understood that such terms refer not only to the particular subject 
cell but to the progeny or potential progeny of such a cell. Because certain modifications may occur 
in succeeding generations due to either mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are still included within the scope of the term as used 
herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a THAP-family protein 
can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells or human cells). Other suitable host cells are 
known to those skilled in the art, including mouse 3T3 cells as further described in the Examples. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation" and 
"transfection" are intended to refer to a variety of art-recognized techniques for introducing foreign 
nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium chloride 
co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. Suitable 
methods for transforming or transfecting host cells can be found in Sambrook, et al. (Molecular 
Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the expression 
vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA 
into their genome. In order to identify and select these integrants, a gene that encodes a selectable 
marker (e.g., resistance to antibiotics) is generally introduced into the host cells along with the gene 
of interest. Preferred selectable markers include those which confer resistance to drugs, such as 
G418, hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be introduced 
into a host cell on the same vector as that encoding a THAP-family protein or can be introduced on 
a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug 
selection (e.g., cells that have incorporated the selectable marker gene will survive, while the other 
cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be 
used to produce (i.e., express) a THAP-family protein. Accordingly, the invention further provides 
methods for producing a THAP-family protein using the host cells of the invention. In one 
embodiment, the method comprises culturing the host cell of the invention (into which a recombinant 
expression vector encoding a THAP-family protein has been introduced) in a suitable medium such 
that a THAP-family protein is produced. In another embodiment, the method further comprises 
isolating a THAP-family protein from the medium or the host cell. 

In another embodiment, the invention encompasses a method comprising: providing a cell 
capable of expressing a THAP family or THAP domain polypeptide, or a biologically active 
fragment or homologue thereof, culturing said cell in a suitable medium such that a THAP-family 
or THAP domain protein is produced, and isolating or purifying the THAP-family or THAP domain 
protein from the medium or cell. 
The host cells of the invention can also be used to produce non-human transgenic animals, 
such as for the study of disorders in which THAP family proteins are implicated. For example, in 
one embodiment, a host cell of the invention is a fertilized oocyte or an embryonic stem cell into 
which THAP-family- or THAP domain-coding sequences have been introduced. Such host cells 
can then be used to create non-human transgenic animals in which exogenous THAP-family or 
THAP domain sequences have been introduced into their genome or homologous recombinant 
animals in which endogenous THAP-family or THAP domain sequences have been altered. Such 
animals are useful for studying the function and/or activity of a THAP-family or THAP domain 
polypeptide or fragment thereof and for identifying and/or evaluating modulators of a THAP-family 
or THAP domain activity. As used herein, a "transgenic animal" is a non-human animal, preferably 
a mammal, more preferably a rodent such as a rat or mouse, in which one or more of the cells of the 
animal includes a transgene. Other examples of transgenic animals include non-human primates, 
sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA which is 
integrated into the genome of a cell from which a transgenic animal develops and which remains in 
the genome of the mature animal, thereby directing the expression of an encoded gene product in 
one or more cell types or tissues of the transgenic animal. As used herein, a "homologous 
recombinant animal" is a non-human animal, preferably a mammal, more preferably a mouse, in 
which an endogenous THAP-family or THAP domain gene has been altered by homologous 
recombination between the endogenous gene and an exogenous DNA molecule introduced into a 
cell of the animal, e.g., an embryonic cell of the animal, prior to development of the animal. 
Methods for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are described, for 
example, in U.S. Pat. Nos. 4,736,866 and 4,870,009, both by Leder et al., U.S. Pat. No. 4,873,191 
by Wagner et al. and in Hogan, B., Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986). 
Gene Therapy Vectors 

Preferred vectors for administration to a subject can be constructed according to well-known 
methods. Vectors will comprise regulatory elements (e.g., promoter, enhancer, etc.) capable 
of directing the expression of the nucleic acid in the targeted cell. Thus, where a human cell is 
targeted, it is preferable to position the nucleic acid coding region adjacent to and under the control 
of a promoter that is capable of being expressed in a human cell. 

In various embodiments, the human cytomegalovirus (CMV) immediate early gene 
promoter, the SV40 early promoter, the Rous sarcoma virus long terminal repeat, β-actin, rat insulin 
promoter and glyceraldehyde-3-phosphate dehydrogenase can be used to obtain high-level 
expression of the coding sequence of interest. The use of other viral or mammalian cellular or 
bacterial phage promoters which are well-known in the art to achieve expression of a coding 
sequence of interest is contemplated as well, provided that the levels of expression are sufficient for 
a given purpose. By employing a promoter with well-known properties, the level and pattern of 
expression of the protein of interest following transfection or transformation can be optimized. 

Selection of a promoter that is regulated in response to specific physiologic or synthetic 
signals can permit inducible expression of the gene product. For example, in the case where 
expression of a transgene, or transgenes when a multicistronic vector is utilized, is toxic to the cells 
in which the vector is produced, it may be desirable to prohibit or reduce expression of one or 
more of the transgenes. Several inducible promoter systems are available for production of viral 
vectors where the transgene product may be toxic. 

The ecdysone system (Invitrogen, Carlsbad, CA) is one such system. This system is 
designed to allow regulated expression of a gene of interest in mammalian cells. It consists of a 
tightly regulated expression mechanism that allows virtually no basal level expression of the 
transgene, but over 200-fold inducibility. The system is based on the heterodimeric ecdysone 
receptor of Drosophila, and when ecdysone or an analog such as muristerone A binds to the 
receptor, the receptor activates a promoter to turn on expression of the downstream transgene, and high 
levels of mRNA transcripts are attained. In this system, both monomers of the heterodimeric 
receptor are constitutively expressed from one vector, whereas the ecdysone-responsive promoter 
which drives expression of the gene of interest is on another plasmid. Engineering of this type of 
system into the gene transfer vector of interest would therefore be useful. Cotransfection of 
plasmids containing the gene of interest and the receptor monomers in the producer cell line would 
then allow for the production of the gene transfer vector without expression of a potentially toxic 
transgene. At the appropriate time, expression of the transgene could be activated with ecdysone or 
muristerone A. Another inducible system that would be useful is the Tet-Off or Tet-On system 
(Clontech, Palo Alto, CA) originally developed by Gossen and Bujard (Gossen and Bujard, 1992; 
Gossen et al, 1995). This system also allows high levels of gene expression to be regulated in 
response to tetracycline or tetracycline derivatives such as doxycycline. In the Tet-On system, gene 
expression is turned on in the presence of doxycycline, whereas in the Tet-Off system, gene 
expression is turned on in the absence of doxycycline. These systems are based on two regulatory 
elements derived from the tetracycline resistance operon of E. coli: the tetracycline operator 
sequence, to which the tetracycline repressor binds, and the tetracycline repressor protein. The gene 
of interest is cloned into a plasmid behind a promoter that has tetracycline-responsive elements 
present in it. A second plasmid contains a regulatory element called the tetracycline-controlled 
transactivator, which is composed, in the Tet-Off system, of the VP16 domain from the herpes 
simplex virus and the wild-type tetracycline repressor. 

Thus, in the absence of doxycycline, transcription is constitutively on. In the Tet-On 
system, the tetracycline repressor is not wild-type and in the presence of doxycycline activates 
transcription. For gene therapy vector production, the Tet-Off system would be preferable so that 
the producer cells could be grown in the presence of tetracycline or doxycycline and prevent 
expression of a potentially toxic transgene, but when the vector is introduced to the patient, the gene 
expression would be constitutively on. 
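
The on/off behavior of the two tetracycline-regulated systems described above can be summarized as a small truth table. The following Python sketch is only a restatement of that logic; the function name `transgene_expressed` and the string labels are assumptions introduced for illustration, and real expression levels are graded rather than binary.

```python
def transgene_expressed(system: str, doxycycline_present: bool) -> bool:
    """Idealized on/off state of a tetracycline-regulated transgene."""
    if system == "Tet-Off":
        # Transactivator built on the wild-type repressor: active without drug.
        return not doxycycline_present
    if system == "Tet-On":
        # Modified repressor: activates transcription only with doxycycline.
        return doxycycline_present
    raise ValueError("system must be 'Tet-Off' or 'Tet-On'")


# Producer cells grown with doxycycline keep a Tet-Off transgene silent,
# while the same construct is on once the drug is absent.
producer_state = transgene_expressed("Tet-Off", doxycycline_present=True)
patient_state = transgene_expressed("Tet-Off", doxycycline_present=False)
```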

In some circumstances, it may be desirable to regulate expression of a transgene in a gene 
therapy vector. For example, different viral promoters with varying strengths of activity may be 
utilized depending on the level of expression desired. In mammalian cells, the CMV immediate 
early promoter is often used to provide strong transcriptional activation. Modified versions of the 
CMV promoter that are less potent have also been used when reduced levels of expression of the 
transgene are desired. When expression of a transgene in hematopoietic cells is desired, retroviral 
promoters such as the LTRs from MLV or MMTV are often used. Other viral promoters that may 
be used depending on the desired effect include SV40, RSV LTR, HIV-1 and HIV-2 LTR, 
adenovirus promoters such as from the E1A, E2A, or MLP region, AAV LTR, cauliflower mosaic 
virus, HSV-TK, and avian sarcoma virus. 

Similarly, tissue-specific promoters may be used to effect transcription in specific tissues or 
cells so as to reduce potential toxicity or undesirable effects to non-targeted tissues. For example, 
promoters such as the PSA, probasin, prostatic acid phosphatase or prostate-specific glandular 
kallikrein (hK2) may be used to target gene expression in the prostate. Similarly, promoters as 
follows may be used to target gene expression in other tissues. 

Tissue-specific promoters include: (a) pancreas: insulin, elastin, amylase, pdr-1, pdx-1, 
glucokinase; (b) liver: albumin, PEPCK, HBV enhancer, alpha fetoprotein, apolipoprotein C, 
alpha-1 antitrypsin, vitellogenin, NF-AB, transthyretin; (c) skeletal muscle: myosin H chain, muscle 
creatine kinase, dystrophin, calpain p94, skeletal alpha-actin, fast troponin I; (d) skin: keratin K6, 
keratin K1; (e) lung: CFTR, human cytokeratin 18 (K18), pulmonary surfactant proteins A, B and 
C, CC-10, Pi; (f) smooth muscle: sm22 alpha, SM-alpha-actin; (g) endothelium: endothelin-1, 
E-selectin, von Willebrand factor, TIE (Korhonen et al., 1995), KDR/flk-1; (h) melanocytes: 
tyrosinase; (i) adipose tissue: lipoprotein lipase (Zechner et al., 1988), adipsin (Spiegelman et al., 
1989), acetyl-CoA carboxylase (Pape and Kim, 1989), glycerophosphate dehydrogenase (Dani et 
al., 1989), adipocyte P2 (Hunt et al., 1986); and (j) blood: β-globin. 

In certain indications, it may be desirable to activate transcription at specific times after 
administration of the gene therapy vector. This may be done with such promoters as those that are 
hormone or cytokine regulatable. For example, in gene therapy applications where the indication is 
in a gonadal tissue where specific steroids are produced or routed to, use of androgen or estrogen 
regulated promoters may be advantageous. Such promoters that are hormone regulatable include 
MMTV, MT-1, ecdysone and RuBisco. Other hormone regulated promoters such as those 
responsive to thyroid, pituitary and adrenal hormones are expected to be useful in the present 
invention. Cytokine and inflammatory protein responsive promoters that could be used include K 
and T Kininogen (Kageyama et al., 1987), c-fos, TNF-alpha, C-reactive protein (Arcone et al., 
1988), haptoglobin (Oliviero et al., 1987), serum amyloid A2, C/EBP alpha, IL-1, IL-6 (Poli and 
Cortese, 1989), Complement C3 (Wilson et al., 1990), IL-8, alpha-1 acid glycoprotein (Prowse and 
Baumann, 1988), alpha-1 antitrypsin, lipoprotein lipase (Zechner et al., 1988), angiotensinogen (Ron 
et al., 1991), fibrinogen, c-jun (inducible by phorbol esters, TNF alpha, UV radiation, retinoic acid, 
and hydrogen peroxide), collagenase (induced by phorbol esters and retinoic acid), metallothionein 
(heavy metal and glucocorticoid inducible), Stromelysin (inducible by phorbol ester, interleukin-1 
and EGF), alpha-2 macroglobulin and alpha-1 antichymotrypsin. 

It is envisioned that cell cycle regulatable promoters may be useful in the present invention. 
For example, in a bi-cistronic gene therapy vector, use of a strong CMV promoter to drive 
expression of a first gene such as p16 that arrests cells in the G1 phase could be followed by 
expression of a second gene such as p53 under the control of a promoter that is active in the G1 
phase of the cell cycle, thus providing a "second hit" that would push the cell into apoptosis. Other 
promoters such as those of various cyclins, PCNA, galectin-3, E2F1, p53 and BRCA1 could be used. 

Tumor specific promoters such as osteocalcin, hypoxia-responsive element (HRE), 
MAGE-4, CEA, alpha-fetoprotein, GRP78/BiP and tyrosinase also may be used to regulate gene 
expression in tumor cells. Other promoters that could be used according to the present invention 
include Lac-regulatable, chemotherapy inducible (e.g. MDR), and heat (hyperthermia) inducible 
promoters, radiation-inducible (e.g., EGR (Joki et al., 1995)), alpha-inhibin, RNA pol III tRNA 
met and other amino acid promoters, U1 snRNA (Bartlett et al., 1996), MC-1, PGK, β-actin and 
alpha-globin. Many other promoters that may be useful are listed in Walther and Stein (1996). 

It is envisioned that any of the above promoters alone or in combination with another may 
be useful according to the present invention depending on the action desired. 

In addition, this list of promoters should not be considered to be exhaustive or limiting; 
those of skill in the art will know of other promoters that may be used in conjunction with the 
THAP-family and THAP domain nucleic acids and methods disclosed herein. 
Enhancers 

Enhancers are genetic elements that increase transcription from a promoter located at a 
distant position on the same molecule of DNA. Enhancers are organized much like promoters. That 
is, they are composed of many individual elements, each of which binds to one or more 
transcriptional proteins. The basic distinction between enhancers and promoters is operational. An 
enhancer region as a whole must be able to stimulate transcription at a distance; this need not be 
true of a promoter region or its component elements. On the other hand, a promoter must have one 
or more elements that direct initiation of RNA synthesis at a particular site and in a particular 
orientation, whereas enhancers lack these specificities. Promoters and enhancers are often 
overlapping and contiguous, often seeming to have a very similar modular organization. 

Below is a list of promoters additional to the tissue specific promoters listed above, cellular 
promoters/enhancers and inducible promoters/enhancers that could be used in combination with the 
nucleic acid encoding a gene of interest in an expression construct (list of enhancers, and Table 1). 
Additionally, any promoter/enhancer combination (as per the Eukaryotic Promoter Data Base 
EPDB) could also be used to drive expression of the gene. Eukaryotic cells can support cytoplasmic 
transcription from certain bacterial promoters if the appropriate bacterial polymerase is provided, 
either as part of the delivery complex or as an additional genetic expression construct. 

Suitable enhancers include: Immunoglobulin Heavy Chain; Immunoglobulin Light Chain; 
T-Cell Receptor; HLA DQ alpha and DQ beta; beta-Interferon; Interleukin-2; Interleukin-2 Receptor; 
MHC Class II 5; MHC Class II HLA-DRalpha; beta-Actin; Muscle Creatine Kinase; Prealbumin 
(Transthyretin); Elastase I; Metallothionein; Collagenase; Albumin Gene; alpha-Fetoprotein; 
-Globin; beta-Globin; c-fos; c-HA-ras; Insulin; Neural Cell Adhesion Molecule (NCAM); 
alpha 1-Antitrypsin; H2B (TH2B) Histone; Mouse or Type I Collagen; Glucose-Regulated Proteins (GRP94 
and GRP78); Rat Growth Hormone; Human Serum Amyloid A (SAA); Troponin I (TN I); 
Platelet-Derived Growth Factor; Duchenne Muscular Dystrophy; SV40; Polyoma; Retroviruses; 
Papilloma Virus; Hepatitis B Virus; Human Immunodeficiency Virus; Cytomegalovirus; and 
Gibbon Ape Leukemia Virus. 



TABLE 1 

Element                                   Inducer 
MT II                                     Phorbol Ester (TPA), Heavy metals 
MMTV (mouse mammary tumor virus)          Glucocorticoids 
β-Interferon                              poly(rI)X, poly(rc) 
Adenovirus 5 E2                           E1a 
c-jun                                     Phorbol Ester (TPA), H2O2 
Collagenase                               Phorbol Ester (TPA) 
Stromelysin                               Phorbol Ester (TPA), IL-1 
SV40                                      Phorbol Ester (TPA) 
Murine MX Gene                            Interferon, Newcastle Disease Virus 
GRP78 Gene                                A23187 
α-2-Macroglobulin                         IL-6 
Vimentin                                  Serum 
MHC Class I Gene H-2kB                    Interferon 
HSP70                                     E1a, SV40 Large T Antigen 
Insulin E Box                             Glucose 
Proliferin                                Phorbol Ester-TPA 
Tumor Necrosis Factor                     FMA 
Thyroid Stimulating Hormone alpha Gene    Thyroid Hormone 



In preferred embodiments of the invention, the expression construct comprises a virus or 
engineered construct derived from a viral genome. The ability of certain viruses to enter cells via 
receptor-mediated endocytosis, to integrate into the host cell genome and to express viral genes stably 
and efficiently has made them attractive candidates for the transfer of foreign genes into 
mammalian cells (Ridgeway, 1988; Nicolas and Rubenstein, 1988; Baichwal and Sugden, 1986; 
Temin, 1986). The first viruses used as gene vectors were DNA viruses including the papovaviruses 
(simian virus 40, bovine papilloma virus, and polyoma) (Ridgeway, 1988; Baichwal and Sugden, 
1986) and adenoviruses (Ridgeway, 1988; Baichwal and Sugden, 1986). These have a relatively 
low capacity for foreign DNA sequences and have a restricted host spectrum. 
Furthermore, their oncogenic potential and cytopathic effects in permissive cells raise 
safety concerns. They can accommodate only up to 8 kB of foreign genetic material but can be 
readily introduced in a variety of cell lines and laboratory animals (Nicolas and Rubenstein, 1988; 
Temin, 1986). 
Polyadenylation Signals 

Where a cDNA insert is employed, one will typically desire to include a polyadenylation 
signal to effect proper polyadenylation of the gene transcript. The nature of the polyadenylation 
signal is not believed to be crucial to the successful practice of the invention, and any such 
sequence may be employed such as human or bovine growth hormone and SV40 polyadenylation 
signals. Also contemplated as an element of the expression cassette is a terminator. These elements 
can serve to enhance message levels and to minimize read through from the cassette into other 
sequences. 
Antisense Constructs 

The term "antisense nucleic acid" is intended to refer to the oligonucleotides 
complementary to the base sequences of DNA and RNA. Antisense oligonucleotides, when 
introduced into a target cell, specifically bind to their target nucleic acid and interfere with 
transcription, RNA processing, transport and/or translation. Targeting double-stranded (ds) DNA 
with oligonucleotide leads to triple-helix formation; targeting RNA will lead to double-helix 
formation. 

Antisense constructs may be designed to bind to the promoter and other control regions, 
exons, introns or even exon-intron boundaries of a gene. Antisense RNA constructs, or DNA 
encoding such antisense RNAs, may be employed to inhibit gene transcription or translation or both 
within a host cell, either in vitro or in vivo, such as within a host animal, including a human subject. 
Nucleic acid sequences comprising "complementary nucleotides" are those which are capable of 
base-pairing according to the standard Watson-Crick complementarity rules. That is, the larger 
purines will base pair with the smaller pyrimidines to form only combinations of guanine paired 
with cytosine (G:C) and adenine paired with either thymine (A:T), in the case of DNA, or 
uracil (A:U), in the case of RNA. 
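
The Watson-Crick pairing rules above determine the antisense sequence for any target. A minimal Python sketch follows; the function name `antisense_rna` and the example sequence are assumptions chosen for illustration and are not THAP-family sequences.

```python
# Watson-Crick partners for RNA bases (G:C and A:U).
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense_rna(mrna: str) -> str:
    """Return the antisense RNA, written 5'->3', for an mRNA written 5'->3'."""
    # The two strands pair antiparallel, so the target is read in reverse
    # while each base is swapped for its partner.
    return "".join(RNA_COMPLEMENT[base] for base in reversed(mrna.upper()))
```

For a DNA target the same approach applies with thymine in place of uracil.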

As used herein, the terms "complementary" or "antisense sequences" mean nucleic acid 
sequences that are substantially complementary over their entire length and have very few base 
mismatches. For example, nucleic acid sequences of fifteen bases in length may be termed 
complementary when they have a complementary nucleotide at thirteen or fourteen positions with 
only single or double mismatches. Naturally, nucleic acid sequences which are "completely 
complementary" will be nucleic acid sequences which are entirely complementary throughout their 
entire length and have no base mismatches. 
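
The distinction between "complementary" and "completely complementary" sequences can be checked by counting mismatched positions. The sketch below is illustrative only: the function names, the default tolerance of two mismatches (taken from the fifteen-base example above) and the test sequences are assumptions, and the specification does not fix a single numerical threshold for other lengths.

```python
# Watson-Crick partners for DNA bases (G:C and A:T).
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def count_mismatches(oligo: str, target: str) -> int:
    """Count non-pairing positions between two equal-length 5'->3' strands."""
    if len(oligo) != len(target):
        raise ValueError("sequences must have the same length")
    # Strands anneal antiparallel: oligo position i faces the target read
    # from its 3' end, hence the reversed() on the target.
    return sum(DNA_COMPLEMENT[o] != t for o, t in zip(oligo, reversed(target)))


def is_complementary(oligo: str, target: str, max_mismatches: int = 2) -> bool:
    """True when the strands pair with at most max_mismatches mismatches."""
    return count_mismatches(oligo, target) <= max_mismatches


def is_completely_complementary(oligo: str, target: str) -> bool:
    """True only when every position pairs."""
    return count_mismatches(oligo, target) == 0
```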

While all or part of the gene sequence may be employed in the context of antisense 
construction, statistically, any sequence 17 bases long should occur only once in the human genome 
and, therefore, suffice to specify a unique target sequence. 
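
The statistical argument for a 17-base target can be made concrete with a short calculation. The sketch below assumes a genome of roughly 3 x 10^9 base pairs with random, equally frequent bases; both the genome size constant and the uniform-composition model are simplifying assumptions introduced here, and real genomes contain repeats that violate them.

```python
# Approximate haploid human genome size in base pairs (an assumption for
# this estimate, not a figure taken from the specification).
HUMAN_GENOME_BP = 3_000_000_000


def distinct_sequences(length: int) -> int:
    """Number of different sequences of the given length over A, C, G, T."""
    return 4 ** length


def expected_random_occurrences(length: int, genome_bp: int = HUMAN_GENOME_BP) -> float:
    """Expected chance matches of one specific sequence in a random genome."""
    return genome_bp / distinct_sequences(length)
```

Under these assumptions there are more possible 17-mers than positions in the genome, so a given 17-base sequence is expected to occur by chance less than once.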

Although shorter oligomers are easier to make and increase in vivo accessibility, numerous 
other factors are involved in determining the specificity of hybridization. Both binding affinity and 
sequence specificity of an oligonucleotide to its complementary target increase with increasing 
length. It is contemplated that oligonucleotides of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 
more base pairs will be used. One can readily determine whether a given antisense nucleic acid is 
effective at targeting of the corresponding host cell gene simply by testing the constructs in vitro to 
determine whether the endogenous gene's function is affected or whether the expression of related 
genes having complementary sequences is affected. 

In certain embodiments, one may wish to employ antisense constructs which include other 
elements, for example, those which include C-5 propyne pyrimidines. 

Oligonucleotides which contain C-5 propyne analogues of uridine and cytidine have been 
shown to bind RNA with high affinity and to be potent antisense inhibitors of gene expression 
(Wagner et al., 1993). 
Ribozyme Constructs 

As an alternative to targeted antisense delivery, targeted ribozymes may be used. The term 
"ribozyme" refers to an RNA-based enzyme capable of targeting and cleaving particular base 
sequences in oncogene DNA and RNA. Ribozymes either can be targeted directly to cells, in the 
form of RNA oligonucleotides incorporating ribozyme sequences, or introduced into the cell as an 
expression construct encoding the desired ribozymal RNA. Ribozymes may be used and applied in 
much the same way as described for antisense nucleic acids. 
Methods of Gene Transfer 

In order to mediate the effect of transgene expression in a cell, it will be necessary to 
transfer the therapeutic expression constructs of the present invention into a cell. This section 
provides a discussion of methods and compositions of viral production and viral gene transfer, as 
well as non-viral gene transfer methods. 
(i) Viral Vector-Mediated Transfer 

The THAP-family gene is incorporated into a viral infectious particle to mediate gene 
transfer to a cell. Additional expression constructs encoding other therapeutic agents as described 
herein may also be transferred via viral transduction using infectious viral particles, for example, by 
transformation with an adenovirus vector of the present invention as described herein below. 
Alternatively, retroviral or bovine papilloma virus may be employed, both of which permit 
permanent transformation of a host cell with a gene(s) of interest. Thus, in one example, viral 
infection of cells is used in order to deliver therapeutically significant genes to a cell. Typically, the 
virus simply will be exposed to the appropriate host cell under physiologic conditions, permitting 
uptake of the virus. Though adenovirus is exemplified, the present methods may be advantageously 
employed with other viral or non-viral vectors, as discussed below. 
Adenovirus 

Adenovirus is particularly suitable for use as a gene transfer vector because of its mid-sized 
DNA genome, ease of manipulation, high titer, wide target-cell range, and high infectivity. The 
roughly 36 kB viral genome is bounded by 100-200 base pair (bp) inverted terminal repeats (ITR), 
in which are contained cis acting elements necessary for viral DNA replication and packaging. The 
early (E) and late (L) regions of the genome that contain different transcription units are divided by 
the onset of viral DNA replication. 

The E1 region (E1A and E1B) encodes proteins responsible for the regulation of 
transcription of the viral genome and a few cellular genes. The expression of the E2 region (E2A 
and E2B) results in the synthesis of the proteins for viral DNA replication. 

These proteins are involved in DNA replication, late gene expression, and host cell shut off 
(Renan, 1990). The products of the late genes (L1, L2, L3, L4 and L5), including the majority of the 
viral capsid proteins, are expressed only after significant processing of a single primary transcript 
issued by the major late promoter (MLP). The MLP (located at 16.8 map units) is particularly 
efficient during the late phase of infection, and all the mRNAs issued from this promoter possess a 
5' tripartite leader (TL) sequence which makes them preferred mRNAs for translation. 

In order for adenovirus to be optimized for gene therapy, it is necessary to maximize the 
carrying capacity so that large segments of DNA can be included. It also is very desirable to reduce 
the toxicity and immunologic reaction associated with certain adenoviral products. The two goals 
are, to an extent, coterminous in that elimination of adenoviral genes serves both ends. By practice 
of the present invention, it is possible to achieve both these goals while retaining the ability to 
manipulate the therapeutic constructs with relative ease. 

The large displacement of DNA is possible because the cis elements required for viral DNA 
replication all are localized in the inverted terminal repeats (ITR) (100-200 bp) at either end of the 
linear viral genome. Plasmids containing ITR's can replicate in the presence of a non-defective 
adenovirus (Hay et al., 1984). Therefore, inclusion of these elements in an adenoviral vector should 
permit replication. 

In addition, the packaging signal for viral encapsidation is localized between 194-385 bp
(0.5-1.1 map units) at the left end of the viral genome (Hearing et al., 1987). This signal mimics the
protein recognition site in bacteriophage λ DNA where a specific sequence close to the left end, but
outside the cohesive end sequence, mediates the binding to proteins that are required for insertion of
the DNA into the head structure. E1 substitution vectors of Ad have demonstrated that a 450 bp
(0-1.25 map units) fragment at the left end of the viral genome could direct packaging in 293 cells
(Levrero et al., 1991).
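
As a worked illustration of the coordinates quoted above, the following Python sketch converts between base-pair positions and map units. It assumes, for illustration only, a genome of roughly 36,000 bp divided into 100 map units, so that one map unit is about 360 bp; the constant and the function names are hypothetical and are not taken from the cited references.

```python
# Assumed round figure for the adenoviral genome length (illustrative only).
GENOME_LENGTH_BP = 36_000


def bp_to_map_units(position_bp, genome_length_bp=GENOME_LENGTH_BP):
    """Express a base-pair position as a percentage of genome length (map units)."""
    return 100.0 * position_bp / genome_length_bp


def map_units_to_bp(map_units, genome_length_bp=GENOME_LENGTH_BP):
    """Convert map units back to the nearest base-pair position."""
    return round(map_units * genome_length_bp / 100.0)


if __name__ == "__main__":
    # Boundaries of the packaging signal quoted in the text.
    for position in (194, 385):
        print(position, "bp =", round(bp_to_map_units(position), 1), "map units")
    # Right-hand boundary of the left-end fragment quoted in the text.
    print(1.25, "map units =", map_units_to_bp(1.25), "bp")
```

Under this assumption, 194 bp and 385 bp correspond to about 0.5 and 1.1 map units, and 1.25 map units to about 450 bp, in line with the figures quoted in the text.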

Previously, it has been shown that certain regions of the adenoviral genome can be 
incorporated into the genome of mammalian cells and the genes encoded thereby expressed. These 
cell lines are capable of supporting the replication of an adenoviral vector that is deficient in the
adenoviral function encoded by the cell line. There also have been reports of complementation of 
replication deficient adenoviral vectors by "helping" vectors, e.g., wild-type virus or conditionally 
defective mutants. 

Replication-deficient adenoviral vectors can be complemented, in trans, by helper virus.
This observation alone does not permit isolation of the replication-deficient vectors, however, since
the presence of helper virus, needed to provide replicative functions, would contaminate any 
preparation. Thus, an additional element was needed that would add specificity to the replication 
and/or packaging of the replication-deficient vector. That element, as provided for in the present 
invention, derives from the packaging function of adenovirus. 

It has been shown that a packaging signal for adenovirus exists in the left end of the
conventional adenovirus map (Tibbetts, 1977). Later studies showed that a mutant with a deletion in
the E1A (194-358 bp) region of the genome grew poorly even in a cell line that complemented the
early (E1A) function (Hearing and Shenk, 1983). When a compensating adenoviral DNA (0-353 bp)
was recombined into the right end of the mutant, the virus was packaged normally. Further
mutational analysis identified a short, repeated, position-dependent element in the left end of the
Ad5 genome. One copy of the repeat was found to be sufficient for efficient packaging if present at 
either end of the genome, but not when moved towards the interior of the Ad5 DNA molecule 
(Hearing et al., 1987). 

By using mutated versions of the packaging signal, it is possible to create helper viruses 
that are packaged with varying efficiencies. Typically, the mutations are point mutations or
deletions. When helper viruses with low efficiency packaging are grown in helper cells, the virus is
packaged, albeit at reduced rates compared to wild-type virus, thereby permitting propagation of the
helper. When these helper viruses are grown in cells along with virus that contains wild-type
packaging signals, however, the wild-type packaging signals are recognized preferentially over the
mutated versions. Given a limiting amount of packaging factor, the virus containing the wild-type
signals is packaged selectively when compared to the helpers. If the preference is great enough,
stocks approaching homogeneity should be achieved. 
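
The selective packaging described above can be illustrated with a simple proportional-competition sketch in Python. The model is an assumption made here for illustration, not a method taken from the source: each genome competes for a limiting packaging factor with a weight equal to its abundance multiplied by a relative packaging efficiency, with the wild-type signal assigned an efficiency of 1.0. The function and parameter names are hypothetical.

```python
def packaged_vector_fraction(vector_genomes, helper_genomes,
                             helper_efficiency, vector_efficiency=1.0):
    """Fraction of packaged particles that carry the vector genome.

    Assumed model: packaging factor is limiting, so each genome type is
    packaged in proportion to (abundance x relative packaging efficiency).
    """
    vector_weight = vector_genomes * vector_efficiency
    helper_weight = helper_genomes * helper_efficiency
    total_weight = vector_weight + helper_weight
    if total_weight == 0:
        raise ValueError("no packageable genomes present")
    return vector_weight / total_weight


if __name__ == "__main__":
    # Equal genome copy numbers; only the helper's packaging signal is varied.
    for efficiency in (1.0, 0.1, 0.01):
        fraction = packaged_vector_fraction(100, 100, efficiency)
        print(f"helper efficiency {efficiency}: vector fraction {fraction:.3f}")
```

In this toy model, a helper whose mutated signal is recognized far less efficiently than the wild-type signal yields a stock that approaches homogeneity for the vector, which is the qualitative behavior the text describes.
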
Retrovirus 

The retroviruses are a group of single-stranded RNA viruses characterized by an ability to 
convert their RNA to double-stranded DNA in infected cells by a process of reverse-transcription
(Coffin, 1990). The resulting DNA then stably integrates into cellular chromosomes as a provirus
and directs synthesis of viral proteins.
The integration results in the retention of the viral gene sequences in the recipient cell and
its descendants. The retroviral genome contains three genes - gag, pol and env - that code for capsid
proteins, polymerase enzyme, and envelope components, respectively. A sequence found upstream
from the gag gene, termed Ψ, functions as a signal for packaging of the genome into virions. Two
long terminal repeat (LTR) sequences are present at the 5' and 3' ends of the viral genome. These
contain strong promoter and enhancer sequences and also are required for integration in the host 
cell genome (Coffin, 1990). 

In order to construct a retroviral vector, a nucleic acid encoding a promoter is inserted into 
the viral genome in the place of certain viral sequences to produce a virus that is replication- 
defective. In order to produce virions, a packaging cell line containing the gag, pol and env genes 
but without the LTR and Ψ components is constructed (Mann et al., 1983). When a recombinant
plasmid containing a human cDNA, together with the retroviral LTR and Ψ sequences, is introduced
into this cell line (by calcium phosphate precipitation for example), the Ψ sequence allows the RNA
transcript of the recombinant plasmid to be packaged into viral particles, which are then secreted 
into the culture media (Nicolas and Rubenstein, 1988; Temin, 1986; Mann et al., 1983). The media 
containing the recombinant retroviruses is collected, optionally concentrated, and used for gene 
transfer. Retroviral vectors are able to infect a broad variety of cell types. However, integration and 
stable expression of many types of retroviruses require the division of host cells (Paskind et al., 
1975). 

An approach designed to allow specific targeting of retrovirus vectors recently was 
developed based on the chemical modification of a retrovirus by the chemical addition of galactose 
residues to the viral envelope. This modification could permit the specific infection of cells such as 
hepatocytes via asialoglycoprotein receptors, should this be desired. 

A different approach to targeting of recombinant retroviruses was designed in which 
biotinylated antibodies against a retroviral envelope protein and against a specific cell receptor were 
used. The antibodies were coupled via the biotin components by using streptavidin (Roux et al., 
1989). Using antibodies against major histocompatibility complex class I and class II antigens, the 
infection of a variety of human cells that bore those surface antigens was demonstrated with an 
ecotropic virus in vitro (Roux et al., 1989). 

Adeno-associated Virus 

AAV utilizes a linear, single-stranded DNA of about 4700 base pairs. Inverted terminal 
repeats flank the genome. Two genes are present within the genome, giving rise to a number of 
distinct gene products. The first, the cap gene, produces three different virion proteins (VP), 
designated VP-1, VP-2 and VP-3.

The second, the rep gene, encodes four non-structural proteins (NS). One or more of these 
rep gene products is responsible for transactivating AAV transcription. 



The three promoters in AAV are designated by their location, in map units, in the genome. 
These are, from left to right, p5, p19 and p40. Transcription gives rise to six transcripts, two
initiated at each of three promoters, with one of each pair being spliced. 

The splice site, derived from map units 42-46, is the same for each transcript. The four
non-structural proteins apparently are derived from the longer of the transcripts, and three virion
proteins all arise from the smallest transcript. 

AAV is not associated with any pathologic state in humans. Interestingly, for efficient 
replication, AAV requires "helping" functions from viruses such as herpes simplex virus I and II,
cytomegalovirus, pseudorabies virus and, of course, adenovirus.
The best characterized of the helpers is adenovirus, and many "early" functions for this
virus have been shown to assist with AAV replication. Low level expression of AAV rep proteins is
believed to hold AAV structural expression in check, and helper virus infection is thought to 
remove this block. 

The terminal repeats of the AAV vector can be obtained by restriction endonuclease 
digestion of AAV or a plasmid such as p201, which contains a modified AAV genome (Samulski et
al., 1987), or by other methods known to the skilled artisan, including but not limited to chemical or
enzymatic synthesis of the terminal repeats based upon the published sequence of AAV. The
ordinarily skilled artisan can determine, by well-known methods such as deletion analysis, the
minimum sequence or part of the AAV ITRs which is required to allow function, i.e., stable and
site-specific integration.

The ordinarily skilled artisan also can determine which minor modifications of the sequence 
can be tolerated while maintaining the ability of the terminal repeats to direct stable, site-specific 
integration. 

AAV-based vectors have proven to be safe and effective vehicles for gene delivery in vitro,
and these vectors are being developed and tested in pre-clinical and clinical stages for a wide range
of applications in potential gene therapy, both ex vivo and in vivo (Carter and Flotte, 1996;
Chatterjee et al., 1995; Ferrari et al., 1996; Fisher et al., 1996; Flotte et al., 1993; Goodman et al.,
1994; Kaplitt et al., 1994; 1996; Kessler et al., 1996; Koeberl et al., 1997; Mizukami et al., 1996;
Xiao et al., 1996).

AAV-mediated efficient gene transfer and expression in the lung has led to clinical trials for
the treatment of cystic fibrosis (Carter and Flotte, 1996; Flotte et al., 1993). Similarly, the prospects
for treatment of muscular dystrophy by AAV-mediated gene delivery of the dystrophin gene to
skeletal muscle, of Parkinson's disease by tyrosine hydroxylase gene delivery to the brain, of
hemophilia B by Factor IX gene delivery to the liver, and potentially of myocardial infarction by
vascular endothelial growth factor gene delivery to the heart, appear promising since AAV-mediated
transgene expression in these organs has recently been shown to be highly efficient (Fisher et al.,
1996; Flotte et al., 1993; Kaplitt et al., 1994; 1996; Koeberl et al., 1997; McCown et al., 1996; Ping
et al., 1996; and Xiao et al., 1996).
Other Viral Vectors 

Other viral vectors may be employed as expression constructs in the present invention. 
Vectors derived from viruses such as vaccinia virus (Ridgeway, 1988; Baichwal and Sugden, 1986;
Coupar et al., 1988) and hepatitis B viruses have also been developed and are useful in the present
invention. They offer several attractive features for various mammalian cells (Friedmann, 1989;
Ridgeway, 1988; Baichwal and Sugden, 1986; Coupar et al., 1988; and Horwich et al., 1990).

With the recent recognition of defective hepatitis B viruses, new insight was gained into the
structure-function relationship of different viral sequences. In vitro studies showed that the virus
could retain the ability for helper-dependent packaging and reverse transcription despite the deletion
of up to 80% of its genome (Horwich et al., 1990). This suggested that large portions of the genome
could be replaced with foreign genetic material. Chang et al. recently introduced the
chloramphenicol acetyltransferase (CAT) gene into duck hepatitis B virus genome in the place of
the polymerase, surface, and pre-surface coding sequences. It was cotransfected with wild-type
virus into an avian hepatoma cell line. Culture media containing high titers of the recombinant virus
were used to infect primary duckling hepatocytes. Stable CAT gene expression was detected for at
least 24 days after transfection (Chang et al., 1991).

In still further embodiments of the present invention, the nucleic acids to be delivered are
housed within an infective virus that has been engineered to express a specific binding ligand. The
virus particle will thus bind specifically to the cognate receptors of the target cell and deliver the
contents to the cell. A novel approach designed to allow specific targeting of retrovirus vectors was
recently developed based on the chemical modification of a retrovirus by the chemical addition of
lactose residues to the viral envelope. This modification can permit the specific infection of
hepatocytes via asialoglycoprotein receptors.

Another approach to targeting of recombinant retroviruses was designed in which 
biotinylated antibodies against a retroviral envelope protein and against a specific cell receptor were 
used. The antibodies were coupled via the biotin components by using streptavidin (Roux et al., 
1989). Using antibodies against major histocompatibility complex class I and class II antigens, they
demonstrated the infection of a variety of human cells that bore those surface antigens with an
ecotropic virus in vitro (Roux et al., 1989).
(ii) Non-viral Transfer 

DNA constructs of the present invention are generally delivered to a cell. In certain 
situations, the nucleic acid to be transferred is non-infectious, and can be transferred using non-viral 
methods.

Several non-viral methods for the transfer of expression constructs into cultured
mammalian cells are contemplated by the present invention. These include calcium phosphate
precipitation (Graham and Van Der Eb, 1973; Chen and Okayama, 1987; Rippe et al., 1990),
DEAE-dextran (Gopal, 1985), electroporation (Tur-Kaspa et al., 1986; Potter et al., 1984), direct
microinjection (Harland and Weintraub, 1985), DNA-loaded liposomes (Nicolau and Sene, 1982;
Fraley et al., 1979), cell sonication (Fechheimer et al., 1987), gene bombardment using high
velocity microprojectiles (Yang et al., 1990), and receptor-mediated transfection (Wu and Wu,
1987; Wu and Wu, 1988).

Once the construct has been delivered into the cell, the nucleic acid encoding the therapeutic
gene may be positioned and expressed at different sites. In certain embodiments, the nucleic acid
encoding the therapeutic gene may be stably integrated into the genome of the cell. This integration
may be in the cognate location and orientation via homologous recombination (gene replacement)
or it may be integrated in a random, non-specific location (gene augmentation). In yet further
embodiments, the nucleic acid may be stably maintained in the cell as a separate, episomal segment
of DNA. Such nucleic acid segments or "episomes" encode sequences sufficient to permit
maintenance and replication independent of or in synchronization with the host cell cycle.
How the expression construct is delivered to a cell and where in the cell the nucleic acid
remains is dependent on the type of expression construct employed.

In a particular embodiment of the invention, the expression construct may be entrapped in a 
liposome. Liposomes are vesicular structures characterized by a phospholipid bilayer membrane 
and an inner aqueous medium. Multilamellar liposomes have multiple lipid layers separated by 
aqueous medium. They form spontaneously when phospholipids are suspended in an excess of
aqueous solution. The lipid components undergo self-rearrangement before the formation of closed
structures and entrap water and dissolved solutes between the lipid bilayers (Ghosh and Bachhawat,
1991). The addition of DNA to cationic liposomes causes a topological transition from liposomes to
optically birefringent liquid-crystalline condensed globules (Radler et al., 1997). These DNA-lipid
complexes are potential non-viral vectors for use in gene therapy.

Liposome-mediated nucleic acid delivery and expression of foreign DNA in vitro has been 
very successful. Using the β-lactamase gene, Wong et al. (1980) demonstrated the feasibility of
liposome-mediated delivery and expression of foreign DNA in cultured chick embryo, HeLa, and
hepatoma cells. Nicolau et al. (1987) accomplished successful liposome-mediated gene transfer in
rats after intravenous injection. Also included are various commercial approaches involving
"lipofection" technology. 

In certain embodiments of the invention, the liposome may be complexed with a 
hemagglutinating virus (HVJ). This has been shown to facilitate fusion with the cell membrane and 
promote cell entry of liposome-encapsulated DNA (Kaneda et al., 1989).
In other embodiments, the liposome may be complexed or employed in conjunction with
nuclear nonhistone chromosomal proteins (HMG-1) (Kato et al., 1991). In yet further embodiments,
the liposome may be complexed or employed in conjunction with both HVJ and HMG-1. Because
such expression constructs have been successfully employed in transfer and expression of nucleic
acid in vitro and in vivo, they are applicable for the present invention.

Other vector delivery systems which can be employed to deliver a nucleic acid encoding a 
therapeutic gene into cells are receptor-mediated delivery vehicles. These take advantage of the 
selective uptake of macromolecules by receptor-mediated endocytosis in almost all eukaryotic cells.
Because of the cell type-specific distribution of various receptors, the delivery can be highly
specific (Wu and Wu, 1993). 

Receptor-mediated gene targeting vehicles generally consist of two components: a cell 
receptor-specific ligand and a DNA-binding agent. Several ligands have been used for receptor- 
mediated gene transfer. The most extensively characterized ligands are asialoorosomucoid (ASOR) 
(Wu and Wu, 1987) and transferrin (Wagner et al., 1990).

Recently, a synthetic neoglycoprotein, which recognizes the same receptor as ASOR, has 
been used as a gene delivery vehicle (Ferkol et al., 1993; Perales et al., 1994) and epidermal growth 
factor (EGF) has also been used to deliver genes to squamous carcinoma cells (Myers, EPO
0273085).

In other embodiments, the delivery vehicle may comprise a ligand and a liposome. For 
example, Nicolau et al. (1987) employed lactosyl-ceramide, a galactose-terminal asialoganglioside,
incorporated into liposomes and observed an increase in the uptake of the insulin gene by
hepatocytes. Thus, it is feasible that a nucleic acid encoding a therapeutic gene also may be
specifically delivered into a cell type such as prostate, epithelial or tumor cells, by any number of
receptor-ligand systems with or without liposomes. For example, the human prostate-specific
antigen (Watt et al., 1986) may be used as the receptor for mediated delivery of a nucleic acid in
prostate tissue. 

In another embodiment of the invention, the expression construct may simply consist of 
naked recombinant DNA or plasmids. Transfer of the construct may be performed by any of the 
methods mentioned above which physically or chemically permeabilize the cell membrane. This is 
applicable particularly for transfer in vitro; however, it may be applied for in vivo use as well.
Dubensky et al. (1984) successfully injected polyomavirus DNA in the form of CaPO4 precipitates
into liver and spleen of adult and newborn mice demonstrating active viral replication and acute
infection.

Benvenisty and Neshif (1986) also demonstrated that direct intraperitoneal injection of 
CaPO4-precipitated plasmids results in expression of the transfected genes. It is envisioned that
DNA encoding a CAM may also be transferred in a similar manner in vivo and express CAM. 

Another embodiment of the invention for transferring a naked DNA expression construct 
into cells may involve particle bombardment. This method depends on the ability to accelerate 
DNA coated microprojectiles to a high velocity allowing them to pierce cell membranes and enter 
cells without killing them (Klein et al., 1987). Several devices for accelerating small particles have
been developed. One such device relies on a high voltage discharge to generate an electrical
current, which in turn provides the motive force (Yang et al., 1990). The microprojectiles used have
consisted of biologically inert substances such as tungsten or gold beads. 
Antibodies 

Polyclonal anti-THAP-family or anti-THAP domain antibodies can be prepared as
described above by immunizing a suitable subject with a THAP-family or THAP domain
immunogen. The anti-THAP-family or anti-THAP domain antibody titer in the immunized subject
can be monitored over time by standard techniques, such as with an enzyme linked immunosorbent
assay (ELISA) using immobilized THAP-family or THAP domain protein. If desired, the antibody
molecules directed against THAP-family can be isolated from the mammal (e.g., from the blood)
and further purified by well known techniques, such as protein A chromatography to obtain the IgG 
fraction. At an appropriate time after immunization, e.g., when the anti-THAP-family antibody 
titers are highest, antibody-producing cells can be obtained from the subject and used to prepare 
monoclonal antibodies by standard techniques, such as those described in the following references:
the hybridoma technique originally described by Kohler and Milstein (1975, Nature 256:495-497)
(see also, Brown et al. (1981) J. Immunol. 127:539-46; Brown et al. (1980) J. Biol. Chem. 
255:4980-83 ; Yeh et al. (1976) PNAS 76:2927-31; and Yeh et al. (1982) Int. J. Cancer 29:269-75), 
the more recent human B cell hybridoma technique (Kozbor et al. (1983) Immunol Today 4:72), the 
EBV-hybridoma technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy, Alan
R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing monoclonal antibody
hybridomas is well known (see generally R. H. Kenneth, in Monoclonal Antibodies: A New 
Dimension In Biological Analyses, Plenum Publishing Corp., New York, N.Y. (1980); E. A. Lerner 
(1981) Yale J. Biol. Med., 54:387-402; M. L. Gefter et al. (1977) Somatic Cell Genet. 3:231-36). 
Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes)
from a mammal immunized with a THAP-family immunogen as described above, and the culture
supernatants of the resulting hybridoma cells are screened to identify a hybridoma producing a 
monoclonal antibody that binds THAP-family. 

Any of the many well known protocols used for fusing lymphocytes and immortalized cell
lines can be applied for the purpose of generating an anti-THAP-family or anti-THAP domain
monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature 266:550-52; Gefter et al. Somatic Cell
Genet., cited supra; Lerner, Yale J. Biol. Med., cited supra; Kenneth, Monoclonal Antibodies, cited
supra). Moreover, the ordinarily skilled worker will appreciate that there are many variations of 
such methods which also would be useful. Typically, the immortal cell line (e.g., a myeloma cell 
line) is derived from the same mammalian species as the lymphocytes. For example, murine 
hybridomas can be made by fusing lymphocytes from a mouse immunized with an immunogenic
preparation of the present invention with an immortalized mouse cell line. Preferred immortal cell 
lines are mouse myeloma cell lines that are sensitive to culture medium containing hypoxanthine,
aminopterin and thymidine ("HAT medium"). Any of a number of myeloma cell lines can be used
as a fusion partner according to standard techniques, e.g., the P3-NS1/1-Ag4-1, P3-x63-Ag8.653 or
Sp2/0-Ag14 myeloma lines. These myeloma lines are available from ATCC. Typically,
HAT-sensitive mouse myeloma cells are fused to mouse splenocytes using polyethylene glycol ("PEG").
Hybridoma cells resulting from the fusion are then selected using HAT medium, which kills
unfused and unproductively fused myeloma cells (unfused splenocytes die after several days
because they are not transformed). Hybridoma cells producing a monoclonal antibody of the
invention are detected by screening the hybridoma culture supernatants for antibodies that bind a
THAP-family or THAP domain protein, e.g., using a standard ELISA assay.
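
The selection logic described above can be summarized in a short Python sketch. It is a simplified illustration under stated assumptions: a cell persists in HAT medium only if it is both immortal and not HAT-sensitive, and a productive fusion is assumed to combine the immortality of the myeloma parent with the HAT resistance of the splenocyte parent. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    name: str
    immortal: bool        # can divide indefinitely in culture
    hat_sensitive: bool   # killed by HAT medium


def survives_hat_selection(cell):
    """Assumed rule: long-term survival requires immortality and HAT resistance."""
    return cell.immortal and not cell.hat_sensitive


def productive_fusion(myeloma, splenocyte):
    """Assumed outcome of a productive fusion: traits of both parents combine."""
    return Cell(
        name="hybridoma",
        immortal=myeloma.immortal or splenocyte.immortal,
        hat_sensitive=myeloma.hat_sensitive and splenocyte.hat_sensitive,
    )


if __name__ == "__main__":
    myeloma = Cell("myeloma", immortal=True, hat_sensitive=True)
    splenocyte = Cell("splenocyte", immortal=False, hat_sensitive=False)
    hybridoma = productive_fusion(myeloma, splenocyte)
    for cell in (myeloma, splenocyte, hybridoma):
        print(cell.name, "survives:", survives_hat_selection(cell))
```

Only the fused cell satisfies both conditions, which is why HAT medium enriches for hybridomas before the supernatants are screened.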

As an alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal
anti-THAP-family or anti-THAP domain antibody can be identified and isolated by screening a
recombinant combinatorial immunoglobulin library (e.g., an antibody phage display library) with
THAP-family or THAP domain protein to thereby isolate immunoglobulin library members that
bind THAP-family or THAP domain proteins. Kits for generating and screening phage display
libraries are commercially available (e.g., the Pharmacia Recombinant Phage Antibody System,
Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage Display Kit, Catalog No.
240612). Additionally, examples of methods and reagents particularly amenable for use in
generating and screening antibody display libraries can be found in, for example, Ladner et al. U.S.
Pat. No. 5,223,409; Kang et al. PCT International Publication No. WO 92/18619; Dower et al. PCT
International Publication No. WO 91/17271; Winter et al. PCT International Publication WO
92/20791; Markland et al. PCT International Publication No. WO 92/15679; Breitling et al. PCT
International Publication WO 93/01288; McCafferty et al. PCT International Publication No. WO
92/01047; Garrard et al. PCT International Publication No. WO 92/09690; Ladner et al. PCT
International Publication No. WO 90/02809; Fuchs et al. (1991) Biotechnology 9:1370-1372; Hay
et al. (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281;
Griffiths et al. (1993) EMBO J 12:725-734; Hawkins et al. (1992) J. Mol. Biol. 226:889-896;
Clarkson et al. (1991) Nature 352:624-628; Gram et al. (1992) PNAS 89:3576-3580; Garrard et al.
(1991) Bio/Technology 9:1373-1377; Hoogenboom et al. (1991) Nuc. Acid Res. 19:4133-4137;
Barbas et al. (1991) PNAS 88:7978-7982; and McCafferty et al. Nature (1990) 348:552-554.

Additionally, recombinant anti-THAP-family or anti-THAP domain antibodies, such as
chimeric and humanized monoclonal antibodies, comprising both human and non-human portions,
which can be made using standard recombinant DNA techniques, are within the scope of the
invention. Such chimeric and humanized monoclonal antibodies can be produced by recombinant
DNA techniques known in the art, for example using methods described in Robinson et al.
International Application No. PCT/US86/02269; Akira et al. European Patent Application 184,187;
Taniguchi, M., European Patent Application 171,496; Morrison et al. European Patent Application
173,494; Neuberger et al. PCT International Publication No. WO 86/01533; Cabilly et al. U.S. Pat.
No. 4,816,567; Cabilly et al. European Patent Application 125,023; Better et al. (1988) Science
240:1041-1043; Liu et al. (1987) PNAS 84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526;
Sun et al. (1987) PNAS 84:214-218; Nishimura et al. (1987) Canc. Res. 47:999-1005; Wood
et al. (1985) Nature 314:446-449; Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559;
Morrison, S. L. (1985) Science 229:1202-1207; Oi et al. (1986) BioTechniques 4:214; Winter U.S.
Pat. No. 5,225,539; Jones et al. (1986) Nature 321:522-525; Verhoeyen et al. (1988) Science
239:1534; and Beidler et al. (1988) J. Immunol. 141:4053-4060.

An anti-THAP-family or anti-THAP domain antibody (e.g., monoclonal antibody) can be
used to isolate THAP-family or THAP domain protein by standard techniques, such as affinity
chromatography or immunoprecipitation. For example, an anti-THAP-family antibody can facilitate
the purification of natural THAP-family from cells and of recombinantly produced THAP-family
expressed in host cells. Moreover, an anti-THAP-family antibody can be used to detect
THAP-family protein (e.g., in a cellular lysate or cell supernatant) in order to evaluate the abundance and
pattern of expression of the THAP-family protein. Anti-THAP-family antibodies can be used
diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, e.g., to
determine the efficacy of a given treatment regimen. Detection can be facilitated by
coupling (i.e., physically linking) the antibody to a detectable substance. Examples of detectable
substances include various enzymes, prosthetic groups, fluorescent materials, luminescent
materials, bioluminescent materials, and radioactive materials. Examples of suitable enzymes
include horseradish peroxidase, alkaline phosphatase, β-galactosidase, or acetylcholinesterase;
examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin;
examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an
example of a luminescent material includes luminol; examples of bioluminescent materials include
luciferase, luciferin, and aequorin; and examples of suitable radioactive material include 125I,
131I, 35S or 3H.

DRUG SCREENING ASSAYS 

Some embodiments of the present invention provide a method (also referred to herein as a
"screening assay") for identifying modulators, i.e., candidate or test compounds or agents (e.g.,
preferably small molecules, but also peptides, peptidomimetics or other drugs) which bind to
THAP-family or THAP domain proteins, have an inhibitory or activating effect on, for example,
THAP-family expression or preferably THAP-family activity, or have an inhibitory or activating
effect on, for example, the activity of a THAP-family target molecule. In some embodiments
small molecules can be generated using combinatorial chemistry or can be obtained from a natural
products library. Assays may be cell-based, non-cell-based or in vivo assays. Drug screening
assays may be binding assays or more preferably functional assays, as further described.

In general, any suitable activity of a THAP-family protein can be detected in a drug
screening assay, including: (1) mediating apoptosis or cell proliferation when expressed or 
introduced into a cell, most preferably inducing or enhancing apoptosis, and/or most preferably 
reducing cell proliferation; (2) mediating apoptosis or cell proliferation of an endothelial cell; (3) 
mediating apoptosis or cell proliferation of a hyperproliferative cell; (4) mediating apoptosis or cell 
proliferation of a CNS cell, preferably a neuronal or glial cell; (5) an activity indicative of a 
biological function in an animal selected from the group consisting of mediating, for example 
enhancing or inhibiting, angiogenesis; mediating, preferably inhibiting, inflammation; inhibiting the 
metastatic potential of cancerous tissue; reducing tumor burden; increasing sensitivity of cancerous 
cells to chemotherapy or radiotherapy; killing a cancer cell, inhibiting the growth of a cancer cell, 
inducing tumor regression; and mediating, preferably inhibiting, one or more of the following
conditions: T-cell auto-immune infiltrative skin diseases, chronic autoinflammatory skin diseases,
such as lichen planus and psoriasis, autoimmune encephalomyelitis, multiple sclerosis, rheumatoid
arthritis, autoimmune diabetes, inflammatory bowel diseases, such as Crohn's disease and
ulcerative colitis, Hashimoto's thyroiditis, Sjogren's syndrome, gastric lymphomas and chronic
inflammatory liver disease; or (6) interaction with a THAP family target molecule or THAP domain
target molecule, preferably interaction with a protein or a nucleic acid. 

The invention also provides a method (also referred to herein as a "screening assay") for 
identifying modulators, i.e., candidate or test compounds or agents (e.g., preferably small 
molecules, but also peptides, peptidomimetics or other drugs) which bind to THAP1, PAR4 or 
PML-NB proteins, and have an inhibitory or activating effect on PAR4 or THAP1 recruitment, 
binding to or association with PML-NBs or interaction of a chemokine with a THAP-family 
polypeptide or a cellular response to a chemokine which is mediated by a THAP-family 
polypeptide. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are target molecules of a THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof. In another embodiment, the invention provides 
assays for screening candidate or test compounds which bind to or modulate the activity of a THAP 
family or THAP domain polypeptide, or a biologically active fragment or homologue thereof. The 
test compounds of the present invention can be obtained using any of the numerous approaches in 
combinatorial library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; 
the 'one-bead one-compound' library method; and synthetic library methods using affinity 
chromatography selection. The biological library approach is used with peptide libraries, while the 
other four approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of 
compounds (Lam, K. S. (1997) Anticancer Drug Des. 12: 145). 

Examples of methods for the synthesis of molecular libraries can be found in the art, for 
example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et al. (1994) Proc. Natl. 
Acad. Sci. USA 91:11422; Zuckermann et al. (1994) J. Med. Chem. 37:2678; Cho et al. (1993) 
Science 261:1303; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2059; Carell et al. (1994) 
Angew. Chem. Int. Ed. Engl. 33:2061; and in Gallop et al. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) Biotechniques 
13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor (1993) Nature 364:555- 
556), bacteria (Ladner U.S. Pat. No. 5,223,409), spores (Ladner U.S. Pat. No. '409), plasmids (Cull 
et al. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage (Scott and Smith (1990) Science 
249:386-390); (Devlin (1990) Science 249:404-406); (Cwirla et al. (1990) Proc. Natl. Acad. Sci. 
87:6378-6382); (Felici (1991) J. Mol. Biol. 222:301-310); (Ladner supra.). 

Determining the ability of the test compound to inhibit or increase THAP-family 
polypeptide activity can also be accomplished, for example, by coupling the THAP family or THAP 
domain polypeptide, or a biologically active fragment or homologue thereof, with a radioisotope or 
enzymatic label such that binding of the THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof, to its cognate target molecule can be determined 
by detecting the labeled THAP family or THAP domain polypeptide, or a biologically active 
fragment or homologue thereof, in a complex. For example, compounds (e.g., THAP family or 
THAP domain polypeptide, or a biologically active fragment or homologue thereof) can be labeled 
with 125I, 35S, 14C, or 3H, either directly or indirectly, and the radioisotope detected by direct 
counting of radioemission or by scintillation counting. Alternatively, compounds can be 
enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, or 
luciferase, and the enzymatic label detected by determination of conversion of an appropriate 
substrate to product. The labeled molecule is placed in contact with its cognate molecule and the 
extent of complex formation is measured. For example, the extent of complex formation may be 
measured by immunoprecipitating the complex or by performing gel electrophoresis. 

It is also within the scope of this invention to determine the ability of a compound (e.g., 
THAP family or THAP domain polypeptide, or biologically active fragment or homologue thereof) 
to interact with its cognate target molecule without the labeling of any of the interactants. For 
example, a microphysiometer can be used to detect the interaction of a compound with its cognate 
target molecule without the labeling of either the compound or the target molecule (McConnell, H. 
M. et al. (1992) Science 257:1906-1912). A microphysiometer such as a cytosensor is an analytical 
instrument that measures the rate at which a cell acidifies its environment using a light-addressable 
potentiometric sensor (LAPS). Changes in this acidification rate can be used as an indicator of the 
interaction between compound and cognate target molecule. 

In a preferred embodiment, the assay comprises contacting a cell which expresses a THAP 
family or THAP domain polypeptide, or biologically active fragment or homologue thereof, with a 
THAP-family or THAP domain protein target molecule to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to inhibit or 
increase the activity of the THAP family or THAP domain polypeptide, or biologically active 
fragment or homologue thereof, wherein determining the ability of the test compound to inhibit or 
increase the activity of the THAP family or THAP domain polypeptide, or biologically active 
fragment or homologue thereof, comprises determining the ability of the test compound to inhibit or 
increase a biological activity of the THAP-family polypeptide expressing cell. 

In another embodiment, the assay comprises contacting a cell which expresses a THAP 
family or THAP domain polypeptide, or biologically active fragment or homologue thereof, with a 
test compound, and determining the ability of the test compound to inhibit or increase the activity 
of the THAP family or THAP domain polypeptide, or biologically active fragment or homologue 
thereof, wherein determining the ability of the test compound to inhibit or increase the activity of 
the THAP family or THAP domain polypeptide, or biologically active fragment or homologue 
thereof, comprises determining the ability of the test compound to inhibit or increase a biological 
activity of the THAP-family polypeptide expressing cell. 

In another preferred embodiment, the assay comprises contacting a cell which is responsive 
to a THAP family or THAP domain polypeptide, or a biologically active fragment or homologue 
thereof, with a THAP-family protein or biologically-active portion thereof, to form an assay 
mixture, contacting the assay mixture with a test compound, and determining the ability of the test 
compound to modulate the activity of the THAP-family protein or biologically active portion 
thereof, wherein determining the ability of the test compound to modulate the activity of the THAP- 
family protein or biologically active portion thereof comprises determining the ability of the test 
compound to modulate a biological activity of the THAP-family polypeptide-responsive cell (e.g., 
determining the ability of the test compound to modulate a THAP-family polypeptide activity). 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a THAP-family target molecule (i.e. a molecule with which THAP-family polypeptide 
interacts) with a test compound and determining the ability of the test compound to modulate (e.g. 
stimulate or inhibit) the activity of the THAP-family target molecule. Determining the ability of the 
test compound to modulate the activity of a THAP-family target molecule can be accomplished, for 
example, by determining the ability of the THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof to bind to or interact with the THAP-family 
target molecule. 

Determining the ability of the THAP family or THAP domain polypeptide, or a biologically 
active fragment or homologue thereof to bind to or interact with a THAP-family target molecule 
can be accomplished by one of the methods described above for determining direct binding. In a 
preferred embodiment, determining the ability of the THAP family or THAP domain polypeptide, 
or a biologically active fragment or homologue thereof to bind to or interact with a THAP-family 
target molecule can be accomplished by determining the activity of the target molecule. For 
example, the activity of the target molecule can be determined by contacting the target molecule 
with the THAP family or THAP domain polypeptide, or a biologically active fragment or 
homologue thereof, and measuring induction of a cellular second messenger of the target (e.g., 
intracellular Ca2+, diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target on an 
appropriate substrate, detecting the induction of a reporter gene (comprising a target-responsive 
regulatory element operatively linked to a nucleic acid encoding a detectable marker, e.g., 
luciferase), or detecting a target-regulated cellular response, for example, signal transduction or 
protein:protein interactions. 

In yet another embodiment, an assay of the present invention is a cell-free assay in which a 
THAP family or THAP domain polypeptide, or a biologically active fragment or homologue thereof, 
is contacted with a test compound and the ability of the test compound to bind to the THAP family 
or THAP domain polypeptide, or a biologically active fragment or homologue thereof, is 
determined. Binding of the test compound to the THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof, can be determined either directly or indirectly as 
described above. In a preferred embodiment, the assay includes contacting the THAP family or 
THAP domain polypeptide, or a biologically active fragment or homologue thereof, with a known 
compound which binds THAP-family polypeptide (e.g., a THAP-family target molecule) to form an 
assay mixture, contacting the assay mixture with a test compound, and determining the ability of 
the test compound to interact with a THAP family or THAP domain polypeptide, or a biologically 
active fragment or homologue thereof, wherein determining the ability of the test compound to 
interact with a THAP-family protein comprises determining the ability of the test compound to 
preferentially bind to THAP family or THAP domain polypeptide, or a biologically active fragment 
or homologue thereof, as compared to the known compound. 

In another embodiment, the assay is a cell-free assay in which a THAP family or THAP 
domain polypeptide, or a biologically active fragment or homologue thereof, is contacted with a test 
compound and the ability of the test compound to modulate (e.g., stimulate or inhibit) the activity 
of the THAP family or THAP domain polypeptide, or a biologically active fragment or homologue 
thereof, is determined. Determining the ability of the test compound to modulate the activity of a 
THAP-family protein can be accomplished, for example, by determining the ability of the THAP 
family or THAP domain polypeptide, or a biologically active fragment or homologue thereof, to 
bind to a THAP-family target molecule by one of the methods described above for determining 
direct binding. Determining the ability of the THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof, to bind to a THAP-family target molecule can 
also be accomplished using a technology such as real-time Biomolecular Interaction Analysis 
(BIA) (Sjolander, S. and Urbaniczky, C. (1991) Anal. Chem. 63:2338-2345 and Szabo et al. (1995) 
Curr. Opin. Struct. Biol. 5:699-705). As used herein, "BIA" is a technology for studying biospecific 
interactions in real time, without labeling any of the interactants (e.g., BIAcore). Changes in the 
optical phenomenon of surface plasmon resonance (SPR) can be used as an indication of real-time 
reactions between biological molecules. 

In an alternative embodiment, determining the ability of the test compound to modulate the 
activity of a THAP family or THAP domain polypeptide, or a biologically active fragment or 
homologue thereof can be accomplished by determining the ability of the THAP family or THAP 
domain polypeptide, or a biologically active fragment or homologue thereof to further modulate the 
activity of a downstream effector (e.g., a growth factor mediated signal transduction pathway 
component) of a THAP-family target molecule. For example, the activity of the effector molecule 
on an appropriate target can be determined or the binding of the effector to an appropriate target can 
be determined as previously described. 

In yet another embodiment, the cell-free assay involves contacting a THAP family or 
THAP domain polypeptide, or a biologically active fragment or homologue thereof with a known 
compound which binds the THAP-family protein to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability of the test compound to interact with the 
THAP-family protein, wherein determining the ability of the test compound to interact with the 
THAP-family protein comprises determining the ability of the THAP family or THAP domain 
polypeptide, or a biologically active fragment or homologue thereof to preferentially bind to or 
modulate the activity of a THAP-family target molecule. 

The cell-free assays of the present invention are amenable to use of soluble and/or 
membrane-bound forms of isolated proteins (e.g. THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof or molecules to which THAP-family targets 
bind). In the case of cell-free assays in which a membrane-bound form of an isolated protein is used, it 
may be desirable to utilize a solubilizing agent such that the membrane-bound form of the isolated 
protein is maintained in solution. Examples of such solubilizing agents include non-ionic detergents 
such as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®, 
Isotridecypoly(ethylene glycol ether)n, 3-[(3-cholamidopropyl)dimethylammonio]-1-propane 
sulfonate (CHAPS), 3-[(3-cholamidopropyl)dimethylammonio]-2-hydroxy-1-propane sulfonate 
(CHAPSO), or N-dodecyl-N,N-dimethyl-3-ammonio-1-propane sulfonate. 

In more than one embodiment of the above assay methods of the present invention, it may 
be desirable to immobilize either THAP family or THAP domain polypeptide, or a biologically 
active fragment or homologue thereof, or a target molecule thereof to facilitate separation of 
complexed from uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to a THAP family or THAP domain 
polypeptide, or a biologically active fragment or homologue thereof, or interaction of a THAP- 
family protein with a target molecule in the presence and absence of a candidate compound, can be 
accomplished in any vessel suitable for containing the reactants. Examples of such vessels include 
microtitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be 
provided which adds a domain that allows one or both of the proteins to be bound to a matrix. For 
example, glutathione-S-transferase/THAP-family fusion proteins or glutathione-S-transferase/target 
fusion proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, Mo.) 
or glutathione derivatized microtitre plates, which are then combined with the test compound or the 
test compound and either the non-adsorbed target protein or THAP-family protein, and the mixture 
incubated under conditions conducive to complex formation (e.g., at physiological conditions for 
salt and pH). Following incubation, the beads or microtitre plate wells are washed to remove any 
unbound components, the matrix is immobilized in the case of beads, and complex formation is 
determined either directly or indirectly, for example, as described above. Alternatively, the 
complexes can be dissociated from the matrix, and the level of THAP-family polypeptide binding 
or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the screening 
assays of the invention. For example, either a THAP-family protein or a THAP-family target 
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated THAP- 
family protein or target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) 
using techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, Ill.), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, 
antibodies reactive with a THAP-family protein or target molecule but which do not interfere with 
binding of the THAP-family protein to its target molecule can be derivatized to the wells of the 
plate, and unbound target or THAP-family protein trapped in the wells by antibody conjugation. 
Methods for detecting such complexes, in addition to those described above for the GST- 
immobilized complexes, include immunodetection of complexes using antibodies reactive with the 
THAP-family protein or target molecule, as well as enzyme-linked assays which rely on detecting 
an enzymatic activity associated with the THAP-family protein or target molecule. 

In another embodiment, modulators of THAP-family or THAP domain polypeptide 
expression are identified in a method wherein a cell is contacted with a candidate compound and the 
expression of THAP-family or THAP domain polypeptide mRNA or protein in the cell is 
determined. The level of expression of THAP-family polypeptide or THAP domain mRNA or 
protein in the presence of the candidate compound is compared to the level of expression of 
THAP-family polypeptide or THAP domain mRNA or protein in the absence of the candidate 
compound. The candidate compound can then be identified as a modulator of THAP-family 
polypeptide expression based on this comparison. For example, when expression of THAP-family 
polypeptide or THAP domain mRNA or protein is greater (statistically significantly greater) in the 
presence of the candidate compound than in its absence, the candidate compound is identified as a 
stimulator of THAP-family polypeptide or THAP domain mRNA or protein expression. 
Alternatively, when expression of THAP-family polypeptide or THAP domain mRNA or protein is 
less (statistically significantly less) in the presence of the candidate compound than in its absence, 
the candidate compound is identified as an inhibitor of THAP-family polypeptide or THAP domain 
mRNA or protein expression. The level of THAP-family polypeptide or THAP domain mRNA or 
protein expression in the cells can be determined by methods described herein for detecting 
THAP-family polypeptide or THAP domain mRNA or protein. 
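
By way of illustration only, the comparison described above can be sketched in Python. The 
specification does not prescribe a statistical test; the exact permutation test, the function names, 
the replicate values and the 0.05 threshold below are all assumptions chosen for the example. 

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, control):
    """Exact two-sided permutation test on the difference of group means."""
    pooled = list(treated) + list(control)
    indices = range(len(pooled))
    observed = abs(mean(treated) - mean(control))
    total = 0
    extreme = 0
    # Enumerate every way of relabelling the pooled replicates as "treated".
    for chosen in combinations(indices, len(treated)):
        chosen_set = set(chosen)
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        # Small tolerance guards against floating-point rounding.
        if abs(mean(group_a) - mean(group_b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def classify_expression_modulator(treated, control, alpha=0.05):
    """Label a candidate compound from replicate expression measurements.

    `treated` and `control` are hypothetical replicate expression levels
    (mRNA or protein) with and without the candidate compound.
    """
    if permutation_p_value(treated, control) >= alpha:
        return "no significant effect"
    return "stimulator" if mean(treated) > mean(control) else "inhibitor"


# Hypothetical relative expression values, four replicates per condition.
with_compound = [2.1, 2.3, 2.2, 2.4]
without_compound = [1.0, 1.1, 0.9, 1.05]
print(classify_expression_modulator(with_compound, without_compound))
```

With four replicates per condition the smallest attainable p-value of this test is 2/70, so at 
least four replicates per group are needed for a call at the 0.05 level under these assumptions. 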

In yet another aspect of the invention, the THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof, can be used as "bait proteins" in a two-hybrid 
assay or three-hybrid assay using the methods described above for use in THAP-family 
polypeptide/PAR4 interaction assays, to identify other proteins which bind to or interact with 
THAP-family polypeptide ("THAP-family-binding proteins" or "THAP-family-bp") and are 
involved in THAP-family polypeptide activity. Such THAP-family- or THAP domain-binding 
proteins are also likely to be involved in the propagation of signals by the THAP-family or THAP 
domain proteins or THAP-family or THAP domain protein targets as, for example, downstream 
elements of a THAP-family polypeptide- or THAP domain-mediated signaling pathway. 
Alternatively, such THAP-family-binding proteins are likely to be THAP-family polypeptide 
inhibitors. 

THAP/DNA BINDING ASSAYS 

In another embodiment of the invention a method is provided for identifying compounds 
which interfere with THAP-family DNA binding activity, comprising the steps of: contacting a 
THAP-family protein or a portion thereof immobilized on a solid support with both a test 
compound and DNA fragments, or contacting a DNA fragment immobilized on a solid support with 
both a test compound and a THAP-family protein. The binding between DNA and the THAP-family 
protein or a portion thereof is detected, wherein a decrease in DNA binding when compared to 
DNA binding in the absence of the test compound indicates that the test compound is an inhibitor of 
THAP-family DNA binding activity, and an increase in DNA binding when compared to DNA 
binding in the absence of the test compound indicates that the test compound is an inducer of or 
restores THAP-family DNA binding activity. As discussed further, DNA fragments may be 
selected to be specific THAP-family protein target DNA obtained for example as described in 
Example 28, or may be non-specific THAP-family target DNA. Methods for detecting protein- 
DNA interactions are well known in the art, including the most commonly used electrophoretic 
mobility shift assays (EMSAs) and filter binding assays (Zabel et al. (1991) J. Biol. Chem. 266:252; 
and Okamoto and Beach (1994) EMBO J. 13:4816). Other assays are available which are amenable to 
high throughput detection and quantification of specific and nonspecific DNA binding (Amersham, 
N.J.; and Gal S. et al., 6th Ann. Conf. Soc. Biomol. Screening, 6-9 Sept 2000, Vancouver, B.C.). 
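
The decision rule stated above (a decrease in DNA binding relative to the no-compound control 
indicates an inhibitor, an increase indicates an inducer) can be sketched as follows. This is a 
minimal illustration only: the function name, the background subtraction and the 20% threshold 
are assumptions and are not taken from the specification. 

```python
def classify_dna_binding_effect(signal_with_compound, signal_without_compound,
                                background=0.0, threshold_percent=20.0):
    """Compare bound-DNA signal with and without a test compound.

    Signals are hypothetical readouts of labelled DNA retained on the
    support (e.g. counts or fluorescence units). Returns a label and the
    percent change relative to the no-compound control.
    """
    with_compound = signal_with_compound - background
    without_compound = signal_without_compound - background
    if without_compound <= 0:
        raise ValueError("control signal must exceed background")
    change = 100.0 * (with_compound - without_compound) / without_compound
    if change <= -threshold_percent:
        label = "inhibitor"
    elif change >= threshold_percent:
        label = "inducer"
    else:
        label = "no clear effect"
    return label, change


# Hypothetical readouts for three test compounds against one control well.
for name, signal in [("cpd_1", 400.0), ("cpd_2", 1500.0), ("cpd_3", 1050.0)]:
    print(name, classify_dna_binding_effect(signal, 1000.0))
```

In practice the threshold would be set from the variability of replicate control wells rather than 
fixed in advance. 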

In a first aspect, a screening assay involves identifying compounds which interfere with 
THAP-family DNA binding activity without prior knowledge about specific THAP-family binding 
sequences. For example, a THAP-family protein is contacted with both a test compound and a 
library of oligonucleotides or a sample of DNA fragments not selected based on specific DNA 
sequences. Preferably the THAP-family protein is immobilized on a solid support (such as an array 
or a column). Unbound DNA is separated from DNA which is bound to the THAP-family protein, 
and the DNA which is bound to THAP-family protein is detected and can be quantitated by any 
means known in the art. For example, the DNA fragment is labelled with a detectable moiety, such 
as a radioactive moiety, a colorimetric moiety or a fluorescent moiety. Techniques for so labelling 
DNA are well known in the art. 

The DNA which is bound to the THAP-family protein or a portion thereof is separated 
from unbound DNA by immunoprecipitation with antibodies which are specific for the THAP- 
family protein or a portion thereof. Use of two different monoclonal anti-THAP-family antibodies 
may result in more complete immunoprecipitation than either one alone. The amount of DNA 
which is in the immunoprecipitate can be quantitated by any means known in the art. THAP-family 
proteins or portions thereof which bind to the DNA can also be detected by gel shift assays (Tan, 
Cell, 62:367, 1990), nuclease protection assays, or methylase interference assays. 

It is still another object of the invention to provide methods for identifying compounds 
which restore the ability of mutant THAP-family proteins or portions thereof to bind to DNA 
sequences. In one embodiment a method of screening agents for use in therapy is provided 
comprising: measuring the amount of binding of a THAP-family protein or a portion thereof which 
is encoded by a mutant gene found in cells of a patient to DNA molecules, preferably random 
oligonucleotides or DNA fragments from a nucleic acid library; measuring the amount of binding of 
said THAP-family protein or a portion thereof to said nucleic acid molecules in the presence of a 
test substance; and comparing the amount of binding of the THAP-family protein or a portion 
thereof in the presence of said test substance to the amount of binding of the THAP-family protein 
in the absence of said test substance, a test substance which increases the amount of binding being a 
candidate for use in therapy. 

In another embodiment of the invention, oligonucleotides can be isolated which restore to 
mutant THAP-family proteins or portions thereof the ability to bind to a consensus binding 
sequence or conforming sequences. Mutant THAP-family protein or a portion thereof and random 
oligonucleotides are added to a solid support on which THAP-family-specific DNA fragments are 
immobilized. Oligonucleotides which bind to the solid support are recovered and analyzed. Those 
whose binding to the solid support is dependent on the presence of the mutant THAP-family protein 
are presumptively binding the support by binding to and restoring the conformation of the mutant 
protein. 

If desired, specific binding can be distinguished from non-specific binding by any means 
known in the art. For example, specific binding interactions are stronger than non-specific binding 
interactions. Thus the incubation mixture can be subjected to any agent or condition which 
destabilizes protein/DNA interactions such that the specific binding reaction is the predominant one 
detected. Alternatively, as taught more specifically below, a non-specific competitor, such as dI-dC, 
can be added to the incubation mixture. If the DNA containing the specific binding sites is labelled 
and the competitor is unlabelled, then the specific binding reactions will be the ones predominantly 
detected upon measuring labelled DNA. 

According to another embodiment of the invention, after incubation of THAP-family 
protein or a portion thereof with specific DNA fragments all components of the cell lysate which do 
not bind to the DNA fragments are removed. This can be accomplished, among other ways, by 
employing DNA fragments which are attached to an insoluble polymeric support such as agarose, 
cellulose and the like. After binding, all non-binding components can be washed away, leaving 
THAP-family protein or a portion thereof bound to the DNA/solid support. The THAP-family 
protein or a portion thereof can be quantitated by any means known in the art. It can be determined 
using an immunological assay, such as an ELISA, RIA or Western blotting. 

In another embodiment of the invention a method is provided for identifying compounds 
which specifically bind to THAP-family-specific-DNA sequences, comprising the steps of: 
contacting a THAP-family-specific DNA fragment immobilized on a solid support with both a test 
compound and wild-type THAP-family protein or a portion thereof to bind the wild-type THAP- 
family protein or a portion thereof to the DNA fragment; determining the amount of wild-type 
THAP-family protein which is bound to the DNA fragment, inhibition of binding of wild-type 
THAP-family protein by the test compound with respect to a control lacking the test compound 
suggesting binding of the test compound to the THAP-family-specific DNA binding sequences. 

It is still another object of the invention to provide methods for identifying compounds 
which restore the ability of mutant THAP-family proteins or portions thereof to bind to specific 
DNA binding sequences. In one embodiment a method of screening agents for use in therapy is 
provided comprising: measuring the amount of binding of a THAP-family protein or a portion 
thereof which is encoded by a mutant gene found in cells of a patient to a DNA molecule which 
comprises more than one monomer of a specific THAP-family target nucleotide sequence; 
measuring the amount of binding of said THAP-family protein to said nucleic acid molecule in the 
presence of a test substance; and comparing the amount of binding of the THAP-family protein in 
the presence of said test substance to the amount of binding of the THAP-family protein or a 
portion thereof in the absence of said test substance, a test substance which increases the amount of 
binding being a candidate for use in therapy. 

In another embodiment of the invention a method is provided for screening agents for use 
in therapy comprising: contacting a transfected cell with a test substance, said transfected cell 
containing a THAP-family protein or a portion thereof which is encoded by a mutant gene found in 
cells of a patient and a reporter gene construct comprising a reporter gene which encodes an 
assayable product and a sequence which conforms to a THAP-family DNA binding site wherein 
said sequence is upstream from and adjacent to said reporter gene; and determining whether the 
amount of expression of said reporter gene is altered by the test substance, a test substance which 
alters the amount of expression of said reporter gene being a candidate for use in therapy. 

In still another embodiment a method of screening agents for use in therapy is provided 
comprising: adding RNA polymerase, ribonucleotides and a THAP-family protein or a portion 
thereof to a transcription construct, said transcription construct comprising a reporter gene which 
encodes an assayable product and a sequence which conforms to a THAP-family consensus binding 
site, said sequence being upstream from and adjacent to said reporter gene, said step of adding 
being effected in the presence and absence of a test substance; determining whether the amount of 
transcription of said reporter gene is altered by the presence of said test substance, a test substance 
which alters the amount of transcription of said reporter gene being a candidate for use in therapy. 

According to the present invention compounds which have THAP-family activity are those 
which specifically complex with a THAP-family-specific DNA binding site. Oligonucleotides and 
oligonucleotides containing nucleotide analogs are also contemplated among those compounds 
which are able to complex with a THAP-family-specific DNA binding site. 

Further assays to modulate THAP-family polypeptide activity in vivo 
It will be appreciated that any suitable assay that allows detection of THAP-family 
polypeptide or THAP domain activity can be used. Examples of assays for testing protein 
interaction, nucleic acid binding or modulation of apoptosis in the presence or absence of a test 
compound are further described herein. Thus, the invention encompasses a method of identifying a 
candidate THAP-family polypeptide modulator (e.g. activator or inhibitor), said method 
comprising: 

a) providing a cell comprising a THAP family or THAP domain polypeptide, or a 
biologically active fragment or homolog thereof; 

b) contacting said cell with a test compound; and 

c) determining whether said compound selectively modulates (e.g. activates or inhibits) 
THAP-family polypeptide activity, preferably pro-apoptotic activity, or THAP family or THAP 
domain target binding; wherein a determination that said compound selectively modulates (e.g. 
activates or inhibits) the activity of said polypeptide indicates that said compound is a candidate 
modulator (e.g. activator or inhibitor respectively) of said polypeptide. Preferably, the THAP 
family or THAP domain target is a protein or nucleic acid. 

Preferably the cell is a cell which has been transfected with a recombinant expression 
vector encoding a THAP family or THAP domain polypeptide, or a biologically active fragment or 
homologue thereof. 

Several examples of assays for the detection of apoptosis are described herein, in the 
section titled "Apoptosis assays". Several examples of assays for the detection of THAP family or 
THAP domain target interactions are described herein, including assays for detection of protein 
interactions and nucleic acid binding. 

In one example of an assay for apoptosis activity, a high throughput screening assay for 
molecules that abrogate or stimulate THAP-family polypeptide proapoptotic activity is provided 
based on serum-withdrawal induced apoptosis in a 3T3 cell line with tetracycline-regulated 
expression of a THAP family or THAP domain polypeptide, or a biologically active fragment or 
homologue thereof. Apoptotic cells can be detected by TUNEL labeling in 96- or 384-well 
microplates. A drug screening assay can be carried out along the lines described in Example 23. 
3T3 cells, which have previously been used to analyze the pro-apoptotic activity of PAR4 
(Diaz-Meco et al., 1996; Berra et al., 1997), can be transfected with expression vectors encoding a 
THAP-family or THAP domain polypeptide, allowing the ectopic expression of THAP-family 
polypeptide. Then, the apoptotic response to serum withdrawal is assayed in the presence of a test 
compound, allowing the identification of test compounds that either enhance or inhibit the ability 
of a THAP-family or THAP domain polypeptide to induce apoptosis. Transfected cells are deprived 
of serum and cells with apoptotic nuclei are counted. Apoptotic nuclei can be counted by DAPI 
staining and in situ TUNEL assays. 

Further THAP-family polypeptide/THAP-target interaction assays 

In exemplary methods, THAP/THAP target interaction assays are described in the context of 
THAP1 and the THAP target Par4. However, it will be appreciated that assays for screening for 
modulators of other THAP family members or THAP domains and other THAP target molecules 
may be carried out by substituting these for THAP1 and Par4 in the methods below. For example, 
in some embodiments, modulators which affect the interaction between a THAP-family polypeptide 
and SLC are identified. It will be appreciated, however, that the same assays can be used to 
determine the interaction between any THAP-target polypeptide (for example, a chemokine) and a 
THAP-family polypeptide which comprises an interaction domain for the chemokine. THAP-family 
polypeptides that can be used in these assays include the polypeptides of SEQ ID NOs: 1-114, 
biologically active fragments thereof, THAP-family polypeptide oligomers, oligomers 
comprising a THAP-family chemokine-binding domain, THAP-family polypeptide-immunoglobulin 
fusions, THAP-family chemokine-binding domain-immunoglobulin fusions and 
polypeptide homologs having at least 30% amino acid identity to any one of the aforementioned 
polypeptides. 

As shown in Examples 4, 5, 6, and 7 and Figures 3, 4 and 5, the inventors have 
demonstrated using several experimental methods that THAP1 interacts with the pro-apoptotic 
protein Par4. In particular, it has been shown that THAP1 interacts with Par4 wild type (Par4) and 
a Par4 death domain (Par4DD) in a yeast two-hybrid system. Yeast cells were cotransformed with 
BD7-THAP1 and AD7-Par4, AD7, AD7-Par4DD or AD7-Par4) expression vectors. Transformants 
were selected on media lacking histidine and adenine. Identical results were obtained by 
cotransformation of AD7-THAP1 with BD7-Par4, BD7, BD7-Par4DD or BD7-Par4). 

The inventors have also demonstrated in vitro binding of THAP1 to GST-Par4DD. 
Par4DD was expressed as a GST fusion protein, purified on glutathione sepharose and employed as 
an affinity matrix for binding of in vitro translated 35S-methionine-labeled THAP1. GST served as 
negative control. 

Furthermore, the inventors have shown that THAP1 interacts with both Par4DD and SLC in 
vivo. Myc-Par4DD and GFP-THAP1 expression vectors were cotransfected in primary human 
endothelial cells. Myc-Par4DD was stained with monoclonal anti-myc antibody. Green 
fluorescence, GFP-THAP1; red fluorescence, Par4DD. 

The invention thus encompasses assays for the identification of molecules that modulate 
(stimulate or inhibit) THAP-family polypeptide/PAR4 binding. In preferred embodiments, the 
invention includes assays for the identification of molecules that modulate (stimulate or inhibit) 
THAP1/PAR4 binding or THAP1/SLC binding. 
Four examples of high throughput screening assays include: 

1) a two hybrid-based assay in yeast to find drugs that disrupt interaction of the 
THAP-family bait with PAR4 or SLC as prey 

2) an in vitro interaction assay using recombinant THAP-family polypeptide and PAR4 or 
SLC proteins 

3) a chip-based binding assay using recombinant THAP-family polypeptide and PAR4 or 
SLC proteins 

4) a fluorescence resonance energy transfer (FRET) cell-based assay using THAP-family 
polypeptide and PAR4 or SLC proteins fused with fluorescent proteins 

The invention thus encompasses a method of identifying a candidate THAP-family 
polypeptide/PAR4 or SLC interaction modulator, said method comprising: 

a) providing a THAP family or THAP domain polypeptide, or a biologically active 
fragment or homologue thereof and a PAR4 or SLC polypeptide or fragment thereof; 

b) contacting said THAP family or THAP domain polypeptide with a test compound; and 

c) determining whether said compound selectively modulates (e.g. activates or inhibits) 
THAP-family/PAR4 or SLC interaction activity. 

Also envisioned is a method comprising: 

a) providing a cell comprising a THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof and a PAR4 or SLC polypeptide or fragment 
thereof; 

b) contacting said cell with a test compound; and 

c) determining whether said compound selectively modulates (e.g. activates or inhibits) 
THAP-family/PAR4 or SLC interaction activity. 

In general, any suitable assay for the detection of protein-protein interaction may be used. 
In one example, a THAP family or THAP domain polypeptide, or a biologically active 
fragment or homologue thereof can be used as a "bait protein" and a PAR4 or SLC protein can be 
used as a "prey protein" (or vice-versa) in a two-hybrid assay (see, e.g., U.S. Pat. No. 5,283,317; 
Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel 
et al. (1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696; and Brent 
WO94/10300). The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two 
different DNA constructs. In one construct, the gene that codes for a THAP family or THAP 
domain polypeptide, or a biologically active fragment or homologue thereof, is fused to a gene 
encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In the other 
construct, the gene that codes for a PAR4 or SLC protein ("prey" or "sample") is fused to a gene 
that codes for the activation domain of the known transcription factor. If the "bait" and the "prey" 
proteins are able to interact, in vivo, forming a THAP-family polypeptide/PAR4 complex, the 
DNA-binding and activation domains of the transcription factor are brought into close proximity. 
This proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked to a 
transcriptional regulatory site responsive to the transcription factor. Expression of the reporter 
gene can be detected and cell colonies containing the functional transcription factor can be 
isolated and used to obtain the cloned gene which encodes the protein which interacts with the 
THAP-family protein. This assay can thus be carried out in the presence or absence of a test 
compound, whereby modulation of THAP-family polypeptide/PAR4 or SLC interaction can be detected 
by reduced or absent transcription of the reporter gene. 

In other examples, in vitro THAP-family polypeptide/PAR4 or SLC interaction assays can 
be carried out, several examples of which are further described herein. For example, a recombinant 
THAP family or THAP domain polypeptide, or a biologically active fragment or homologue thereof 
is contacted with a recombinant PAR4 or SLC protein or biologically active portion thereof, and the 
ability of the PAR4 or SLC protein to bind to the THAP-family protein is determined. Binding of 
the PAR4 or SLC protein to the THAP-family protein can be determined either directly 
or indirectly as described herein. In a preferred embodiment, the assay includes contacting the 
THAP family or THAP domain polypeptide, or a biologically active fragment or homologue thereof 
with a PAR4 or SLC protein which binds a THAP-family protein (e.g., a THAP-family target 
molecule) to form an assay mixture, contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with a THAP-family protein, wherein 
determining the ability of the test compound to interact with a THAP-family protein comprises 
determining the ability of the test compound to preferentially bind to the THAP-family protein or 
biologically active portion thereof as compared to the PAR4 or SLC protein. For example, the step of 
determining the ability of the test compound to interact with a THAP-family protein may comprise 
determining the ability of the compound to displace Par4 or SLC from a THAP-family protein/Par4 
or SLC complex thereby forming a THAP-family protein/compound complex. Alternatively, it will 
be appreciated that it is also possible to determine the ability of the test compound to interact with a 
PAR4 or SLC protein, wherein determining the ability of the test compound to interact with a 
PAR4 or SLC protein comprises determining the ability of the test compound to preferentially bind 
to PAR4 or SLC or biologically active portion thereof as compared to the THAP-family protein. 
For example, the step of determining the ability of the test compound to interact with a 
THAP-family protein may comprise determining the ability of the compound to displace Par4 or SLC from 
a THAP-family protein/Par4 or SLC complex thereby forming a THAP-family protein/compound 
complex. 

Assays to modulate THAP-family polypeptide and/or Par4 trafficking in the PML nuclear bodies 
(PML NBs) 

As shown in Examples 8 and 9, the inventors have demonstrated using several 
experimental methods that THAP1 and Par4 localize in PML NBs. 

The inventors demonstrated that THAP1 is a novel protein associated with PML-nuclear 
bodies. Double immunofluorescence staining showed colocalization of THAP1 with the PML-NB 
proteins PML and Daxx. Primary human endothelial cells were transfected with GFP-THAP1 
expression vector; endogenous PML and Daxx were stained with monoclonal anti-PML and 
polyclonal anti-Daxx antibodies, respectively. 

The inventors also demonstrated by several experiments that Par4 is a novel component of 
PML-NBs that colocalizes with THAP1 in vivo. In one experiment, double 
immunofluorescence staining revealed colocalization of Par4 and PML at PML-NBs in primary 
human endothelial cells or fibroblasts. Endogenous PAR4 and PML were stained with polyclonal 
anti-PAR4 and monoclonal anti-PML antibodies, respectively. In another experiment, double 
staining revealed colocalization of Par4 and THAP1 in cells expressing ectopic GFP-THAP1. 
Primary human endothelial cells or fibroblasts were transfected with GFP-THAP1 expression 
vector; endogenous Par4 was stained with polyclonal anti-PAR4 antibodies. 

The inventors further demonstrated that PML recruits the THAP1/Par4 complex to PML-NBs. 
Triple immunofluorescence staining showed colocalization of THAP1, Par4 and PML in cells 
overexpressing PML and absence of colocalization in cells expressing ectopic Sp100. HeLa cells 
were cotransfected with GFP-THAP1 and HA-PML or HA-SP100 expression vectors; HA-PML or 
HA-SP100 and endogenous Par4 were stained with monoclonal anti-HA and polyclonal anti-Par4 
antibodies, respectively. 

Assays to modulate THAP family protein trafficking in the PML nuclear bodies 

Provided are assays for the identification of drugs that modulate (stimulate or inhibit) 
THAP-family or THAP domain protein, particularly THAP1, binding to PML-NB proteins or 
localization to PML-NBs. In general, any suitable assay for the detection of protein-protein 
interaction may be used. Two examples of high throughput screening assays include 1) a two 
hybrid-based assay in yeast to find compounds that disrupt interaction of the THAP1 bait with the 
PML-NB protein prey; and 2) in vitro interaction assays using recombinant THAP1 and PML-NB 
proteins. Such assays may be conducted as described above with respect to THAP-family/Par4 
assays except that the PML-NB protein is used in place of Par4. Binding may be detected, for 
example, between a THAP-family protein and a PML protein or PML associated protein such as 
Daxx, Sp100, Sp140, p53, pRB, CBP, BLM or SUMO-1. 

Other assays for which standard methods are well known include assays to identify 
molecules that modulate, generally inhibit, the colocalization of THAP1 with PML-NBs. 
Detection can be carried out using a suitable label, such as an anti-THAP1 antibody, and an 
antibody allowing the detection of PML-NB protein. 
Assays to modulate PAR4 trafficking in the PML bodies 

Provided are assays for the identification of drugs that modulate (stimulate or inhibit) PAR4 
binding to PML-NB proteins or localization to PML-NBs. In general, any suitable assay for the 
detection of protein-protein interaction may be used. Two examples of high throughput screening 
assays include 1) a two hybrid-based assay in yeast to find compounds that disrupt interaction of the 
PAR4 bait with the PML-NB protein prey; and 2) in vitro interaction assays using recombinant 
PAR4 and PML-NB proteins. Such assays may be conducted as described above with respect to 
THAP-family polypeptide/Par4 assays except that the PML-NB protein is used in place of the 
THAP-family polypeptide. Binding may be detected, for example, between a Par4 protein and a 
PML protein or PML associated protein such as Daxx, Sp100, Sp140, p53, pRB, CBP, BLM or 
SUMO-1. 

Other assays for which standard methods are well known include assays to identify 
molecules that modulate, generally inhibit, the colocalization of PAR4 with PML-NBs. Detection 
can be carried out using a suitable label, such as an anti-PAR4 antibody, and an antibody allowing 
the detection of PML-NB protein. 

This invention further pertains to novel agents identified by the above-described screening 
assays and to processes for producing such agents by use of these assays. Accordingly, in one 
embodiment, the present invention includes a compound or agent obtainable by a method 
comprising the steps of any one of the aforementioned screening assays (e.g., cell-based assays or 
cell-free assays). For example, in one embodiment, the invention includes a compound or agent 
obtainable by a method comprising contacting a cell which expresses a THAP-family target 
molecule with a test compound and determining the ability of the test compound to bind to, or 
modulate the activity of, the THAP-family target molecule. In another embodiment, the invention 
includes a compound or agent obtainable by a method comprising contacting a cell which expresses 
a THAP-family target molecule with a THAP-family protein or biologically-active portion thereof, 
to form an assay mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact with, or modulate the activity of, the THAP-family target 
molecule. In another embodiment, the invention includes a compound or agent obtainable by a 
method comprising contacting a THAP-family protein or biologically active portion thereof with a 
test compound and determining the ability of the test compound to bind to, or modulate (e.g., 
stimulate or inhibit) the activity of, the THAP-family protein or biologically active portion thereof. 
In yet another embodiment, the present invention includes a compound or agent obtainable by a 
method comprising contacting a THAP-family protein or biologically active portion thereof with a 
known compound which binds the THAP-family protein to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to interact 
with, or modulate the activity of, the THAP-family protein. 

Accordingly, it is within the scope of this invention to further use an agent identified as 
described herein in an appropriate animal model. For example, an agent identified as described 
herein (e.g., a THAP-family or THAP domain modulating agent, an antisense THAP-family or 
THAP domain nucleic acid molecule, a THAP-family- or THAP domain- specific antibody, or a 
THAP-family- or THAP domain- binding partner) can be used in an animal model to determine the 
efficacy, toxicity, or side effects of treatment with such an agent. Alternatively, an agent identified 
as described herein can be used in an animal model to determine the mechanism of action of such 
an agent. Furthermore, this invention pertains to uses of novel agents identified by the above- 
described screening assays for treatments as described herein. 
The present invention also pertains to uses of novel agents identified by the above-described 
screening assays for diagnoses, prognoses, and treatments as described herein. 
Accordingly, it is within the scope of the present invention to use such agents in the design, 
formulation, synthesis, manufacture, and/or production of a drug or pharmaceutical composition for 
use in diagnosis, prognosis, or treatment, as described herein. For example, in one embodiment, the 
present invention includes a method of synthesizing or producing a drug or pharmaceutical 
composition by reference to the structure and/or properties of a compound obtainable by one of the 
above-described screening assays. For example, a drug or pharmaceutical composition can be 
synthesized based on the structure and/or properties of a compound obtained by a method in which 
a cell which expresses a THAP-family target molecule is contacted with a test compound and the 
ability of the test compound to bind to, or modulate the activity of, the THAP-family target 
molecule is determined. In another exemplary embodiment, the present invention includes a method 
of synthesizing or producing a drug or pharmaceutical composition based on the structure and/or 
properties of a compound obtainable by a method in which a THAP-family protein or biologically 
active portion thereof is contacted with a test compound and the ability of the test compound to 
bind to, or modulate (e.g., stimulate or inhibit) the activity of, the THAP-family protein or 
biologically active portion thereof is determined. 
Apoptosis assays 
It will be appreciated that any suitable apoptosis assay may be used to assess the apoptotic 
activity of a THAP family or THAP domain polypeptide, or a biologically active fragment or 
homologue thereof. 

Apoptosis can be recognized by a characteristic pattern of morphological, biochemical, and 
molecular changes. Cells going through apoptosis appear shrunken and rounded; they also can be 
observed to become detached from the culture dish. The morphological changes involve a 
characteristic pattern of condensation of chromatin and cytoplasm which can be readily identified 
by microscopy. When stained with a DNA-binding dye, e.g., H33258, apoptotic cells display classic 
condensed and punctate nuclei instead of homogeneous and round nuclei. 
A hallmark of apoptosis is endonucleolysis, a molecular change in which nuclear DNA is 
initially degraded at the linker sections of nucleosomes to give rise to fragments equivalent to single 
and multiple nucleosomes. When these DNA fragments are subjected to gel electrophoresis, they 
reveal a series of DNA bands which are positioned approximately equally distant from each other 
on the gel. The size difference between the two bands next to each other is about the length of one 
nucleosome, i.e., 120 base pairs. This characteristic display of the DNA bands is called a DNA 
ladder and it indicates apoptosis of the cell. Apoptotic cells can be identified by flow cytometric 
methods based on measurement of cellular DNA content, increased sensitivity of DNA to 
denaturation, or altered light scattering properties. These methods are well known in the art and are 
within the contemplation of the invention. 
Abnormal DNA breaks which are characteristic of apoptosis can be detected by any means 
known in the art. In one preferred embodiment, DNA breaks are labeled with biotinylated dUTP 
(b-dUTP). As described in U.S. Patent No. 5,897,999, cells are fixed and incubated in the presence of 
biotinylated dUTP with either exogenous terminal transferase (terminal DNA transferase assay; 
TdT assay) or DNA polymerase (nick translation assay; NT assay). The biotinylated dUTP is 
incorporated into the chromosome at the places where abnormal DNA breaks are repaired, and is 
detected with fluorescein conjugated to avidin under fluorescence microscopy. 
Assessing THAP-family, THAP domain and PAR4 polypeptide activity 

For assessing the nucleic acids and polypeptides of the invention, the apoptosis indicator 
which is assessed in the screening method of the invention may be substantially any indicator of the 
viability of the cell. By way of example, the viability indicator may be selected from the group 
consisting of cell number, cell refractility, cell fragility, cell size, number of cellular vacuoles, a 
stain which distinguishes live cells from dead cells, methylene blue staining, bud size, bud location, 
nuclear morphology, and nuclear staining. Other viability indicators and combinations of the 
viability indicators described herein are known in the art and may be used in the screening method 
of the invention. 

Cell death status can be evaluated based on DNA integrity. Assays for this determination 
include assaying DNA on an agarose gel to identify DNA breaking into oligonucleosome ladders 
and immunohistochemically detecting the nicked ends of DNA by labeling the free DNA end with 
fluorescein or horseradish peroxidase-conjugated UTP via terminal transferase. Routinely, one can 
also examine nuclear morphology by propidium iodide (PI) staining. All three assays (DNA ladder, 
end-labeling, and PI labeling) are gross measurements and good for those cells that are already 
dead or at the end stage of dying. 

In a preferred example, an apoptosis assay is based on serum-withdrawal induced apoptosis 
in a 3T3 cell line with tetracycline-regulated expression of a THAP family or THAP domain 
polypeptide, or a biologically active fragment or homologue thereof. Detection of apoptotic cells is 
accomplished by TUNEL labeling cells in 96- or 384-well microplates. This example is further 
described in Example 23. 

In other aspects, assays may test for the generation of cytotoxic death signals, anti-viral 
responses (Tartaglia et al., (1993) Cell 74(5):845-53), and/or the activation of acid 
sphingomyelinase (Wiegmann et al., (1994) Cell 78(6):1005-15) when the THAP-family protein is 
overexpressed or ectopically expressed in cells. Assaying for modulation of apoptosis can also be 
carried out in neuronal cells and lymphocytes, for example, where factor withdrawal is known to 
induce cell suicide, as demonstrated with neuronal cells requiring nerve growth factor to survive 
(Martin, D. P. et al., (1988) J. Cell Biol. 106, 829-844) and lymphocytes depending on a specific 
lymphokine to live (Kyprianou, N. and Isaacs, J. T. (1988) Endocrinology 122:552-562). 
THAP-family or THAP domain polypeptide-marker fusions in cell assays 

In one method, an expression vector encoding a THAP family or THAP domain 
polypeptide, or a biologically active fragment or homologue thereof can be used to evaluate the 
ability of the polypeptides of the invention to induce apoptosis in cells. If desired, a THAP-family 
or THAP domain polypeptide may be fused to a detectable marker in order to facilitate 
identification of those cells expressing the THAP family or THAP domain polypeptide, or a 
biologically active fragment or homologue thereof. For example, a variant of the Aequorea victoria 
GFP, enhanced green fluorescent protein (EGFP), can be used in fusion protein production 
(CLONTECH Laboratories, Inc., 1020 East Meadow Circle, Palo Alto, Calif. 94303), further 
described in U.S. Patent No. 6,191,269. 

The THAP-family- or THAP domain polypeptide cDNA sequence is fused in-frame by 
insertion of the THAP-family- or THAP domain polypeptide encoding cDNA into the SalI-BamHI 
site of plasmid pEGFP-N1 (GenBank Accession # U55762). Cells are transiently transfected by the 
method optimal for the cell being tested (either CaPO4 or Lipofectin). Expression of a 
THAP-family or THAP domain polypeptide and induction of apoptosis are examined using a fluorescence 
microscope at 24 hrs and 48 hrs post-transfection. Apoptosis can be evaluated by the TUNEL 
method (which involves 3' end-labeling of cleaved nuclear DNA) and/or morphological criteria 
(Cohen et al. (1984) J. Immunol. 132:38-42). Where the screen uses a fusion polypeptide 
comprising a THAP-family or THAP domain polypeptide and a reporter polypeptide (e.g., EGFP), 
apoptosis can be evaluated by detection of nuclear localization of the reporter polypeptide in 
fragmented nuclear bodies or apoptotic bodies. For example, where a THAP-family or THAP 
domain polypeptide-EGFP fusion polypeptide is used, distribution of THAP-family or THAP 
domain polypeptide EGFP-associated fluorescence in apoptotic cells would be identical to the 
distribution of DAPI or Hoechst 33342 dyes, which are conventionally used to detect the nuclear 
DNA changes associated with apoptosis (Cohen et al., supra). A minimum of approximately 100 
cells, which display characteristic EGFP fluorescence, are evaluated by fluorescence microscopy. 
Apoptosis is scored as nuclear fragmentation, marked apoptotic bodies, and cytoplasmic boiling. 
The characteristics of nuclear fragmentation are particularly visible when THAP-family or THAP 
domain polypeptide-EGFP condenses in apoptotic bodies. 

The ability of the THAP-family- or THAP domain polypeptides to undergo nuclear 
localization and to induce apoptosis can be tested by transient expression in 293 human kidney 
cells. If proved susceptible to THAP-family- or THAP domain- induced apoptosis, 293 cells can 
serve as a convenient initial screen for those THAP family or THAP domain polypeptides, or 
biologically active fragments or homologues thereof that will likely also induce apoptosis in other 
cells (e.g. endothelial cells or cancer cells). In an exemplary protocol, 293 cells are transfected with 
plasmid vectors expressing THAP-family- or THAP domain- EGFP fusion protein. Approximately 
5 × 10^6 293 cells in 100 mm dishes are transfected with 10 µg of plasmid DNA using the 
calcium-phosphate method. The plasmids used comprise a CMV enhancer/promoter and a THAP-family- 
or THAP domain- EGFP coding sequence. Apoptosis is evaluated 24 hrs after transfection by 
TUNEL and DAPI staining. The THAP-family- or THAP domain- EGFP vector transfected cells 
are evaluated by fluorescence microscopy with observation of typical nuclear aggregation of the 
EGFP marker as an indication of apoptosis. If apoptotic, the distribution of EGFP signal in cells 
expressing THAP-family- or THAP domain-EGFP will be identical to the distribution of DAPI or 
Hoechst 33342 dyes, which are conventionally used to detect the nuclear DNA changes associated 
with apoptosis (Cohen et al., supra). 

The ability of the THAP family or THAP domain polypeptides, or biologically active 
fragments or homologues thereof to induce apoptosis can also be tested by expression assays in 
human cancer cells, for example as available from NCI. Vector type (for example plasmid, 
retroviral or Sindbis viral) can be selected based on efficiency in a given cell type. After the period 
indicated, cells are evaluated for morphological signs of apoptosis, including aggregation of 
THAP-family- or THAP domain- EGFP into nuclear apoptotic bodies. Cells are counted under a 
fluorescence microscope and scored as to the presence or absence of apoptotic signs, or cells are 
scored by fluorescent TUNEL assay and counted in a flow cytometer. Apoptosis is expressed as a 
percent of cells displaying typical advanced changes of apoptosis. 
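
The percentage scoring described above reduces to a simple proportion. The following is a minimal Python sketch, assuming the cells have already been counted and classified by microscopy or flow cytometry. The function name and the `min_cells` guard (borrowed from the figure of approximately 100 EGFP-positive cells mentioned earlier for fluorescence microscopy) are illustrative assumptions and are not part of the described protocol.

```python
def percent_apoptotic(apoptotic_cells: int, total_cells: int, min_cells: int = 100) -> float:
    """Percentage of scored cells that display apoptotic changes.

    min_cells is a hypothetical guard against scoring too few cells.
    """
    if total_cells < min_cells:
        raise ValueError(f"only {total_cells} cells scored; at least {min_cells} expected")
    if not 0 <= apoptotic_cells <= total_cells:
        raise ValueError("apoptotic_cells must lie between 0 and total_cells")
    return 100.0 * apoptotic_cells / total_cells


# Hypothetical counts, for illustration only.
print(percent_apoptotic(apoptotic_cells=30, total_cells=120))  # 25.0
```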

Cells from the NCI panel of tumor cells include, for example: 

-colon cancer, expression using a retroviral expression vector, with evaluation of apoptosis 
at 96 hrs post-infection (cell lines KM12; HT-29; SW-620; COLO205; HCT-5; HCC 2998; 
HCT-116); 

-CNS tumors, expression using a retroviral expression vector, with evaluation of apoptosis 
at 96 hrs post-infection (cell lines SF-268, astrocytoma; SF-539, glioblastoma; SNB-19, 
glioblastoma; SNB-75, astrocytoma; and U251, glioblastoma); 

-leukemia cells, expression using a retroviral expression vector, with evaluation of 
apoptosis at 96 hrs post-infection (cell lines CCRF-CEM, acute lymphocytic leukemia (ALL); 
K562, acute myelogenous leukemia (AML); MOLT-4, ALL; SR, immunoblastoma large cell; and 
RPMI 8226, Myeloblastoma); 

-prostate cancer, expression using a retroviral expression vector, with evaluation of 
apoptosis at 96 hrs post-infection (PC-3); 

-kidney cancer, expression using a retroviral expression vector, with evaluation of apoptosis 
at 96 hrs post-infection (cell lines 768-0; UO-31; TK10; ACHN); 

-skin cancer, expression using a retroviral expression vector, with evaluation of apoptosis at 
96 hrs post-infection (Melanoma) (cell lines SKMEL-28; M14; SKMEL-5; MALME-3); 

-lung cancer, expression using a retroviral expression vector, with evaluation of apoptosis at 
96 hrs post-infection (cell lines HOP-92; NCI-H460; HOP-62; NCI-H522; NCI-H23; A549; 
NCI-H226; EKVX; NCI-H322); 

-breast cancer, expression using a retroviral expression vector, with evaluation of apoptosis 
at 96 hrs post-infection (cell lines MCF-7; T-47D; MCF-7/ADR; MDAMB43; MDAMB23; 
MDA-N; BT-549); 

-ovary cancer, expression using either a retroviral expression vector and protocol or the 
Sindbis viral expression vector and protocol, with evaluation of apoptosis at 96 hrs post-infection 
with retrovirus or at 24 hrs post-infection with Sindbis viral vectors (cell lines OVCAR-8; 
OVCAR-4; IGROV-1; OVCAR-5; OVCAR3; SK-OV-3). 

In a further representative example, the susceptibility of malignant melanoma cells to
apoptosis induced by a THAP family or THAP domain polypeptide, or a biologically active
fragment or homologue thereof, can be tested in several known melanoma cell types: human
melanoma WM 266-4 (ATCC CRL-1676); human malignant melanoma A-375 (ATCC CRL-1619);
human malignant melanoma A2058 (ATCC CRL-11147); human malignant melanoma SK-MEL-31
(ATCC HTB-73); human malignant melanoma RPMI-7591 ATCC HTB-66 (metastasis to
lymph node). Primary melanoma isolates can also be tested. In addition, human chronic
myelogenous leukemia K-562 cells (ATCC CCL-243), and 293 human kidney cells (ATCC CRL-1573)
(transformed primary embryonal cell) are tested. Normal human primary dermal fibroblasts
and Rat-1 fibroblasts serve as controls. All melanoma cell lines are metastatic on the basis of their
isolation from metastases or metastatic nodules. A transient expression strategy is used in order to
cells) are plated in 24 well dishes at 3.5 x 10^4 cells/well. The following day, the cells are transfected
with a marker plasmid encoding beta-galactosidase, in combination with an expression plasmid
encoding THAP-family or THAP domain polypeptide, by the Lipofectamine procedure
(Gibco/BRL). At 24 hours post transfection, cells are fixed and stained with X-Gal to detect
beta-galactosidase expression in cells that received plasmid DNA (Miura et al., supra). The number of
blue cells is counted by microscopic examination and scored as either live (flat blue cells) or dead
(round blue cells). The cell killing activity of the THAP-family or THAP domain polypeptide in
this assay is manifested by a large reduction in the number of blue cells obtained relative to
co-transfection of the beta-gal plasmid with a control expression vector (i.e., with no THAP-family or
THAP domain polypeptide cDNA insert).
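
By way of illustration only, the scoring described above can be reduced to a simple calculation. The following sketch assumes that live (flat) and dead (round) blue cells have been counted for both the control-vector well and the THAP-expressing well. The names `BlueCellCounts`, `percent_reduction` and `dead_fraction` are hypothetical and do not appear in the cited protocols.

```python
from dataclasses import dataclass


@dataclass
class BlueCellCounts:
    """X-Gal positive cells scored in one well (hypothetical container)."""
    live: int  # flat blue cells
    dead: int  # round blue cells

    @property
    def total(self) -> int:
        return self.live + self.dead


def percent_reduction(control: BlueCellCounts, test: BlueCellCounts) -> float:
    """Percent reduction in blue cells relative to the control vector."""
    if control.total == 0:
        raise ValueError("control well contains no blue cells")
    return 100.0 * (control.total - test.total) / control.total


def dead_fraction(counts: BlueCellCounts) -> float:
    """Fraction of blue cells scored as dead (round)."""
    return counts.dead / counts.total if counts.total else 0.0


# Illustrative counts only; these are not experimental data.
control = BlueCellCounts(live=180, dead=20)
thap = BlueCellCounts(live=30, dead=20)
print(percent_reduction(control, thap), dead_fraction(thap))
```

A large positive value of `percent_reduction` corresponds to the "large reduction in the number of blue cells" that the assay treats as cell killing activity.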

In yet another example, beta-galactosidase co-transfection assays can be used for
determination of cell death. The assay is performed as described (Hsu, H. et al., (1995) Cell 81,
495-504; Hsu, H. et al., (1996a) Cell 84, 299-308; and Hsu, H. et al., (1996b) Immunity 4, 387-396 and
U.S. Patent No. 6,242,569). Transfected cells are stained with X-gal as described in Shu, H. B. et al.,
((1995) J. Cell Sci. 108, 2955-2962). The number of blue cells from 8 viewing fields of a 35 mm
dish is determined by counting. The average number from one representative experiment is shown.
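
As a minimal sketch of the counting step above, the average can be taken over the 8 viewing fields of one dish. The function name `mean_blue_cells` and the example counts are assumptions made for illustration and are not taken from the cited references.

```python
from statistics import mean

FIELDS_PER_DISH = 8  # number of viewing fields per 35 mm dish, as stated in the text


def mean_blue_cells(field_counts):
    """Average number of blue cells per viewing field for one dish."""
    if len(field_counts) != FIELDS_PER_DISH:
        raise ValueError(f"expected {FIELDS_PER_DISH} viewing fields")
    if any(count < 0 for count in field_counts):
        raise ValueError("cell counts cannot be negative")
    return mean(field_counts)


# Illustrative counts only; these are not experimental data.
print(mean_blue_cells([12, 15, 9, 11, 14, 10, 13, 12]))
```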

Assays for apoptosis can also be carried out by making use of any suitable biological 
marker of apoptosis. Several methods are described as follows. 

In one aspect, fluorocytometric studies of cell death status can be carried out. Technology
used in fluorocytometric studies employs the identification of cells at three different phases of the
cell cycle: G1, S and G2. This is largely performed by DNA quantity staining by propidium iodide
labeling. Since the dying cell population contains the same DNA quantity as the living counterparts
at any of the three phases of the cell cycle, DNA staining alone cannot distinguish the two cell
populations. One can perform double labeling for a biological marker of apoptosis (e.g. terminin
Tp30, U.S. Patent No. 5,783,667) positivity and propidium iodide (PI) staining together. Measurement
of the labeling indices for the biological marker of apoptosis and PI staining can be used in
combination to obtain the exact fractions of those cells in G1 that are living and dying. Similar
estimations can be made for the S-phase and G2 phase cell populations.
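
The double-labeling estimate described above can be sketched as follows, assuming that each cell has already been assigned a cell cycle phase from its PI (DNA content) signal and scored as positive or negative for the marker of apoptosis. The gating itself is not shown, and the function name `dying_fraction_by_phase` is hypothetical.

```python
from collections import Counter


def dying_fraction_by_phase(cells):
    """Return the fraction of marker-positive (dying) cells in each phase.

    `cells` is an iterable of (phase, marker_positive) pairs, where phase
    is "G1", "S" or "G2" and marker_positive is a bool.
    """
    totals = Counter()
    dying = Counter()
    for phase, marker_positive in cells:
        totals[phase] += 1
        if marker_positive:
            dying[phase] += 1
    return {phase: dying[phase] / totals[phase] for phase in totals}


# Illustrative per-cell calls only; these are not experimental data.
cells = [
    ("G1", True), ("G1", False), ("G1", False), ("G1", False),
    ("S", True), ("S", True),
    ("G2", False),
]
print(dying_fraction_by_phase(cells))
```

The living fraction in each phase is one minus the dying fraction, so both populations are obtained from the same pair of labeling indices.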

In this assay, the cells are processed for formaldehyde fixation and extraction with 0.05%
Triton. Afterwards, the cell specimens are incubated with monoclonal antibody to a marker of
apoptosis overnight at room temperature or at 37°C for one hour. This is followed by further
incubation with fluoresceinated goat anti-mouse antibody, and subsequent propidium
iodide staining. The completely processed cell specimens are then evaluated by fluorocytometric
measurement of both fluorescence (marker of apoptosis) and rhodamine (PI) labeling intensity on a
per cell basis, with the same cell population simultaneously.

In another aspect, it is possible to assess the inhibitory effect on cell growth by therapeutic
induction of apoptosis. One routine method to determine whether a particular chemotherapeutic
drug can inhibit cancerous cell growth is to examine cell population size either in culture, by
measuring the reduction in cell colony size or number, or by measuring soft agar colony growth or in
vivo tumor formation in nude mice. These procedures require time for the colonies or tumor to
grow large enough to be detectable. Experiments involved in these approaches in general
require large-scale planning and multiple repeats of lengthy experimental span (at least three
weeks). Often these assays do not take into account the fact that a drug may not be inhibiting cell
growth, but rather killing the cells, a more favorable consequence needed for chemotherapeutic
treatment of cancer. Thus, assays for the assessment of apoptosis activity can involve using a
biological or biochemical marker specific for quiescent, non-cycling or non-proliferating cells. For
example, a monoclonal antibody can be used to assess the non-proliferating population of cells in a
given tissue, which indirectly gives a measure of the proliferating component of a tumor or cell
mass. This detection can be combined with a biological or biochemical marker (e.g. antibodies) to
detect the dying cell population pool, providing a powerful and rapid assessment of the
effectiveness of any given drug in the containment of cancerous cell growth. Applications can be
easily performed at the immunofluorescence microscopic level with cultured cells or tissue
sections.

In other aspects, a biological or biochemical marker can be used to assess pharmacological
intervention on inhibition of cell death frequency in degenerative diseases. For degenerative
diseases such as Alzheimer's or Parkinson's disease, the cell losses may be due to the premature
activation of the cell death program in neurons. In osteoporosis, the cell loss may be due to an
improper balance between osteoblast and osteoclast cells, due to an overactive programmed cell
death process killing more cells than the bone tissue can afford. Other related phenomena may also
occur in the wound healing process, tissue transplantation and cell growth in the glomerulus during
kidney infection, where the balance between living and dying cell populations is an essential issue
to the health status of the tissue, and are further described in the section titled "Methods of
treatment". A rapid assessment of dying cell populations can be made through the
immunohistochemical and biochemical measurements of a biological or biochemical marker of
apoptosis in degenerative tissues. In one example, a biological or biochemical marker can be used
to assess cell death status in oligodendrocytes associated with Multiple Sclerosis. Positive staining
of monoclonal antibody to a marker of apoptosis (such as Tp30, U.S. Patent No. 5,783,667) occurs
in dying cultured human oligodendrocytes. The programmed cell death event is activated in these
oligodendrocytes by total deprivation of serum, or by treatment with tumor necrosis factor (TNF).

In general, a biological or biochemical marker can also be used to assess cell death status in
pharmacological studies in animal models. Attempting to control either a reduced cell death rate, in
the case of cancer, or an increased cell death rate, in the case of neurodegeneration, has been
recently seen as a new mode of disease intervention. Numerous approaches via either intervention
with known drugs or gene therapy are in progress, starting from the base of correcting the altered
programmed cell death process, with the concept of maintaining a balanced cell mass in any given
tissue. For these therapeutic interventions, the bridge between studies in cultured cells and clinical
trials is animal studies, i.e. success in intervention with animal models, in either routine laboratory
animals or transgenic mice bearing either knock-out or overexpression phenotypes. Thus, a
biological or biochemical marker of apoptosis, such as an antibody for an apoptosis-specific
protein, is a useful tool for examining apoptotic death status in terms of change in dying cell
numbers between normal and experimentally manipulated animals. In this context the invention, as
a diagnostic tool for assessing cell death status, could help to determine the efficacy and potency of
a drug or a gene therapeutic approach.
As discussed, provided are methods for assessing the activity of THAP-family members
and therapeutic treatment acting on THAP-family members or related biological pathways.
However, in other aspects, the same methods may be used for assessment of apoptosis in general,
when a THAP-family member is used as a biological marker of apoptosis. Thus, the invention also
provides diagnostic and assay methods using a THAP-family member as a marker of cell death or
apoptotic activity. Further diagnostic assays are also provided herein in the section
titled "Diagnostic and prognostic uses".
METHODS OF TREATMENT 

A large body of evidence gathered from experiments carried out with apoptosis modulating
strategies suggests that treatments acting on apoptosis-inducing or cell proliferation-reducing
proteins may offer new treatment methods for a wide range of disorders. Methods of treatment
according to the invention may act in a variety of manners, given the novel function provided for a
number of proteins, and the linking of several biological pathways.

Provided herein are treatment methods based on the functionalization of the THAP-family
members. THAP family or THAP domain polypeptides, and biologically active fragments and
homologues thereof, as described further herein, may be useful in modulation of apoptosis or cell
proliferation.

The methods of treatment involve acting on a molecule of the invention (that is, a THAP
family member polypeptide, THAP-family target, or PAR4 or PAR4 target). Included are methods
which involve modulating THAP-family polypeptide activity, THAP-family target activity, or
PAR4 or PAR4 target activity. This modulation (increasing or decreasing) of activity can be
carried out in a number of suitable ways, several of which have been described in the present
application.

For example, methods of treatment may involve modulating a "THAP-family activity",
"biological activity of a THAP-family member" or "functional activity of a THAP-family member".
Modulating THAP-family activity may involve modulating an association with a THAP-family-target
molecule (for example, association of THAP1, THAP2 or THAP3 with PAR4 or association of
THAP1, THAP2 or THAP3 with a PML-NB protein) or preferably any other activity selected from
the group consisting of: (1) mediating apoptosis or cell proliferation when expressed or introduced
into a cell, most preferably inducing or enhancing apoptosis, and/or most preferably reducing cell
proliferation; (2) mediating apoptosis or cell proliferation of an endothelial cell; (3) mediating
apoptosis or cell proliferation of a hyperproliferative cell; (4) mediating apoptosis or cell
proliferation of a CNS cell, preferably a neuronal or glial cell; or (5) an activity determined in an
animal selected from the group consisting of mediating, preferably inhibiting, angiogenesis,
mediating, preferably inhibiting, inflammation, inhibition of metastatic potential of cancerous
tissue, reduction of tumor burden, increase in sensitivity to chemotherapy or radiotherapy, killing a
cancer cell, inhibition of the growth of a cancer cell, or induction of tumor regression. Detecting
THAP-family activity may also comprise detecting any suitable therapeutic endpoint associated
with a disease condition discussed herein.

In another example, methods of treatment may involve modulating a "PAR4 activity",
"biological activity of PAR4" or "functional activity of PAR4". Modulating PAR4 activity may
involve modulating an association with a PAR4-target molecule (for example THAP1, THAP2,
THAP3 or PML-NB protein) or most preferably PAR4 apoptosis inducing or enhancing (e.g. signal
transducing) activity, or inhibition of cell proliferation or cell cycle.

Methods of treatment may involve modulating the recruitment, binding or association of
proteins to PML-NBs, or otherwise modulating PML-NB activity. The present invention also
provides methods for modulating PAR4 activity, comprising modulating PAR4 interactions with
THAP-family proteins, and PAR4 and PML-NBs, as well as modulating THAP-family activity,
comprising modulating, for example, THAP1 interactions with PML-NBs. The invention
encompasses inhibiting or increasing the recruitment of THAP1 or PAR4 to PML-NBs.
Preventing the binding of either or both of THAP1 or PAR4 to PML-NBs may increase the
bioavailability of THAP1 and/or PAR4, thus providing a method of increasing THAP1 and/or
PAR4 activity. The invention also encompasses inhibiting or increasing the binding of a
THAP-family protein (such as THAP1) or PAR4 to PML-NBs or to another protein associated with
PML-NBs, such as a protein selected from the group consisting of daxx, sp100, sp140, p53, pRB, CBP,
BLM and SUMO-1. For example, the invention encompasses modulating PAR4 activity by preventing
the binding of THAP1 to PAR4, or by preventing the recruitment or binding of PAR4 to PML-NBs.
Therapeutic methods and compositions of the invention may involve (1) modulating
apoptosis or cell proliferation, most preferably inducing or enhancing apoptosis, and/or most
preferably reducing cell proliferation; (2) modulating apoptosis or cell proliferation of an
endothelial cell; (3) modulating apoptosis or cell proliferation of a hyperproliferative cell; (4)
modulating apoptosis or cell proliferation of a CNS cell, preferably a neuronal or glial cell; (5)
inhibition of metastatic potential of cancerous tissue, reduction of tumor burden, increase in
sensitivity to chemotherapy or radiotherapy, killing a cancer cell, inhibition of the growth of a
cancer cell, or induction of tumor regression; or (6) interaction with a THAP family target molecule or
THAP domain target molecule, preferably interaction with a protein or a nucleic acid. Methods
may also involve improving a symptom of or ameliorating a condition as further described herein.
Antiapoptotic therapy 

Molecules of the invention (e.g. those obtained using the screening methods described
herein, dominant negative mutants, antibodies etc.) which inhibit apoptosis are also expected to be
useful in the treatment and/or prevention of disease. Diseases in which it is desirable to prevent
apoptosis include neurodegenerative diseases such as Alzheimer's disease, Parkinson's disease,
amyotrophic lateral sclerosis, retinitis pigmentosa and cerebellar degeneration; myelodysplasia such
as aplastic anemia; ischemic diseases such as myocardial infarction and stroke; hepatic diseases
such as alcoholic hepatitis, hepatitis B and hepatitis C; joint diseases such as osteoarthritis;
atherosclerosis; and the like. The apoptosis inhibitor of the present invention is especially preferably
used as an agent for prophylaxis or treatment of a neurodegenerative disease (see also Adams, J.
M., Science, 281:1322 (1998)).

Included as inhibitors of apoptosis as described herein is generally any molecule which
inhibits activity of a THAP family or THAP domain polypeptide, or a biologically active fragment
or homologue thereof, a THAP-family target protein or PAR4 (particularly PAR4/PML-NB protein
interactions). THAP-family and THAP domain polypeptide inhibitors may include for example
antibodies, peptides, dominant negative THAP-family or THAP domain analogs, small molecules,
ribozymes or antisense nucleic acids. These inhibitors may be particularly advantageous in the
treatment of neurodegenerative disorders. Particularly preferred are inhibitors which affect binding
of THAP-family protein to a THAP-family target protein, and inhibitors which affect the DNA
binding activity of a THAP-family protein.

In further preferred aspects the invention provides inhibitors of THAP-family activity,
including but not limited to molecules which interfere with or inhibit interactions of THAP-family
proteins with PAR4, for the treatment of endothelial cell related disorders and neurodegenerative
disorders. Support is found in the literature, as PAR4 appears to play a key role in neuronal
apoptosis in various neurodegenerative disorders (Guo et al., 1998; Mattson et al., 2000; Mattson et
al., 1999; Mattson et al., 2001). THAP1, which is expressed in brain and associates with PAR4, may
therefore also play a key role in neuronal apoptosis. Drugs that inhibit THAP-family and/or inhibit
THAP-family/PAR4 complex formation may lead to the development of novel preventative and
therapeutic strategies for neurodegenerative disorders.
Apoptosis regulation in endothelial cells 

The invention also provides methods of regulating angiogenesis in a subject which are
expected to be useful in the treatment of cancer, cardiovascular diseases and inflammatory diseases.
An inducer of apoptosis of immortalized cells is expected to be useful in suppressing tumorigenesis
and/or metastasis in malignant tumors. Examples of malignant tumors include leukemia (for
example, myelocytic leukemia, lymphocytic leukemia such as Burkitt lymphoma), digestive tract
carcinoma, lung carcinoma, pancreas carcinoma, ovary carcinoma, uterus carcinoma, brain tumor,
malignant melanoma, other carcinomas, and sarcomas. The present inventors have isolated both
THAP1 and PAR4 cDNAs from human endothelial cells, and both PAR4 and PML are known to be
expressed predominantly in blood vessel endothelial cells (Boghaert et al., (1997) Cell Growth
Differ 8(8):881-90; Terris B. et al., (1995) Cancer Res. 55(7):1590-7), suggesting that the
PML-NBs and the newly associated THAP1/PAR4 proapoptotic complex may be a major regulator
of endothelial cell apoptosis in vivo and thus constitute an attractive therapeutic target for
angiogenesis-dependent diseases. For example, THAP1 and PAR4 pathways may allow selective
treatments that regulate (e.g. stimulate or inhibit) angiogenesis.

In a first aspect, the invention provides methods of inhibiting endothelial cell apoptosis, by
administering a THAP1 or PAR4 inhibitor, or optionally a THAP1/PAR4 interaction inhibitor or
optionally an inhibitor of THAP1 DNA binding activity. As further described herein, the THAP
domain is involved in THAP1 pro-apoptotic activity. Deletion of the THAP domain abrogates the
proapoptotic activity of THAP1 in mouse 3T3 fibroblasts, as shown in Example 11. Also, as
further described herein, deletion of residues 168-172 or replacement of residues 171-172 abrogates
THAP1 binding to PAR4 both in vitro and in vivo and results in lack of recruitment of PAR4 by
THAP1 to PML-NBs. For PAR4, the leucine zipper domain is required (and is sufficient) for
binding to THAP1.

Inhibiting endothelial cell apoptosis may improve angiogenesis and vasculogenesis in
patients with ischemia and may also interfere with focal dysregulated vascular remodeling, the key
mechanism for atherosclerotic disease progression.

In another aspect, the invention provides methods of inducing endothelial cell apoptosis, by
administering for example a biologically active THAP family polypeptide such as THAP1, a THAP
domain polypeptide or a PAR4 polypeptide, or a biologically active fragment or homologue
thereof, or a THAP1 or PAR4 stimulator. Stimulation of endothelial cell apoptosis may prevent or
inhibit angiogenesis and thus limit unwanted neovascularization of tumors or inflamed tissues (see
Dimmeler and Zeiher, Circulation Research, 2000, 87:434-439).
Angiogenesis 

Angiogenesis is defined in the adult organism as the formation of new blood vessels by a
process of sprouting from pre-existing vessels. This neovascularization involves activation,
migration, and proliferation of endothelial cells and is driven by several stimuli, among them shear
stress. Under normal physiological conditions, humans or animals undergo angiogenesis only in
very specific restricted situations. For example, angiogenesis is normally observed in wound
healing, fetal and embryonal development and formation of the corpus luteum, endometrium and
placenta. Molecules of the invention may have endothelial inhibiting or inducing activity, having
the capability to inhibit or induce angiogenesis in general.
Both controlled and uncontrolled angiogenesis are thought to proceed in a similar manner.
Endothelial cells and pericytes, surrounded by a basement membrane, form capillary blood vessels.
Angiogenesis begins with the erosion of the basement membrane by enzymes released by
endothelial cells and leukocytes. The endothelial cells, which line the lumen of blood vessels, then
protrude through the basement membrane. Angiogenic stimulants induce the endothelial cells to
migrate through the eroded basement membrane. The migrating cells form a "sprout" off the parent
blood vessel, where the endothelial cells undergo mitosis and proliferate. The endothelial sprouts
merge with each other to form capillary loops, creating the new blood vessel.

Persistent, unregulated angiogenesis occurs in a multiplicity of disease states, tumor
metastasis and abnormal growth by endothelial cells and supports the pathological damage seen in
these conditions. The diverse pathological disease states in which unregulated angiogenesis is
present have been grouped together as angiogenic dependent or angiogenic associated diseases. It is
thus an object of the present invention to provide methods and compositions for treating diseases
and processes that are mediated by angiogenesis including, but not limited to, hemangioma, solid
tumors, leukemia, metastasis, telangiectasia, psoriasis, scleroderma, pyogenic granuloma,
myocardial angiogenesis, plaque neovascularization, coronary collaterals, ischemic limb
angiogenesis, corneal diseases, rubeosis, neovascular glaucoma, diabetic retinopathy, retrolental
fibroplasia, arthritis, diabetic neovascularization, macular degeneration, wound healing, peptic
ulcer, fractures, keloids, vasculogenesis, hematopoiesis, ovulation, menstruation, and placentation.

(i) Anti-angiogenic therapy

In one aspect the invention provides anti-angiogenic therapies as potential treatments for a
wide variety of diseases, including cancer, arteriosclerosis, obesity, arthritis, duodenal ulcers,
psoriasis, proliferative skin disorders, cardiovascular disorders and abnormal ocular
neovascularization caused, for example, by diabetes (Folkman, Nature Medicine 1:27 (1995) and
Folkman, Seminars in Medicine of the Beth Israel Hospital, Boston, New England Journal of
Medicine, 333:1757 (1995)). Anti-angiogenic therapies are thought to act by inhibiting the
formation of new blood vessels.

The present invention thus provides methods and compositions for treating diseases and
processes mediated by undesired and uncontrolled angiogenesis by administering to a human or
animal a composition comprising a substantially purified THAP family or THAP domain
polypeptide, or a biologically active fragment, homologue or derivative thereof in a dosage
sufficient to inhibit angiogenesis, administering a vector capable of expressing a nucleic acid
encoding a THAP-family or THAP domain protein, or administering any other inducer of
expression or activity of a THAP-family or THAP domain protein. The present invention is
particularly useful for treating or for repressing the growth of tumors. Administration of
THAP-family or THAP domain nucleic acid, protein or other inducer to a human or animal with
prevascularized metastasized tumors will prevent the growth or expansion of those tumors.
THAP-family activity may be used in combination with other compositions and procedures for the
treatment of diseases. For example, a tumor may be treated conventionally with surgery, radiation
or chemotherapy combined with THAP-family or THAP domain protein and then THAP-family or
THAP domain protein may be subsequently administered to the patient to extend the dormancy of
micrometastases and to stabilize any residual primary tumor.

In a preferred example, a THAP-family polypeptide activity, preferably a THAP1 activity, is
used for the treatment of arthritis, for example rheumatoid arthritis. Rheumatoid arthritis is
characterized by symmetric, polyarticular inflammation of synovial-lined joints, and may involve
extraarticular tissues, such as the pericardium, lung, and blood vessels.
(ii) Angiogenic therapy 

In another aspect, the inhibitors of THAP-family protein activity, particularly THAP1 
activity, could be used as an anti-apoptotic and thus as an angiogenic therapy. Angiogenic 
therapies are potential treatments for promoting wound healing and for stimulating the growth of 
new blood vessels to by-pass occluded ones. Thus, pro-angiogenic therapies could potentially 
augment or replace by-pass surgeries and balloon angioplasty (PTCA). For example, with respect 
to neovascularization to bypass occluded blood vessels, a "therapeutically effective amount" is a 
quantity which results in the formation of new blood vessels which can transport at least some of 
the blood which normally would pass through the blocked vessel. 

The THAP-family protein of the present invention can for example be used to generate
antibodies that can be used as inhibitors of apoptosis. The antibodies can be either polyclonal
antibodies or monoclonal antibodies. In addition, these antibodies that specifically bind to the
THAP-family protein can be used in diagnostic methods and kits that are well known to those of
ordinary skill in the art to detect or quantify the THAP-family protein in a body fluid. Results from
these tests can be used to diagnose or predict the occurrence or recurrence of a cancer and other
angiogenic mediated diseases.

It will be appreciated that other inhibitors of THAP-family and THAP domain proteins can 
also be used in angiogenic therapies, including for example small molecules, antisense nucleic 
acids, dominant negative THAP-family and THAP domain proteins or peptides identified using the 
above methods. 

In view of applications in both angiogenic and antiangiogenic therapies, molecules of the
invention may have endothelial inhibiting or inducing activity, having the capability to inhibit or
induce angiogenesis in general. It will be appreciated that methods of assessing such capability are
known in the art, including for example assessing antiangiogenic properties as the ability to inhibit
the growth of bovine capillary endothelial cells in culture in the presence of fibroblast growth factor.

It is to be understood that the present invention is contemplated to include any derivatives
of the THAP family or THAP domain polypeptides, and biologically active fragments and
homologues thereof that have endothelial inhibitory or apoptotic activity. The present invention
includes full-length THAP-family and THAP domain proteins, derivatives of the THAP-family and
THAP domain proteins and biologically-active fragments of the THAP-family and THAP domain
proteins. These include proteins with THAP-family protein activity that have amino acid
substitutions or have sugars or other molecules attached to amino acid functional groups. The
methods also contemplate the use of genes that code for a THAP-family protein and of proteins that
are expressed by those genes.

As discussed, several methods are described herein for delivering a modulator to a subject
in need of treatment, including for example small molecule modulators, nucleic acids including via
gene therapy vectors, and polypeptides including peptide mimetics, active polypeptides, dominant
negative polypeptides and antibodies. It will thus be appreciated that modulators of the
invention identified according to the methods in the section titled "Drug Screening Assays" can be
further tested in cell or animal models for their ability to ameliorate or prevent a condition
involving a THAP-family polypeptide, particularly THAP1, THAP1, THAP2 or THAP3/PAR4
interactions, THAP-family DNA binding or PAR4/PML-NB interactions. Likewise, nucleic
acids, polypeptides and vectors (e.g. viral) can also be assessed in a similar manner.

An "individual" treated by the methods of this invention is a vertebrate, particularly a
mammal (including model animals of human disease, farm animals, sport animals, and pets), and
typically a human.

"Treatment" refers to clinical intervention in an attempt to alter the natural course of the
individual being treated, and may be performed either for prophylaxis or during the course of
clinical pathology. Desirable effects include preventing occurrence or recurrence of disease,
alleviation of symptoms, diminishment of any direct or indirect pathological consequences of the
disease, such as hyperresponsiveness, inflammation, or necrosis, lowering the rate of disease
progression, amelioration or palliation of the disease state, and remission or improved prognosis.
The "pathology" associated with a disease condition is anything that compromises the well-being,
normal physiology, or quality of life of the affected individual.

Treatment is performed by administering an effective amount of a THAP-family 
polypeptide inhibitor or activator. An "effective amount" is an amount sufficient to effect a 
beneficial or desired clinical result, and can be administered in one or more doses. 

The criteria for assessing response to therapeutic modalities employing the lipid
compositions of this invention are dictated by the specific condition, measured according to
standard medical procedures appropriate for the condition.
REDUCING CHEMOKINE MEDIATED EFFECTS 

Some aspects of the present invention relate to the use of THAP-family polypeptides,
including THAP-1, chemokine-binding domains of THAP-family polypeptides, THAP-family
polypeptide or THAP-family chemokine-binding domain fusions to immunoglobulin Fc, oligomers
of THAP-family polypeptides or THAP-family chemokine-binding domains, or homologs of any of
the above-listed compositions (together and hereinafter referred to as THAP-type chemokine-binding
agents) for reducing the inflammation or the symptoms associated with diseases or
conditions that are influenced or mediated by chemokine binding or activity. In such embodiments,
the THAP-type chemokine-binding agents are administered to a subject in effective amounts so as
to reduce the symptoms associated with the condition. In some embodiments, the chemokine that is
affected by the THAP-type chemokine-binding agent is SLC, CCL19, CCL5, CXCL9, CXCL10 or
a combination of these chemokines. In other embodiments, the chemokine that is affected by the
THAP-type chemokine-binding agent is XCL1, XCL2, CCL1, CCL2, CCL3, CCL3L1, SCYA3L2,
CCL4, CCL4L, CCL5, CCL6, CCL7, CCL8, SCYA9, SCYA10, CCL11, SCYA12, CCL13,
CCL14, CCL15, CCL16, CCL17, CCL18, CCL19, CCL20, CCL21, CCL22, CCL23, CCL24,
CCL25, CCL26, CCL27, CCL28, clone 391, CARP CC-1, CCL1, CK-1, regakine-1, K203,
CXCL1, CXCL1P, CXCL2, CXCL3, PF4, PF4V1, CXCL5, CXCL6, PPBP, SPBPBP, IL8,
CXCL9, CXCL10, CXCL11, CXCL12, CXCL14, CXCL15, CXCL16, NAP-4, LFCA-1, Scyba,
JSC, VHSV-induced protein, CX3CL1, fCLl or a combination of these chemokines. In some
embodiments, the THAP-type chemokine-binding agent is administered directly whereas in other
embodiments it is administered as a pharmaceutical composition. In either case, the routes of
administration that are known in the art and described herein may be used to deliver the THAP-type
chemokine-binding agent to the subject.
Some embodiments of the present invention relate to a device for delivering the THAP-type
chemokine-binding agent or pharmaceutical composition thereof to the subject. In such
embodiments, the device comprises a container which contains the THAP-type chemokine-binding
agent or pharmaceutical composition thereof. For example, in some embodiments, the device may
be a conventional device including, but not limited to, syringes, devices for intranasal
administration of compositions and vaccine guns. In one embodiment, the device comprises a
member which receives the THAP-type chemokine-binding agent or pharmaceutical composition
thereof in communication with a mechanism for delivering the composition to the subject. In some
embodiments, the device is an inhaler or a patch for transdermal administration.
Pharmaceutical Compositions 

Compounds capable of inhibiting THAP-family activity, preferably small molecules but
also including peptides, THAP-family nucleic acid molecules, THAP-family proteins, and
anti-THAP-family antibodies (also referred to herein as "active compounds") of the invention can be
incorporated into pharmaceutical compositions suitable for administration. Such compositions
typically comprise a pharmaceutically acceptable carrier. As used herein the language
"pharmaceutically acceptable carrier" is intended to include any and all solvents, dispersion media,
coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like,
compatible with pharmaceutical administration. The use of such media and agents for
pharmaceutically active substances is well known in the art. Except insofar as any conventional
media or agent is incompatible with the active compound, use thereof in the compositions is
contemplated. Supplementary active compounds can also be incorporated into the compositions.

A pharmaceutical composition of the invention is formulated to be compatible with its 
intended route of administration. Examples of routes of administration include parenteral, e.g., 
intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (topical), transmucosal,
and rectal administration. Solutions or suspensions used for parenteral, intradermal, or
subcutaneous application can include the following components: a sterile diluent such as water for
injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other
synthetic solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such
as ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid;
buffers such as acetates, citrates or phosphates and agents for the adjustment of tonicity such as 
sodium chloride or dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or 
sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable syringes or 
multiple dose vials made of glass or plastic. 
Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of
sterile injectable solutions or dispersions. For intravenous administration, suitable carriers include
physiological saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, N.J.) or phosphate
buffered saline (PBS). In all cases, the composition must be sterile and should be fluid to the extent
that easy syringability exists. It must be stable under the conditions of manufacture and storage and
must be preserved against the contaminating action of microorganisms such as bacteria and fungi.
The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol
(for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and suitable
mixtures thereof. The proper fluidity can be maintained, for example, by the use of a coating such
as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use
of surfactants. Prevention of the action of microorganisms can be achieved by various antibacterial
and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and
the like. In many cases, it will be preferable to include isotonic agents, for example, sugars,
polyalcohols such as mannitol or sorbitol, or sodium chloride in the composition. Prolonged absorption of
the injectable compositions can be brought about by including in the composition an agent which
delays absorption, for example, aluminum monostearate and gelatin.

Where the active compound is a protein, peptide or anti-THAP-family antibody, sterile
injectable solutions can be prepared by incorporating the active compound in the required
amount in an appropriate solvent with one or a combination of ingredients enumerated above, as
required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the
active compound into a sterile vehicle which contains a basic dispersion medium and the required
other ingredients from those enumerated above. In the case of sterile powders for the preparation of
sterile injectable solutions, the preferred methods of preparation are vacuum drying and
freeze-drying, which yield a powder of the active ingredient plus any additional desired ingredient from a
previously sterile-filtered solution thereof.

Oral compositions generally include an inert diluent or an edible carrier. They can be
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic
administration, the active compound can be incorporated with excipients and used in the form of
tablets, troches, or capsules. For administration by inhalation, the compounds are delivered in the
form of an aerosol spray from a pressurized container or dispenser which contains a suitable propellant,
e.g., a gas such as carbon dioxide, or a nebulizer. Systemic administration can also be by
transmucosal or transdermal means. For transmucosal or transdermal administration, penetrants
appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally
known in the art, and include, for example, for transmucosal administration, detergents, bile salts,
and fusidic acid derivatives. Transmucosal administration can be accomplished through the use of
nasal sprays or suppositories. For transdermal administration, the active compounds are formulated
into ointments, salves, gels, or creams as generally known in the art. Most preferably, the active
compound is delivered to a subject by intravenous injection.

In one embodiment, the active compounds are prepared with carriers that will protect the
compound against rapid elimination from the body, such as a controlled release formulation,
including implants and microencapsulated delivery systems. Biodegradable, biocompatible
polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen,
polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be apparent
to those skilled in the art. The materials can also be obtained commercially from Alza Corporation
and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically acceptable
carriers. These can be prepared according to methods known to those skilled in the art, for example,
as described in U.S. Pat. No. 4,522,811.

It is especially advantageous to formulate oral or preferably parenteral compositions in
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form as used
herein refers to physically discrete units suited as unitary dosages for the subject to be treated; each
unit containing a predetermined quantity of active compound calculated to produce the desired
therapeutic effect in association with the required pharmaceutical carrier. The specifications for the
dosage unit forms of the invention are dictated by and directly dependent on the unique
characteristics of the active compound and the particular therapeutic effect to be achieved, and the
limitations inherent in the art of compounding such an active compound for the treatment of
individuals.

Toxicity and therapeutic efficacy of such compounds can be determined by standard
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD50
(the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50%
of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and
it can be expressed as the ratio LD50/ED50. Compounds which exhibit large therapeutic indices are
preferred. While compounds that exhibit toxic side effects may be used, care should be taken to
design a delivery system that targets such compounds to the site of affected tissue in order to
minimize potential damage to uninfected cells and, thereby, reduce side effects.
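
The therapeutic index described above is a simple dose ratio. As a minimal illustrative sketch in Python (the function name `therapeutic_index` and the example dose values are hypothetical and are not taken from this specification), it might be computed as follows:

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the therapeutic index, i.e. the ratio LD50/ED50.

    Both doses must be positive and expressed in the same units.
    """
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical example doses (same units), for illustration only.
print(therapeutic_index(ld50=500.0, ed50=25.0))
```

A larger returned value corresponds to a wider margin between the toxic and the therapeutically effective dose, which is the property preferred above.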

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies preferably 
within a range of circulating concentrations that include the ED50 with little or no toxicity. The 
dosage may vary within this range depending upon the dosage form employed and the route of
administration utilized. For any compound used in the method of the invention, the therapeutically
effective dose can be estimated initially from cell culture assays. A dose may be formulated in
animal models to achieve a circulating plasma concentration range that includes the IC50 (i.e., the
concentration of the test compound which achieves a half-maximal inhibition of symptoms) as
determined in cell culture. Such information can be used to more accurately determine useful doses
in humans. Levels in plasma may be measured, for example, by high performance liquid 
chromatography. 
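
By way of illustration only, an IC50 could be estimated from cell culture dose-response measurements by interpolating between the two tested concentrations that bracket half-maximal inhibition. The following Python sketch assumes that inhibition is expressed as a fraction between 0 and 1 and that interpolation is performed on a log10 concentration scale; the function name `estimate_ic50` and the interpolation scheme are assumptions of this sketch, not a method prescribed by this specification.

```python
import math
from typing import Optional, Sequence


def estimate_ic50(concentrations: Sequence[float],
                  inhibition: Sequence[float]) -> Optional[float]:
    """Estimate the concentration giving 50% inhibition.

    Interpolates linearly in log10(concentration) between the two
    adjacent data points that bracket an inhibition fraction of 0.5.
    Returns None when the data never cross 50% inhibition.
    """
    if len(concentrations) != len(inhibition):
        raise ValueError("inputs must have the same length")
    if any(c <= 0 for c in concentrations):
        raise ValueError("concentrations must be positive")
    # Sort the measurements by increasing concentration.
    points = sorted(zip(concentrations, inhibition))
    for (c_lo, y_lo), (c_hi, y_hi) in zip(points, points[1:]):
        if y_lo <= 0.5 <= y_hi and y_hi > y_lo:
            fraction = (0.5 - y_lo) / (y_hi - y_lo)
            log_ic50 = math.log10(c_lo) + fraction * (
                math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None
```

In practice a fitted sigmoidal dose-response model may be preferred; the interpolation above is merely the simplest form of the estimate.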

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 

It will be appreciated that THAP-type chemokine-binding agents can be formulated as
pharmaceutical compositions and administered as described above. Additionally, the effective 
dose, route of administration, duration of administration, duration between doses and therapeutic 
effect can be determined by the methods described above as well as using methods that are well 
known in the art. 

Diagnostic and Prognostic Uses

The nucleic acid molecules, proteins, protein homologues, and antibodies described herein 
can be used in one or more of the following methods: diagnostic assays, prognostic assays, 
monitoring clinical trials, and pharmacogenetics; and in drug screening and methods of treatment 
(e.g., therapeutic and prophylactic) as further described herein. 

The invention provides diagnostic and prognostic assays for detecting THAP-family
members, as further described. Also provided are diagnostic and prognostic assays for detecting 
interactions between THAP-family members and THAP-family target molecules. In a preferred 
example, a THAP-family member is THAP1, THAP2 or THAP3 and the THAP-family target is 
PAR4 or a PML-NB protein. 

The invention also provides diagnostic and prognostic assays for detecting THAP1 and/or
PAR4 localization to or association with PML-NBs, or association with or binding to a
PML-NB-associated protein, such as daxx, sp100, sp140, p53, pRB, CBP, BLM or SUMO-1. In a preferred
method, the invention provides detecting PAR4 localization to or association with PML-NBs. In a
further aspect, the invention provides detecting THAP-family nucleic acid binding activity.

The isolated nucleic acid molecules of the invention can be used, for example, to detect
THAP-family polypeptide mRNA (e.g., in a biological sample) or a genetic alteration in a
THAP-family gene, and to modulate a THAP-family polypeptide activity, as described further below. The
THAP-family proteins can be used to treat disorders characterized by insufficient or excessive
production of a THAP-family protein or THAP-family target molecules. In addition, the
THAP-family proteins can be used to screen for naturally occurring THAP-family target molecules, to
screen for drugs or compounds which modulate, preferably inhibit, THAP-family activity, as well as
to treat disorders characterized by insufficient or excessive production of THAP-family protein or
production of THAP-family protein forms which have decreased or aberrant activity compared to
THAP-family wild type protein. Moreover, the anti-THAP-family antibodies of the invention can
be used to detect and isolate THAP-family proteins, regulate the bioavailability of THAP-family
proteins, and modulate THAP-family activity.

Accordingly, one embodiment of the present invention involves a method of use (e.g., a
diagnostic assay, prognostic assay, or a prophylactic/therapeutic method of treatment) wherein a
molecule of the present invention (e.g., a THAP-family protein, THAP-family nucleic acid, or most
preferably a THAP-family inhibitor or activator) is used, for example, to diagnose, prognose and/or
treat a disease and/or condition in which any of the aforementioned THAP-family activities is
indicated. In another embodiment, the present invention involves a method of use (e.g., a diagnostic
assay, prognostic assay, or a prophylactic/therapeutic method of treatment) wherein a molecule of
the present invention (e.g., a THAP-family protein, THAP-family nucleic acid, or a THAP-family
inhibitor or activator) is used, for example, for the diagnosis, prognosis, and/or treatment of
subjects, preferably a human subject, in which any of the aforementioned activities is pathologically
perturbed. In a preferred embodiment, the methods of use (e.g., diagnostic assays, prognostic
assays, or prophylactic/therapeutic methods of treatment) involve administering to a subject,
preferably a human subject, a molecule of the present invention (e.g., a THAP-family protein,
THAP-family nucleic acid, or a THAP-family inhibitor or activator) for the diagnosis, prognosis,
and/or therapeutic treatment. In another embodiment, the methods of use (e.g., diagnostic assays,
prognostic assays, or prophylactic/therapeutic methods of treatment) involve administering to a
human subject a molecule of the present invention (e.g., a THAP-family protein, THAP-family
nucleic acid, or a THAP-family inhibitor or activator).

For example, the invention encompasses a method of determining whether a THAP-family
member is expressed within a biological sample comprising: a) contacting said biological sample
with: ii) a polynucleotide that hybridizes under stringent conditions to a THAP-family nucleic acid;
or iii) a detectable polypeptide (e.g. antibody) that selectively binds to a THAP-family polypeptide;
and b) detecting the presence or absence of hybridization between said polynucleotide and an RNA
species within said sample, or the presence or absence of binding of said detectable polypeptide to a
polypeptide within said sample. A detection of said hybridization or of said binding indicates that
said THAP-family member is expressed within said sample. Preferably, the polynucleotide is a
primer, and said hybridization is detected by detecting the presence of an amplification
product comprising said primer sequence, or the detectable polypeptide is an antibody.

Also envisioned is a method of determining whether a mammal, preferably human, has an 
elevated or reduced level of expression of a THAP-family member, comprising: a) providing a 
biological sample from said mammal; and b) comparing the amount of a THAP-family polypeptide 
or of a THAP-family RNA species encoding a THAP-family polypeptide within said biological 
sample with a level detected in or expected from a control sample. An increased amount of said 
THAP-family polypeptide or said THAP-family RNA species within said biological sample 
compared to said level detected in or expected from said control sample indicates that said mammal 
has an elevated level of THAP-family expression, and a decreased amount of said THAP-family 
polypeptide or said THAP-family RNA species within said biological sample compared to said 
level detected in or expected from said control sample indicates that said mammal has a reduced
level of expression of a THAP-family member. 
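
The comparison of step b) can be sketched, purely for illustration, as a small Python function. The name `classify_expression` and the optional `tolerance` parameter (a margin within which the sample is treated as equivalent to the control) are assumptions introduced for this sketch; the specification itself does not fix any threshold.

```python
def classify_expression(sample_level: float,
                        control_level: float,
                        tolerance: float = 0.0) -> str:
    """Compare a measured THAP-family polypeptide or RNA amount with a control.

    Returns "elevated" when the sample exceeds the control by more than
    `tolerance`, "reduced" when it falls below the control by more than
    `tolerance`, and "unchanged" otherwise.
    """
    if tolerance < 0:
        raise ValueError("tolerance must be non-negative")
    if sample_level > control_level + tolerance:
        return "elevated"
    if sample_level < control_level - tolerance:
        return "reduced"
    return "unchanged"
```

The control level may be either a level detected in a control sample or an expected value, as stated above; both are passed in the same way.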

The present invention also pertains to the field of predictive medicine in which diagnostic
assays, prognostic assays, and monitoring clinical trials are used for prognostic (predictive)
purposes to thereby treat an individual prophylactically. Accordingly, one aspect of the present
invention relates to diagnostic assays for determining THAP-family protein and/or nucleic acid
expression as well as THAP-family activity, in the context of a biological sample (e.g., blood,
serum, cells, tissue) to thereby determine whether an individual is afflicted with a disease or
disorder, or is at risk of developing a disorder, associated with aberrant THAP-family expression or
activity. The invention also provides for prognostic (or predictive) assays for determining whether
an individual is at risk of developing a disorder associated with a THAP-family protein, nucleic
acid expression or activity. For example, mutations in a THAP-family gene can be assayed in a
biological sample. Such assays can be used for prognostic or predictive purposes to thereby
prophylactically treat an individual prior to the onset of a disorder characterized by or associated
with a THAP-family protein, nucleic acid expression or activity.

Accordingly, the methods of the present invention are applicable generally to diseases
related to regulation of apoptosis, including but not limited to disorders characterized by unwanted 
cell proliferation or generally aberrant control of differentiation, for example neoplastic or 
hyperplastic disorders, as well as disorders related to proliferation or lack thereof of endothelial 
cells, inflammatory disorders and neurodegenerative disorders. 
Diagnostic Assays

An exemplary method for detecting the presence (quantitative or not) or absence of a
THAP-family protein or nucleic acid in a biological sample involves obtaining a biological sample
from a test subject and contacting the biological sample with a compound or an agent capable of
detecting a THAP-family protein or nucleic acid (e.g., mRNA, genomic DNA) that encodes a
THAP-family protein such that the presence of the THAP-family protein or nucleic acid is detected in the
biological sample. A preferred agent for detecting a THAP-family mRNA or genomic DNA is a
labeled nucleic acid probe capable of hybridizing to a THAP-family mRNA or genomic DNA. The
nucleic acid probe can be, for example, a full-length THAP-family nucleic acid, such as the nucleic
acid of SEQ ID NO: 160, or a nucleic acid of at least 15, 30, 50, 100, 250, 400, 500 or 1000
nucleotides in length and sufficient to specifically hybridize under stringent conditions to a
THAP-family mRNA or genomic DNA or a portion of a THAP-family nucleic acid. Other suitable probes
for use in the diagnostic assays of the invention are described herein.

In preferred embodiments, the subject method can be characterized by generally comprising
detecting, in a tissue sample of the subject (e.g. a human patient), the presence or absence of a
genetic lesion characterized by at least one of (i) a mutation of a gene encoding one of the subject
THAP-family proteins or (ii) the mis-expression of a THAP-family gene. To illustrate, such genetic
lesions can be detected by ascertaining the existence of at least one of (i) a deletion of one or more
nucleotides from a THAP-family gene, (ii) an addition of one or more nucleotides to such a
THAP-family gene, (iii) a substitution of one or more nucleotides of a THAP-family gene, (iv) a gross
chromosomal rearrangement or amplification of a THAP-family gene, (v) a gross alteration in the
level of a messenger RNA transcript of a THAP-family gene, (vi) aberrant modification of a
THAP-family gene, such as of the methylation pattern of the genomic DNA, (vii) the presence of a
non-wild type splicing pattern of a messenger RNA transcript of a THAP-family gene, and (viii) a
non-wild type level of a THAP-family target protein.

A preferred agent for detecting a THAP-family protein is an antibody capable of binding to
a THAP-family protein, preferably an antibody with a detectable label. Antibodies can be
polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or
F(ab')2) can be used. The term "labeled", with regard to the probe or antibody, is intended to
encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a
detectable substance to the probe or antibody, as well as indirect labeling of the probe or antibody
by reactivity with another reagent that is directly labeled. Examples of indirect labeling include
detection of a primary antibody using a fluorescently labeled secondary antibody and end-labeling
of a DNA probe with biotin such that it can be detected with fluorescently labeled streptavidin. The
term "biological sample" is intended to include tissues, cells and biological fluids isolated from a
subject, as well as tissues, cells and fluids present within a subject. That is, the detection method of
the invention can be used to detect a THAP-family mRNA, protein, or genomic DNA in a
biological sample in vitro as well as in vivo. For example, in vitro techniques for detection of a
THAP-family mRNA include Northern hybridizations and in situ hybridizations. In vitro techniques
for detection of a THAP-family protein include enzyme linked immunosorbent assays (ELISAs),
Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for detection of
a THAP-family genomic DNA include Southern hybridizations. Furthermore, in vivo techniques
for detection of a THAP-family protein include introducing into a subject a labeled
anti-THAP-family antibody. For example, the antibody can be labeled with a radioactive marker whose
presence and location in a subject can be detected by standard imaging techniques.

In yet another exemplary embodiment, aberrant methylation patterns of a THAP-family 
gene can be detected by digesting genomic DNA from a patient sample with one or more restriction 
endonucleases that are sensitive to methylation and for which recognition sites exist in the THAP- 
family gene (including in the flanking and intronic sequences). See, for example, Buiting et al. 
(1994) Human Mol Genet 3:893-895. Digested DNA is separated by gel electrophoresis, and
hybridized with probes derived from, for example, genomic or cDNA sequences. The methylation
status of the THAP-family gene can be determined by comparison of the restriction pattern
generated from the sample DNA with that for a standard of known methylation. 

Furthermore, gene constructs such as those described herein can be utilized in diagnostic
assays to determine if a cell's growth or differentiation state is no longer dependent on the
regulatory function of a THAP-family protein, e.g. in determining the phenotype of a transformed
cell. Such knowledge can have both prognostic and therapeutic benefits. To illustrate, a sample of
cells from the tissue can be obtained from a patient and dispersed in appropriate cell culture media,
a portion of the cells in the sample can be caused to express a recombinant THAP-family protein or
a THAP-family target protein, e.g. by transfection with an expression vector described herein, or to
increase the expression or activity of an endogenous THAP-family protein or THAP-family target
protein, and subsequent growth of the cells assessed. The absence of a change in phenotype of the
cells despite expression of the THAP-family or THAP-family target protein may be indicative of a
lack of dependence on cell regulatory pathways which include the THAP-family or THAP-family
target protein, e.g. THAP-family- or THAP-family target-mediated transcription. Depending on the
nature of the tissue of interest, the sample can be in the form of cells isolated from, for example, a
blood sample, an exfoliated cell sample, a fine needle aspirate sample, or a biopsied tissue sample.
Where the initial sample is a solid mass, the tissue sample can be minced or otherwise dispersed so
that cells can be cultured, as is known in the art.

In yet another embodiment, a diagnostic assay is provided which detects the ability of a
THAP-family gene product, e.g., isolated from a biopsied cell, to bind to other cellular proteins. For
instance, it will be desirable to detect THAP-family mutants which, while expressed at appreciable
levels in the cell, are defective at binding a THAP-family target protein (having either diminished
or enhanced binding affinity). Such mutants may arise, for example, from mutations, e.g., point
mutants, which may be impractical to detect by the diagnostic DNA sequencing techniques or by
the immunoassays described above. The present invention accordingly further contemplates
diagnostic screening assays which generally comprise cloning one or more THAP-family genes
from the sample cells, and expressing the cloned genes under conditions which permit detection of
an interaction between that recombinant gene product and a target protein, e.g., the
THAP1 gene and a target PAR4 protein or a PML-NB protein. As will be apparent from the
description of the various drug screening assays set forth below, a wide variety of techniques can be 
used to determine the ability of a THAP-family protein to bind to other cellular components. These 
techniques can be used to detect mutations in a THAP-family gene which give rise to mutant 
proteins with a higher or lower binding affinity for a THAP-family target protein relative to the 
wild-type THAP-family. Conversely, by switching which of the THAP-family target protein and 
THAP-family protein is the "bait" and which is derived from the patient sample, the subject assay 
can also be used to detect THAP-family target protein mutants which have a higher or lower 
binding affinity for a THAP-family protein relative to a wild type form of that THAP-family target 
protein. 

In an exemplary embodiment, a PAR4 or a PML-NB protein (e.g. wild-type) can be
provided as an immobilized protein (a "target"), such as by use of GST fusion proteins and
glutathione treated microtitre plates. A THAP1 gene (a "sample" gene) is amplified from cells of a
patient sample, e.g., by PCR, ligated into an expression vector, and transformed into an appropriate
host cell. The recombinantly produced THAP1 protein is then contacted with the immobilized
PAR4 or PML-NB protein, e.g., as a lysate or a semi-purified preparation, the complex washed, and
the amount of PAR4 or PML-NB protein/THAP1 complex determined and compared to a level of
wild-type complex formed in a control. Detection can be by, for instance, an immunoassay using 
antibodies against the wild-type form of the THAP1 protein, or by virtue of a label provided by 
cloning the sample THAP1 gene into a vector which provides the protein as a fusion protein 
including a detectable tag. For example, a myc epitope can be provided as part of a fusion protein 
with the sample THAP1 gene. Such fusion proteins can, in addition to providing a detectable label, 
also permit purification of the sample THAP1 protein from the lysate prior to application to the 
immobilized target. In yet another embodiment of the subject screening assay, the two hybrid 
assay, described in the appended examples, can be used to detect mutations in either a THAP- 
family gene or THAP-family target gene which alter complex formation between those two 
proteins. 
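
As an illustrative sketch only, the comparison of the amount of complex formed by the sample THAP1 protein with the level of wild-type complex formed in a control could be expressed as a background-corrected ratio. The function name `relative_binding`, the `background` parameter and the use of a simple ratio are assumptions of this sketch rather than features of the assay described above.

```python
def relative_binding(sample_signal: float,
                     wild_type_signal: float,
                     background: float = 0.0) -> float:
    """Return sample complex formation relative to the wild-type control.

    Signals are arbitrary detection units (e.g. an immunoassay readout).
    A value near 1.0 suggests wild-type-like binding, a value below 1.0
    diminished binding, and a value above 1.0 enhanced binding.
    """
    corrected_wild_type = wild_type_signal - background
    if corrected_wild_type <= 0:
        raise ValueError("wild-type signal must exceed background")
    # Negative background-corrected sample signals are clamped to zero.
    corrected_sample = max(sample_signal - background, 0.0)
    return corrected_sample / corrected_wild_type
```

The same ratio applies when the roles of "bait" and patient-derived protein are switched, since only the two measured complex levels enter the calculation.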

Accordingly, the present invention provides a convenient method for detecting mutants of 
THAP-family genes encoding proteins which are unable to physically interact with a THAP-family 
target "bait" protein, which method relies on detecting the reconstitution of a transcriptional 
activator in a THAP-family/THAP-family target-dependent fashion. 

In one embodiment, the biological sample contains protein molecules from the test subject. 
Alternatively, the biological sample can contain mRNA molecules from the test subject or genomic 
DNA molecules from the test subject. A preferred biological sample is a serum sample isolated by 
conventional means from a subject. In another embodiment, the methods further involve obtaining
a control biological sample from a control subject, contacting the control sample with a compound
or agent capable of detecting a THAP-family protein, mRNA, or genomic DNA, such that the 
presence of a THAP-family protein, mRNA or genomic DNA is detected in the biological sample, 
and comparing the presence of a THAP-family protein, mRNA or genomic DNA in the control 
sample with the presence of a THAP-family protein, mRNA or genomic DNA in the test sample. 
The invention also encompasses kits for detecting the presence of THAP-family protein, mRNA or 
genomic DNA in a biological sample. For example, the kit can comprise a labeled compound or 
agent capable of detecting a THAP-family protein or mRNA or genomic DNA in a biological 
sample; means for determining the amount of a THAP-family member in the sample; and means for 
comparing the amount of THAP-family member in the sample with a standard. The compound or 
agent can be packaged in a suitable container. The kit can further comprise instructions for using 
the kit to detect THAP-family protein or nucleic acid. 

In certain embodiments, detection involves the use of a probe/primer in a polymerase chain 
reaction (PCR) (see, e.g., U.S. Pat. Nos. 4,683,195 and 4,683,202), such as anchor PCR or RACE 
1 5 PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g., Landegren et al. (1988) Science 
241:1077-1080; and Nakazawa et al. (1994) PNAS 91:360-364), the latter of which can be 
particularly useful for detecting point mutations in the THAP-family-gene (see Abravaya et al. 
(1995) Nucleic Acids Res. 23:675-682). This method can include the steps of collecting a sample of 
cells from a patient, isolating nucleic acid (e.g., genomic, mRNA or both) from the cells of the 
20 sample, contacting the nucleic acid sample with one or more primers which specifically hybridize 
to a THAP-family gene under conditions such that hybridization and amplification of the THAP- 
family-gene (if present) occurs, and detecting the presence or absence of an amplification product, 
or detecting the size of the amplification product and comparing the length to a control sample. It is 
anticipated that PCR and/or LCR may be desirable to use as a preliminary amplification step in 
conjunction with any of the techniques used for detecting mutations described herein. 

Genotyping assays for diagnostics generally require the previous amplification of the DNA 
region carrying the biallelic marker of interest. However, ultrasensitive detection methods which 
do not require amplification are also available. Methods well-known to those skilled in the art that 
can be used to detect biallelic polymorphisms include conventional dot blot 
analyses, single strand conformational polymorphism analysis (SSCP) described by Orita et al., 
PNAS 86:2766-2770 (1989), denaturing gradient gel electrophoresis (DGGE), heteroduplex 
analysis, mismatch cleavage detection, and other conventional techniques as described in Sheffield 
et al. (1991), White et al. (1992), and Grompe et al. (1989 and 1993) (Sheffield, V.C. et al., Proc. 
Natl. Acad. Sci. U.S.A. 49:699-706 (1991); White, M.B. et al., Genomics 12:301-306 (1992); 
Grompe, M. et al., Proc. Natl. Acad. Sci. U.S.A. 86:5855-5892 (1989); and Grompe, M. Nature 
Genetics 5:111-117 (1993)). Another method for determining the identity of the nucleotide present 
at a particular polymorphic site employs a specialized exonuclease-resistant nucleotide derivative as 
described in U.S. patent 4,656,127. Further methods are described as follows. 

The nucleotide present at a polymorphic site can be determined by sequencing methods. In 
a preferred embodiment, DNA samples are subjected to PCR amplification before sequencing as 
described above. DNA sequencing methods are described in "Sequencing Of Amplified Genomic 
DNA And Identification Of Single Nucleotide Polymorphisms". Preferably, the amplified DNA is 
subjected to automated dideoxy terminator sequencing reactions using a dye-primer cycle 
sequencing protocol. Sequence analysis allows the identification of the base present at the biallelic 
marker site. 

In microsequencing methods, the nucleotide at a polymorphic site in a target DNA is 
detected by a single nucleotide primer extension reaction. This method involves appropriate 
microsequencing primers which hybridize just upstream of the polymorphic base of interest in the 
target nucleic acid. A polymerase is used to specifically extend the 3' end of the primer with one 
single ddNTP (chain terminator) complementary to the nucleotide at the polymorphic site. Next, the 
identity of the incorporated nucleotide is determined in any suitable way. Typically, 
microsequencing reactions are carried out using fluorescent ddNTPs and the extended 
microsequencing primers are analyzed by electrophoresis on ABI 377 sequencing machines to 
determine the identity of the incorporated nucleotide as described in EP 412 883. Alternatively, 
capillary electrophoresis can be used in order to process a higher number of assays simultaneously. 
Different approaches can be used for the labeling and detection of ddNTPs. A homogeneous phase 
detection method based on fluorescence resonance energy transfer has been described by Chen and 
Kwok (Nucleic Acids Research 25:347-353, 1997) and Chen et al. 
(Proc. Natl. Acad. Sci. USA 94/20:10756-10761, 1997). In this method, amplified genomic DNA 
fragments containing polymorphic sites are incubated with a 5'-fluorescein-labeled primer in the 
presence of allelic dye-labeled dideoxyribonucleoside triphosphates and a modified Taq 
polymerase. The dye-labeled primer is extended one base by the dye-terminator specific for the 
allele present on the template. At the end of the genotyping reaction, the fluorescence intensities of 
the two dyes in the reaction mixture are analyzed directly without separation or purification. All 
these steps can be performed in the same tube and the fluorescence changes can be monitored in 
real time. Alternatively, the extended primer may be analyzed by MALDI-TOF Mass 
Spectrometry. The base at the polymorphic site is identified by the mass added onto the 
microsequencing primer (see Haff and Smirnov, Genome Research, 7:378-388, 1997). In 
another example, Pastinen et al. (Genome Research 7:606-614, 1997) describe a method for 
multiplex detection of single nucleotide polymorphism in which the solid phase minisequencing 
principle is applied to an oligonucleotide array format. High-density arrays of DNA probes 
attached to a solid support (DNA chips) are further described below. 

Other assays include mismatch detection assays, based on the specificity of polymerases 
and ligases. Polymerization reactions place particularly stringent requirements on correct base 
pairing of the 3' end of the amplification primer, and the joining of two oligonucleotides hybridized 
to a target DNA sequence is quite sensitive to mismatches close to the ligation site, especially at the 
3' end. 
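
By way of illustration only, the sensitivity of polymerase extension to pairing at the 3' end of a primer can be exploited by designing one primer per allele, each terminating on the polymorphic base. The following Python sketch is not part of the methods described herein; the function name, the flanking sequence and the primer length are hypothetical and were chosen solely for the example.

```python
def allele_specific_primers(upstream_flank, alleles, length=20):
    """Return one primer per allele; each primer ends (3' end) on the allele base.

    upstream_flank: sense-strand sequence immediately 5' of the polymorphic site
    alleles:        iterable of single-base alleles, e.g. "AG"
    length:         total primer length, including the 3' terminal allele base
    """
    if length < 2 or len(upstream_flank) < length - 1:
        raise ValueError("flank too short for the requested primer length")
    shared = upstream_flank[-(length - 1):]  # bases common to all primers
    return {allele: shared + allele for allele in alleles}


# Hypothetical flank and alleles, for illustration only.
flank = "ATGGTGCAGTCCTGCTCCGCCTACGGC"
primers = allele_specific_primers(flank, "AG", length=12)
for allele, primer in primers.items():
    print(allele, primer)
```

Only the primer whose 3' terminal base pairs with the template would be expected to be extended efficiently, so that the presence or absence of an amplification product reports the allele.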

A preferred method of determining the identity of the nucleotide present at an allele 
involves nucleic acid hybridization. Any hybridization assay may be used including Southern 
hybridization, Northern hybridization, dot blot hybridization and solid-phase hybridization (see 
Sambrook et al., Molecular Cloning - A Laboratory Manual, Second Edition, Cold Spring Harbor 
Press, N.Y., 1989). Hybridization refers to the formation of a duplex structure by two single 
stranded nucleic acids due to complementary base pairing. Hybridization can occur between 
exactly complementary nucleic acid strands or between nucleic acid strands that contain minor 
regions of mismatch. Specific probes can be designed that hybridize to one form of a biallelic 
marker and not to the other and therefore are able to discriminate between different allelic forms. 
Allele-specific probes are often used in pairs, one member of a pair showing perfect match to a 
target sequence containing the original allele and the other showing a perfect match to the target 
sequence containing the alternative allele. Hybridization conditions should be sufficiently stringent 
that there is a significant difference in hybridization intensity between alleles, and preferably an 
essentially binary response, whereby a probe hybridizes to only one of the alleles. Stringent, 
sequence specific hybridization conditions, under which a probe will hybridize only to the exactly 
complementary target sequence are well known in the art (Sambrook et al., 1989). The detection of 
hybrid duplexes can be carried out by a number of methods. Various detection assay formats are 
well known which utilize detectable labels bound to either the target or the probe to enable 
detection of the hybrid duplexes. Typically, hybridization duplexes are separated from 
unhybridized nucleic acids and the labels bound to the duplexes are then detected. Further, standard 
heterogeneous assay formats are suitable for detecting the hybrids using the labels present on the 
primers and probes (see Landegren U. et al., Genome Research, 8:769-776, 1998). 
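
As a non-limiting illustration of how signals from a pair of allele-specific probes might be interpreted, the following Python sketch assigns a genotype from the two hybridization intensities. The thresholds (`min_signal`, `hom_fraction`) and the intensity values are assumptions made for the example and are not taken from the present description; in practice they would be set empirically for each assay.

```python
def call_genotype(signal_1, signal_2, min_signal=100.0, hom_fraction=0.8):
    """Call a biallelic genotype from two allele-specific probe intensities.

    signal_1, signal_2: background-corrected intensities for the probes
                        matching allele 1 and allele 2, respectively
    min_signal:         total intensity below which no call is made (assumed)
    hom_fraction:       fraction of total signal required to call a
                        homozygote (assumed)
    """
    total = signal_1 + signal_2
    if total < min_signal:
        return "no call"
    fraction_1 = signal_1 / total
    if fraction_1 >= hom_fraction:
        return "allele1/allele1"
    if fraction_1 <= 1.0 - hom_fraction:
        return "allele2/allele2"
    return "allele1/allele2"


# Hypothetical intensities, for illustration only.
print(call_genotype(950.0, 40.0))
print(call_genotype(500.0, 480.0))
```

The closer the hybridization response is to the essentially binary behavior described above, the less sensitive the call is to the exact choice of thresholds.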

Hybridization assays based on oligonucleotide arrays rely on the differences in 
hybridization stability of short oligonucleotides to perfectly matched and mismatched target 
sequence variants. Efficient access to polymorphism information is obtained through a basic 
structure comprising high-density arrays of oligonucleotide probes attached to a solid support (e.g., 
the chip) at selected positions. Chips of various formats for use in detecting biallelic 
polymorphisms can be produced on a customized basis by Affymetrix (GeneChip), Hyseq (HyChip 
and HyGnostics), and Protogene Laboratories. 

In general, these methods employ arrays of oligonucleotide probes that are complementary 
to target nucleic acid sequence segments from an individual, which target sequences include a 
polymorphic marker. EP 785280 describes a tiling strategy for the detection of single nucleotide 
polymorphisms. Briefly, arrays may generally be "tiled" for a large number of specific 
polymorphisms, further described in PCT application No. WO 95/11995. Upon completion of 
hybridization with the target sequence and washing of the array, the array is scanned to determine 
the position on the array to which the target sequence hybridizes. The hybridization data from the 
scanned array is then analyzed to identify which allele or alleles of the biallelic marker are present 
in the sample. Hybridization and scanning may be carried out as described in PCT application No. 
WO 92/10092 and WO 95/11995 and US patent No. 5,424,186. Solid supports and polynucleotides 
of the present invention attached to solid supports are further described in "Oligonucleotide Probes 
And Primers". 
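
For illustration only, one simple way to picture a tiled probe set is as four probes per interrogated position, identical except for the central base, with the base present in the sample inferred from the probe giving the strongest signal. The Python sketch below follows that simplified picture; it is an assumption made for the example and is not asserted to reproduce the strategy of EP 785280 or WO 95/11995. The function names, target sequence and intensities are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def tile_probes(target, position, probe_length=15):
    """Build four probes centered on `position` (0-based) of `target`.

    Each probe is complementary to a target variant carrying one of the
    four bases at the interrogated position; keys are the target bases.
    """
    if probe_length % 2 == 0:
        raise ValueError("probe_length must be odd so the site is centered")
    half = probe_length // 2
    start, end = position - half, position + half + 1
    if start < 0 or end > len(target):
        raise ValueError("site is too close to the end of the target")
    window = target[start:end]
    probes = {}
    for base in "ACGT":
        variant = window[:half] + base + window[half + 1:]
        probes[base] = reverse_complement(variant)
    return probes


def call_base(intensities):
    """Return the target base whose probe gave the highest intensity."""
    return max(intensities, key=intensities.get)


# Hypothetical target and scan intensities, for illustration only.
target = "GATTACAGATTACAGATTACA"
probes = tile_probes(target, position=10, probe_length=7)
called = call_base({"A": 10.0, "C": 12.0, "G": 8.0, "T": 300.0})
```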

DETECTING CHEMOKINES 

Some aspects of the present invention relate to the detection of chemokines by contacting a 
chemokine or a sample containing a chemokine with a THAP-type chemokine-binding agent. In 
some embodiments, the chemokines or the THAP-type chemokine-binding agents are labeled. 
Many labels and methods of conjugating such labels to a chemokine or a THAP-type chemokine- 
binding agent are known in the art. Additionally, labeled molecules, such as antibodies, which have 
an affinity for a THAP-type chemokine-binding agent can be used to detect the chemokine that is 
bound to a THAP-type chemokine-binding agent using a number of assay formats that are well 
known in the art. 

An exemplary method for detecting the presence (quantitative or not) or absence of a 
chemokine, including, but not limited to, a chemokine in a biological sample, involves obtaining a 
chemokine or a sample containing a chemokine and contacting it with a compound or an agent 
capable of detecting the chemokine. In some embodiments, such an agent is a THAP-type 
chemokine-binding agent. Chemokines which can be detected using a method that employs a 
THAP-type chemokine-binding agent include, but are not limited to, XCL1, XCL2, CCL1, CCL2, 
CCL3, CCL3L1, SCYA3L2, CCL4, CCL4L, CCL5, CCL6, CCL7, CCL8, SCYA9, SCYA10, 
CCL11, SCYA12, CCL13, CCL14, CCL15, CCL16, CCL17, CCL18, CCL19, CCL20, CCL21, 
CCL22, CCL23, CCL24, CCL25, CCL26, CCL27, CCL28, clone 391, CARP CC-1, CCL1, CK-1, 
regakine-1, K203, CXCL1, CXCL1P, CXCL2, CXCL3, PF4, PF4V1, CXCL5, CXCL6, PPBP, 
SPBPBP, IL8, CXCL9, CXCL10, CXCL11, CXCL12, CXCL14, CXCL15, CXCL16, NAP-4, 
LFCA-1, Scyba, JSC, VHSV-induced protein, CX3CL1 and fCLl. 

In some embodiments, the detection method comprises detecting, in a biological sample, 
such as a tissue or fluid sample from a subject (such as, a human patient), the presence or absence 
of a chemokine by contacting the biological sample with a THAP-type chemokine-binding agent 
and detecting a complex between the chemokine and the THAP-type chemokine-binding agent or 
detecting a THAP-type chemokine-binding agent which was previously bound to the chemokine but 
which has been released from the chemokine. 

In some embodiments of the present invention, the THAP-type chemokine-binding agent is 
labeled directly. In other embodiments, the THAP-type chemokine-binding agent is detected using 
a labeled antibody having affinity for the THAP-type chemokine-binding agent. Such antibodies 
may directly carry the detectable label or be recognized by a labeled second antibody. Antibodies 
can be polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., 
Fab or F(ab')2) can be used. The term "labeled", with regard to the antibody or other detectable 
molecule, is intended to encompass direct labeling of the antibody or molecule by coupling (i.e., 
physically linking) a detectable substance to the antibody or molecule, as well as indirect labeling 
of the antibody or molecule by reactivity with another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody using a fluorescently labeled secondary 
antibody and end-labeling of a THAP-type chemokine-binding agent with biotin such that it can be 
detected with fluorescently labeled streptavidin. The term "biological sample" is intended to include 
tissues, cells and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. Accordingly, the detection method can be used to detect a chemokine in a 
biological sample in vitro as well as in vivo. For example, in vitro techniques for detection of a 
chemokine include enzyme linked immunosorbent assays (ELISAs), Western blots, 
immunoprecipitations and immunofluorescence. In vivo techniques for detection of a chemokine 
include introducing into a subject a labeled THAP-type chemokine-binding agent. For example, the 
THAP-type chemokine-binding agent can be labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging techniques. 

Other aspects of the present invention relate to a system for chemokine detection. Such a 
chemokine detection system comprises a THAP-type chemokine-binding agent bound to a solid 
support. A number of adequate solid support materials are known in the art and include, but are not 
limited to, cellulose, nylon or other polymer backings, plastics such as microtiter plates, synthetic 
beads and resins such as sepharose, glass, magnetic beads, latex particles, sheep (or other animal) 
red blood cells, duracytes and others. Suitable methods for immobilizing the THAP-type 
chemokine-binding agent to the solid support are well known in the art. 

Some embodiments of the present invention relate to kits which comprise a THAP-type 
chemokine-binding agent and instructions which describe detecting or inhibiting chemokines with 
the THAP-type chemokine-binding agent. For example, the kit includes an ampule of THAP-type 
chemokine-binding agent that is stored so as to prevent damage or inactivation of the agent upon 
prolonged storage. Such methods can include, but are not limited to, lyophilization and freezing in 
an appropriate buffer. The kit also can contain chemokines to serve as a positive control sample 
when the kit is used for chemokine binding, detection or inhibition. 

In some embodiments of the present invention, kits are packaged containing a 
heterogeneous mixture of THAP-type chemokine-binding agents, wherein each of the agents has a 
different affinity for one or more chemokines. Alternatively, some kits comprise a panel of THAP- 
type chemokine-binding agents, wherein each THAP-type chemokine-binding agent has a different 
affinity for a particular chemokine. For example, the kit can comprise a panel of three THAP-type 
chemokine-binding agents, wherein the first agent has a high affinity for SLC but a low affinity for 
CXCL9, the second agent has a moderate affinity for both SLC and CXCL9, and the third agent has 
a low affinity for SLC and a high affinity for CXCL9. Panels of THAP-type chemokine-binding 
agents can be larger or smaller than that exemplified above and the number and types of chemokines 
that are detected can be more or less than that exemplified above. Kits containing such panels of 
THAP-type chemokine-binding agents can be used to reliably distinguish mixed samples of 
chemokines. Additionally, such panels can be used to bind or inhibit multiple different chemokines 
in a mixed chemokine sample. 
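
By way of a purely hypothetical illustration, the manner in which a panel of agents with differing affinities could distinguish a mixed sample can be sketched numerically. The Python sketch below assumes, for simplicity, that each agent's signal is a linear, non-saturating sum of the contributions of the two chemokines; this model, the relative affinity values and the function name are assumptions made for the example and are not data from the present description.

```python
def estimate_two_chemokines(affinities, signals):
    """Least-squares estimate of two chemokine amounts from panel signals.

    affinities: list of (affinity_for_SLC, affinity_for_CXCL9), one pair
                per agent in the panel (relative, hypothetical values)
    signals:    list of measured signals, one per agent
    Assumes signal_i = a_i * [SLC] + b_i * [CXCL9] for every agent i.
    """
    # Build the 2x2 normal equations (A^T A) x = A^T s.
    s11 = sum(a * a for a, _ in affinities)
    s12 = sum(a * b for a, b in affinities)
    s22 = sum(b * b for _, b in affinities)
    t1 = sum(a * s for (a, _), s in zip(affinities, signals))
    t2 = sum(b * s for (_, b), s in zip(affinities, signals))
    det = s11 * s22 - s12 * s12
    if abs(det) < 1e-12:
        raise ValueError("panel cannot distinguish the two chemokines")
    # Solve by Cramer's rule.
    slc = (t1 * s22 - t2 * s12) / det
    cxcl9 = (s11 * t2 - s12 * t1) / det
    return slc, cxcl9


# Three-agent panel mirroring the example above: high/low, moderate/moderate,
# low/high relative affinities (hypothetical numbers).
panel = [(0.9, 0.1), (0.5, 0.5), (0.1, 0.9)]
signals = [2.1, 2.5, 2.9]  # signals simulated for 2.0 units SLC, 3.0 units CXCL9
slc, cxcl9 = estimate_two_chemokines(panel, signals)
```

A panel in which all agents had the same affinity ratio would make the determinant vanish, which is one way of seeing why agents of differing affinities are needed to resolve a mixture.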

Having generally described this invention, a further understanding can be obtained by 
reference to certain specific examples which are provided herein for purposes of illustration only, 
and are not intended to be limiting unless otherwise specified. 



EXAMPLES 

EXAMPLE 1 

Isolation of the THAP1 cDNA in a two-hybrid screen with chemokine SLC/CCL21 

In an effort to define the function of novel HEVEC proteins and the cellular pathways 
involved, we used different baits to screen a two-hybrid cDNA library generated from 
microvascular human HEV endothelial cells (HEVEC). HEVEC were purified from human tonsils 
by immunomagnetic selection with monoclonal antibody MECA-79 as previously described (Girard 
and Springer (1995) Immunity 2:113-123). The SMART PCR cDNA library Construction Kit 
(Clontech, Palo Alto, CA, USA) was first used to generate full-length cDNAs from 1 µg HEVEC 
total RNA. Oligo-dT-primed HEVEC cDNAs were then digested with SfiI and directionally cloned 
into pGAD424-Sfi, a two-hybrid vector generated by inserting a SfiI linker (5'- 
GAATTCGGCCATTATGGCCTGCAGGATCCGGCCGCCTCGGCCCAGGATCC-3') (SEQ ID 
NO: 181) between EcoRI and BamHI cloning sites of pGAD424 (Clontech). The resulting 
pGAD424-HEVEC cDNA two-hybrid library (mean insert size > 1 kb, ~3x10^6 independent clones) 
was amplified in E. coli. To identify potential protein partners of chemokine SLC/6Ckine, 
screening of the two-hybrid HEVEC cDNA library was performed using as bait a cDNA encoding 
the mature form of human SLC/CCL21 (amino acids 24-134, GenBank Accession No: NP_002980, 
SEQ ID NO: 182), amplified by PCR from HEVEC RNA with primers hSLC.5' (5'- 
GCGGGATCCGTAGTGATGGAGGGGCTCAGGACTGTTG-3') (SEQ ID NO: 183) and 
hSLC.3' (5'-GCGGGATCCCTATGGCCCTTTAGGGGTCTGTGAGC-3') (SEQ ID NO: 184), 
digested with BamHI and inserted into the BamHI cloning site of MATCHMAKER two-hybrid 
system 2 vector pGBT9 (Clontech). Briefly, pGBT9-SLC was cotransformed with the pGAD424- 
HEVEC cDNA library in yeast strain Y190 (Clontech). 1.5x10^7 yeast transformants were screened 
and positive protein interactions were selected by His auxotrophy. The plates were incubated at 
30°C for 5 days. Plasmid DNA was extracted from positive colonies and used to verify the 
specificity of the interaction by cotransformation in AH109 with pGBT9-SLC or control baits 
pGBT9 and pGBT9-lamin. Eight independent clones isolated in this two-hybrid screen were 
characterized. They were found to correspond to a unique human cDNA encoding a novel human 
protein of 213 amino acids, designated THAP1, that exhibits 93% identity with its mouse 
orthologue (Figure 1A). The only noticeable motifs in the THAP1 predicted protein sequence were 
a short proline-rich domain in the middle part and a consensus nuclear localization sequence (NLS) 
in the carboxy terminal part (Figure 1B). Database searches with the THAP1 sequence failed to 
reveal any significant similarity to previously characterized proteins with the exception of the first 
90 amino acids that may define a novel protein motif associated with apoptosis, hereafter referred to 
as THAP domain (see Figure 1B, Figures 9A-9C, and Figure 10). 
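
As an illustrative check only, the presence of the BamHI recognition sequence in the hSLC.5' and hSLC.3' primers given above can be confirmed with a short Python sketch. The primer sequences are those recited in this example; the helper name `find_sites` and the choice to report 0-based positions are assumptions of the sketch. Because the BamHI (GGATCC) and EcoRI (GAATTC) recognition sequences are palindromic, scanning one strand is sufficient.

```python
# Recognition sequences of the two enzymes used for cloning in the examples.
SITES = {"BamHI": "GGATCC", "EcoRI": "GAATTC"}


def find_sites(primer, sites=SITES):
    """Return {enzyme: [0-based start positions]} for sites found in `primer`."""
    primer = primer.upper().replace(" ", "")
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = primer.find(motif)
        while start != -1:
            positions.append(start)
            start = primer.find(motif, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


H_SLC_5 = "GCGGGATCCGTAGTGATGGAGGGGCTCAGGACTGTTG"  # SEQ ID NO: 183
H_SLC_3 = "GCGGGATCCCTATGGCCCTTTAGGGGTCTGTGAGC"    # SEQ ID NO: 184

for name, seq in (("hSLC.5'", H_SLC_5), ("hSLC.3'", H_SLC_3)):
    print(name, find_sites(seq))
```

Both primers carry a single BamHI site near the 5' end, consistent with the BamHI digestion and insertion into the BamHI cloning site described above.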

EXAMPLE 2 
Northern Blot 

To determine the tissue distribution of THAP1 mRNA, we performed Northern blot 
analysis of 12 different adult human tissues (Fig 2). Multiple Human Tissues Northern Blots 
(CLONTECH) were hybridized according to manufacturer's instructions. The probe was a PCR 
product corresponding to the THAP1 ORF, 32P-labeled with the Prime-a-Gene Labeling System 
(PROMEGA). A 2.2-kb mRNA band was detected in brain, heart, skeletal muscle, kidney, liver, and 
placenta. In addition to the major 2.2 kb band, lower molecular weight bands were detected that are 
likely to correspond to alternative splicing or polyadenylation of the THAP1 pre-mRNA. The 
presence of THAP1 mRNAs in many different tissues suggests that THAP1 has a widespread, 
although not ubiquitous, tissue distribution in the human body. 

EXAMPLE 3 

Analysis of the subcellular THAP1 localization 

To analyze the subcellular localization of the THAP1 protein, the THAP1 cDNA was fused 
to the coding sequence of GFP (Green Fluorescent Protein). The full-length coding region of 
THAP1 was amplified by PCR from HEVEC cDNA with primers 2HMR10 (5'- 
CCGAATTCAGGATGGTGCAGTCCTGCTCCGCCT-3') (SEQ ID NO: 185) and 2HMR9 (5'- 
CGCGGATCCTGCTGGTACTTCAACTATTTCAAAGTAGTC-3') (SEQ ID NO: 186), digested 
with EcoRI and BamHI, and cloned in frame downstream of the Enhanced Green Fluorescent 
Protein (EGFP) ORF in pEGFP.C2 vector (Clontech) to generate pEGFP.C2-THAP1. The 
GFP/THAP1 expression construct was then transfected into human primary endothelial cells from 
umbilical vein (HUVEC, PromoCell, Heidelberg, Germany). HUVEC were grown in complete 
ECGM medium (PromoCell, Heidelberg, Germany), plated on coverslips and transiently 
transfected in RPMI medium using GeneJammer transfection reagent according to manufacturer's 
instructions (Stratagene, La Jolla, CA, USA). Analysis by fluorescence microscopy 24h later 
revealed that the GFP/THAP1 fusion protein localizes exclusively in the nucleus with both a diffuse 
distribution and an accumulation into speckles while GFP alone exhibits only a diffuse staining 
over the entire cell. To investigate the identity of the speckled domains with which GFP/THAP1 
associates, we used indirect immunofluorescence microscopy to examine a possible colocalization 
of the nuclear dots containing GFP/THAP1 with known nuclear domains (replication factories, 
splicing centers, nuclear bodies). 

Cells transfected with GFP-tagged expression constructs were allowed to grow for 24 h to 
48 h on coverslips. Cells were washed twice with PBS, fixed for 15 min at room temperature in 
PBS containing 3.7% formaldehyde, and washed again with PBS prior to neutralization with 50 mM 
NH4Cl in PBS for 5 min at room temperature. Following one more PBS wash, cells were 
permeabilized 5 min at room temperature in PBS containing 0.1% Triton-X100, and washed again 
with PBS. Permeabilized cells were then blocked with PBS-BSA (PBS with 1% bovine serum 
albumin) for 10 min and then incubated 2 hr at room temperature with the following primary 
antibodies diluted in PBS-BSA: rabbit polyclonal antibodies against human Daxx (1/50, M-112, 
Santa Cruz Biotechnology) or mouse monoclonal antibodies anti-PML (mouse IgG1, 1/30, mAb 
PG-M3 from Dako, Glostrup, Denmark). Cells were then washed three times 5 min at room 
temperature in PBS-BSA, and incubated for 1 hr with Cy3 (red fluorescence)-conjugated goat anti- 
mouse or anti-rabbit IgG (1/1000, Amersham Pharmacia Biotech) secondary antibodies, diluted in 
PBS-BSA. After extensive washing in PBS, samples were air dried and mounted in Mowiol. 
Images were collected on a Leica confocal laser scanning microscope. The GFP (green) and Cy3 
(red) fluorescence signals were recorded sequentially for identical image fields to avoid cross-talk 
between the channels. 

This analysis revealed that GFP-THAP1 staining exhibits a complete overlap with the 
staining pattern obtained with antibodies directed against PML. The colocalization of GFP/THAP1 
and PML was observed both in nuclei with few PML-NBs (less than ten) and in nuclei with a large 
number of PML-NBs. Indirect immunofluorescence staining with antibodies directed against Daxx, 
another well characterized component of PML-NBs, was performed to confirm the association of 
GFP/THAP1 with PML-NBs. We found a complete colocalization of GFP/THAP1 and Daxx in 
PML-NBs. Together, these results reveal that THAP1 is a novel protein associated with PML-NBs. 

EXAMPLE 4 

Identification of proteins interacting with THAP1 in human HEVECs: two-hybrid assay 
THAP1 forms a complex with the pro-apoptotic protein PAR4 

To identify potential protein partners of THAP1, screening of the two-hybrid HEVEC 
cDNA library was performed using as a bait the human THAP1 full length cDNA inserted into the 
MATCHMAKER two-hybrid system 3 vector pGBKT7 (Clontech). Briefly, the full-length coding 
region of THAP1 was amplified by PCR from HEVEC cDNA with primers 2HMR10 (5'- 
CCGAATTCAGGATGGTGCAGTCCTGCTCCGCCT-3') (SEQ ID NO: 187) and 2HMR9 (5'- 
CGCGGATCCTGCTGGTACTTCAACTATTTCAAAGTAGTC-3') (SEQ ID NO: 188), digested 
with EcoRI and BamHI, and cloned in frame downstream of the Gal4 Binding Domain (Gal4-BD) 
in pGBKT7 vector to generate pGBKT7-THAP1. pGBKT7-THAP1 was then cotransformed with 
the pGAD424-HEVEC cDNA library in yeast strain AH109 (Clontech). 1.5x10^7 yeast 
transformants were screened and positive protein interactions were selected by His and Ade double 
auxotrophy according to manufacturer's instructions (MATCHMAKER two-hybrid system 3, 
Clontech). The plates were incubated at 30°C for 5 days. Plasmid DNA was extracted from these 
positive colonies and used to verify the specificity of the interaction by cotransformation in AH109 
with pGBKT7-THAP1 or control baits pGBKT7, pGBKT7-lamin and pGBKT7-hevin. Three 
clones which specifically interacted with THAP1 were obtained in the screen; sequencing of these 
clones revealed three identical library plasmids that corresponded to a partial cDNA coding for the 
last 147 amino acids (positions 193-342) of the human pro-apoptotic protein PAR4 (Fig 3A). 
Positive interaction between THAP1 and Par4 was confirmed using full length Par4 bait (pGBKT7- 
Par4) and prey (pGADT7-Par4). Full-length human Par4 was amplified by PCR from human 
thymus cDNA (Clontech), with primers Par4.8 (5'- 
GCGGAATTCATGGCGACCGGTGGCTACCGGACC-3') (SEQ ID NO: 189) and Par4.5 (5'- 
GCGGGATCCCTCTACCTGGTCAGCTGACCCACAAC-3') (SEQ ID NO: 190), digested with 
EcoRI and BamHI, and cloned in pGBKT7 and pGADT7 vectors, to generate pGBKT7-Par4 and 
pGADT7-Par4. Positive interaction between THAP1 and Par4 was confirmed by cotransformation 
of AH109 with pGBKT7-THAP1 and pGADT7-Par4 or pGBKT7-Par4 and pGADT7-THAP1 and 
selection of transformants by His and Ade double auxotrophy according to manufacturer's 
instructions (MATCHMAKER two-hybrid system 3, Clontech). To generate pGADT7-THAP1, the 
full-length coding region of THAP1 was amplified by PCR from HEVEC cDNA with primers 
2HMR10 (5'-CCGAATTCAGGATGGTGCAGTCCTGCTCCGCCT-3') (SEQ ID NO: 191) and 
2HMR9 (5'-CGCGGATCCTGCTGGTACTTCAACTATTTCAAAGTAGTC-3') (SEQ ID NO: 
192), digested with EcoRI and BamHI, and cloned in frame downstream of the Gal-4 Activation 
Domain (Gal4-AD) in pGADT7 two-hybrid vector (Clontech). 

We then examined whether the leucine zipper/death domain at the C-terminus of Par4, 
previously shown to be involved in Par4 binding to WT-1 and aPKC, was required for the 
interaction between THAP1 and Par4. Two Par4 mutants were constructed for that purpose, Par4Δ 
and Par4DD. Par4Δ lacks the leucine zipper/death domain while Par4DD contains this domain. 
pGBKT7-Par4Δ (amino acids 1-276) and pGADT7-Par4Δ were constructed by subcloning an 
EcoRI-BglII fragment from pGADT7-Par4 into the EcoRI and BamHI sites of pGBKT7 and 
pGADT7. Par4DD (amino acids 250-342) was amplified by PCR, using pGBKT7-Par4 as 
template, with primers Par4.4 (5'- 
CGCGAATTCGCCATCATGGGGTTCCCTAGATATAACAGGGATGCAA-3') (SEQ ID NO: 
193) and Par4.5, and cloned into the EcoRI and BamHI sites of pGBKT7 and pGADT7 to obtain 
pGBKT7-Par4DD and pGADT7-Par4DD. Two-hybrid interaction between THAP1 and Par4 
mutants was tested by cotransformation of AH109 with pGBKT7-THAP1 and pGADT7-Par4Δ or 
pGADT7-Par4DD and selection of transformants by His and Ade double auxotrophy according to 
manufacturer's instructions (MATCHMAKER two-hybrid system 3, Clontech). We found that the 
Par4 leucine zipper/death domain (Par4DD) is not only required but also sufficient for the 
interaction with THAP1 (Fig 3A). Similar results were obtained when two-hybrid experiments were 
performed in the opposite orientation using Par4 or Par4 mutants (Par4Δ and Par4DD) as baits 
instead of THAP1 (Fig 3A). 

EXAMPLE 5 

In vitro THAP1/Par4 interaction assay 

To confirm the interaction observed in yeast, we performed in vitro GST pull down assays. 
Par4DD, expressed as a GST-tagged fusion protein and immobilized on glutathione sepharose, was 
incubated with radiolabeled in vitro translated THAP1. To generate the GST-Par4DD expression 
vector, Par4DD (amino acids 250-342) was amplified by PCR with primers Par4.10 (5'- 
GCCGGATCCGGGTTCCCTAGATATAACAGGGATGCAA-3') (SEQ ID NO: 194) and Par4.5, 
and cloned in frame downstream of the Glutathione S-Transferase ORF, into the BamHI site of the 
pGEX-2T prokaryotic expression vector (Amersham Pharmacia Biotech, Saclay, France). GST- 
Par4DD (amino acids 250-342) fusion protein encoded by plasmid pGEX-2T-Par4DD and control 
GST protein encoded by plasmid pGEX-2T were then expressed in E. coli DH5α and purified by 
affinity chromatography with glutathione sepharose according to supplier's instructions (Amersham 
Pharmacia Biotech). The yield of proteins used in GST pull-down assays was determined by SDS- 
Polyacrylamide Gel Electrophoresis (PAGE) and Coomassie blue staining analysis. In vitro- 
translated THAP1 was generated with the TNT-coupled reticulocyte lysate system (Promega, 
Madison, WI, USA) using pGBKT7-THAP1 vector as template. 25 µl of 35S-labelled wild-type 
THAP1 was incubated with immobilized GST-Par4 or GST proteins overnight at 4 °C, in the 
following binding buffer: 10 mM NaPO4 pH 8.0, 140 mM NaCl, 3 mM MgCl2, 1 mM 
dithiothreitol (DTT), 0.05% NP40, 0.2 mM phenylmethyl sulphonyl fluoride (PMSF), 1 mM 
Na vanadate, 50 mM β-glycerophosphate, 25 µg/ml chymotrypsin, 5 µg/ml aprotinin, and 10 µg/ml 
leupeptin. Beads were then washed 5 times in 1 ml binding buffer. Bound proteins were eluted 
with 2X Laemmli SDS-PAGE sample buffer, fractionated by 10% SDS-PAGE and visualized by 
fluorography using Amplify (Amersham Pharmacia Biotech). As expected, GST/Par4DD interacted 
with THAP1 (Fig 3B). In contrast, THAP1 failed to interact with GST beads. 

EXAMPLE 6 
In vivo THAP1/Par4 interaction assay 

To provide further evidence for a physiological interaction between THAP1 and Par4, 
in vivo interactions between THAP1 and Par4 were investigated. For that purpose, confocal 
immunofluorescence microscopy was used to analyze the subcellular localization of epitope-tagged 
Par4DD in primary human endothelial cells transiently cotransfected with pEF-mycPar4DD 
eukaryotic expression vector and GFP or GFP-THAP1 expression vectors (pEGFP.C2 and 
pEGFP.C2-THAP1, respectively). To generate pEF-mycPar4DD, mycPar4DD (amino acids 250- 
342) was amplified by PCR using pGBKT7-Par4DD as template, with primers myc.BD7 (5'- 
GCGCTCTAGAGCCATCATGGAGGAGCAGAAGCTGATC-3') (SEQ ID NO: 195) and Par4.9 
(5'-CTTGCGGCCGCCTCTACCTGGTCAGCTGACCCACAAC-3') (SEQ ID NO: 196), and 
cloned into the XbaI and NotI sites of the pEF-BOS expression vector (Mizushima and Nagata, 
Nucleic Acids Research, 18:5322, 1990). Primary human endothelial cells from umbilical vein 
(HUVEC, PromoCell, Heidelberg, Germany) were grown in complete ECGM medium (PromoCell, 
Heidelberg, Germany), plated on coverslips and transiently transfected in RPMI medium using 
GeneJammer transfection reagent according to manufacturer's instructions (Stratagene, La Jolla, CA, 
USA). Cells co-transfected with pEF-mycPar4DD and GFP-tagged expression constructs were 
allowed to grow for 24 h to 48 h on coverslips. Cells were washed twice with PBS, fixed for 15 
min at room temperature in PBS containing 3.7% formaldehyde, and washed again with PBS prior 
to neutralization with 50 mM NH4Cl in PBS for 5 min at room temperature. Following one more 
PBS wash, cells were permeabilized 5 min at room temperature in PBS containing 0.1% Triton- 
X100, and washed again with PBS. Permeabilized cells were then blocked with PBS-BSA (PBS 
with 1% bovine serum albumin) for 10 min and then incubated 2 hr at room temperature with 
mouse monoclonal antibody anti-myc epitope (mouse IgG1, 1/200, Clontech) diluted in PBS-BSA. 
Cells were then washed three times 5 min at room temperature in PBS-BSA, and incubated for 1 hr 
with Cy3 (red fluorescence)-conjugated goat anti-mouse (1/1000, Amersham Pharmacia Biotech) 
secondary antibodies, diluted in PBS-BSA. After extensive washing in PBS, samples were air dried 
and mounted in Mowiol. Images were collected on a Leica confocal laser scanning microscope. 
The GFP (green) and Cy3 (red) fluorescence signals were recorded sequentially for identical image 
fields to avoid cross-talk between the channels. 

In cells transiently co-transfected with pEF-mycPar4DD and GFP expression vector,
ectopically expressed myc-Par4DD was found to accumulate both in the cytoplasm and the nucleus
of the majority of the cells. In contrast, transient cotransfection of pEF-mycPar4DD and
GFP-THAP1 expression vectors dramatically shifted myc-Par4DD from a diffuse cytosolic and
nuclear localization to a preferential association with PML-NBs. The effect of GFP-THAP1 on
myc-Par4DD localization was specific since it was not observed with GFP-APS kinase-1 (APSK-1), a
nuclear enzyme unrelated to THAP1 and apoptosis [Besset et al., Faseb J, 14:345-354, 2000]. This
latter result shows that GFP-THAP1 recruits myc-Par4DD to PML-NBs and provides in vivo
evidence for a direct interaction of THAP1 with the pro-apoptotic protein Par4.
EXAMPLE 7

Identification of a novel arginine-rich Par4 binding motif 
To identify the sequences mediating THAP1 binding to Par4, a series of THAP1 deletion
constructs was generated. Both amino-terminal (THAP1-C1, -C2, -C3) and carboxy-terminal
(THAP1-N1, -N2, -N3) deletion mutants (Figure 4A) were amplified by PCR using plasmid
pEGFP.C2-THAP1 as a template and the following primers:

2HMR12 (5'-GCGGAATTCAAAGAAGATCTTCTGGAGCCACAGGAAC-3') (SEQ ID NO: 197)
and 2HMR9 (5'-CGCGGATCCTGCTGGTACTTCAACTATTTCAAAGTAGTC-3') (SEQ ID
NO: 198) for THAP1-C1 (amino acids 90-213);

PAPM2 (5'-GCGGAATTCATGCCGCCTCTTCAGACCCCTGTTAA-3') (SEQ ID NO: 199)
and 2HMR9 for THAP1-C2 (amino acids 120-213);

PAPM3 (5'-GCGGAATTCATGCACCAGCGGAAAAGGATTCATCAG-3') (SEQ ID NO: 200)
and 2HMR9 for THAP1-C3 (amino acids 143-213);

2HMR10 (5'-CCGAATTCAGGATGGTGCAGTCCTGCTCCGCCT-3') (SEQ ID NO: 201)
and 2HMR17 (5'-GCGGGATCCCTTGTCATGTGGCTCAGTACAAAGAAATAT-3') (SEQ ID
NO: 202) for THAP1-N1 (amino acids 1-90);

2HMR10 and PAPN2 (5'-CGGGATCCTGTGCGGTCTTGAGCTTCTTTCTGAGO ') (SEQ ID
NO: 203) for THAP1-N2 (amino acids 1-166); and

2HMR10 and PAPN3 (5'-GCGGGATCCGTCGTCTTTCTCTTTCTGGAAGTGAAC-3') (SEQ ID
NO: 204) for THAP1-N3 (amino acids 1-192).

The PCR fragments thus obtained were digested with EcoRI and BamHI, and cloned in
frame downstream of the Gal4 Binding Domain (Gal4-BD) in pGBKT7 two-hybrid vector
(Clontech) to generate pGBKT7-THAP1-C1, -C2, -C3, -N1, -N2 or -N3, or downstream of the
Enhanced Green Fluorescent Protein (EGFP) ORF in pEGFP.C2 vector (Clontech) to generate
pEGFP.C2-THAP1-C1, -C2, -C3, -N1, -N2 or -N3.

Two-hybrid interaction between THAP1 mutants and Par4DD was tested by
cotransformation of AH109 with pGBKT7-THAP1-C1, -C2, -C3, -N1, -N2 or -N3 and
pGADT7-Par4DD and selection of transformants by His and Ade double auxotrophy according to
manufacturer's instructions (MATCHMAKER two-hybrid system 3, Clontech). Positive two-hybrid
interaction with Par4DD was observed with mutants THAP1-C1, -C2, -C3 and -N3 but not with
mutants THAP1-N1 and -N2, suggesting the Par4 binding site is found between THAP1 residues
143 and 192.
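
The mapping logic above can be sketched as a small interval computation. This is only an illustration, not part of the described protocol: the fragment coordinates and two-hybrid outcomes are taken from the text, while the function name `minimal_binding_region` and the simplifying assumption that binding requires one contiguous region present in full are hypothetical.

```python
# THAP1 deletion mutants: amino acid coordinates and two-hybrid outcome
# with Par4DD, as reported in the text.
FRAGMENTS = {
    "THAP1-C1": ((90, 213), True),
    "THAP1-C2": ((120, 213), True),
    "THAP1-C3": ((143, 213), True),
    "THAP1-N1": ((1, 90), False),
    "THAP1-N2": ((1, 166), False),
    "THAP1-N3": ((1, 192), True),
}


def minimal_binding_region(fragments):
    """Return the residue interval shared by every interacting fragment.

    Assumption (hypothetical): binding needs a single contiguous region
    that must be present in full.
    """
    binders = [span for span, binds in fragments.values() if binds]
    start = max(s for s, _ in binders)
    end = min(e for _, e in binders)
    if start > end:
        raise ValueError("interacting fragments share no common region")
    # Consistency check: a non-interacting fragment must not contain
    # the whole candidate region.
    for name, ((s, e), binds) in fragments.items():
        if not binds and s <= start and e >= end:
            raise ValueError(f"{name} contains the region but does not bind")
    return start, end


region = minimal_binding_region(FRAGMENTS)
print(f"Candidate Par4 binding region: residues {region[0]}-{region[1]}")
```

Under these assumptions the shared interval of the four interacting fragments is the same 143-192 window stated above, and neither non-interacting fragment spans it.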

THAP1 mutants were also tested in the in vitro THAP1/Par4 interaction assay. In
vitro-translated THAP1 mutants were generated with the TNT-coupled reticulocyte lysate system
(Promega, Madison, WI, USA) using pGBKT7-THAP1-C1, -C2, -C3, -N1, -N2 or -N3 vector as
template. 25 µl of each 35S-labelled THAP1 mutant was incubated with immobilized GST or
GST-Par4 protein overnight at 4 °C, in the following binding buffer: 10 mM NaPO4 pH 8.0, 140 mM
NaCl, 3 mM MgCl2, 1 mM dithiothreitol (DTT), 0.05% NP40, 0.2 mM phenylmethyl
sulphonyl fluoride (PMSF), 1 mM Na vanadate, 50 mM β-glycerophosphate, 25 µg/ml
chymotrypsin, 5 µg/ml aprotinin and 10 µg/ml leupeptin. Beads were then washed 5 times in 1 ml
binding buffer. Bound proteins were eluted with 2X Laemmli SDS-PAGE sample buffer,
fractionated by 10% SDS-PAGE and visualized by fluorography using Amplify (Amersham
Pharmacia Biotech). As expected, THAP1-C1, -C2, -C3 and -N3 interacted with GST/Par4DD
(Figure 4B). In contrast, THAP1-N1 and -N2 failed to interact with GST/Par4DD beads.

Finally, Par4 binding activity of THAP1 mutants was also analyzed by the in vivo
THAP1/Par4 interaction assay as described in Example 6 using pEF-mycPar4DD and
pEGFP.C2-THAP1-C1, -C2, -C3, -N1, -N2 or -N3 expression vectors.

Essentially identical results were obtained with the three THAP1/Par4 interaction assays
(Figure 4A). That is, the Par4 binding site was found between residues 143 and 192 of human
THAP1. Comparison of this region with the Par4 binding domain of mouse ZIP kinase, another
Par4-interacting protein, revealed the existence of a conserved arginine-rich sequence motif (SEQ
ID NOs: 205, 263 and 15) that may correspond to the Par4 binding site (Figure 5A). Mutations in
this arginine-rich sequence motif were generated by site-directed mutagenesis. Two novel
THAP1 mutants, THAP1 RR/AA (replacement of residues R171A and R172A) and
THAP1 ΔQRCRR (deletion of residues 168-172), were generated by two successive rounds of PCR
using pEGFP.C2-THAP1 as template and primers 2HMR10 and 2HMR9 together with primers

RR/AA-1 (5'-CCGCACAGCAGCGATGCGCTGCTCAAGAACGGCAGCTTG-3') (SEQ ID NO: 206) and

RR/AA-2 (5'-CAAGCTGCCGTTCTTGAGCAGCGCATCGCTGCTGTGCGG-3') (SEQ ID NO: 207)
for mutant THAP1 RR/AA, or primers

ΔRR-1 (5'-GCTCAAGACCGCACAGCAAGAACGGCAGCTTG-3') (SEQ ID NO: 208) and

ΔRR-2 (5'-CAAGCTGCCGTTCTTGCTGTGCGGTCTTGAGC-3') (SEQ ID NO: 209)
for mutant THAP1 ΔQRCRR.

The resulting PCR fragments were digested with EcoRI and BamHI, and cloned
in frame downstream of the Gal4 Binding Domain (Gal4-BD) in pGBKT7 two-hybrid vector
(Clontech) to generate pGBKT7-THAP1-RR/AA and -Δ(QRCRR), or downstream of the Enhanced
Green Fluorescent Protein (EGFP) ORF in pEGFP.C2 vector (Clontech) to generate
pEGFP.C2-THAP1-RR/AA and -Δ(QRCRR). The THAP1 RR/AA and THAP1 ΔQRCRR mutants were
then tested in the three THAP1/Par4 interaction assays (two-hybrid assay, in vitro THAP1/Par4
interaction assay, in vivo THAP1/Par4 interaction assay) as described above for the THAP1-C1,
-C2, -C3, -N1, -N2 or -N3 mutants. This analysis revealed that the two mutants were deficient for
interaction with Par4 in all three assays (Figure 5B), indicating that the arginine-rich
sequence motif we have identified is a novel Par4 binding motif.
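
Mutagenic primer pairs of the kind listed above are normally designed as exact reverse complements of one another, so a quick consistency check on the transcribed sequences can be sketched as follows. This is an illustrative check only: the sequences are copied from the text, while the variable names and the helper functions `reverse_complement` and `is_complementary_pair` are hypothetical.

```python
# Primer sequences copied from the text (written 5' to 3').
RR_AA_1 = "CCGCACAGCAGCGATGCGCTGCTCAAGAACGGCAGCTTG"  # SEQ ID NO: 206
RR_AA_2 = "CAAGCTGCCGTTCTTGAGCAGCGCATCGCTGCTGTGCGG"  # SEQ ID NO: 207
DEL_1 = "GCTCAAGACCGCACAGCAAGAACGGCAGCTTG"          # SEQ ID NO: 208
DEL_2 = "CAAGCTGCCGTTCTTGCTGTGCGGTCTTGAGC"          # SEQ ID NO: 209

# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_complementary_pair(forward: str, reverse: str) -> bool:
    """True if the two primers anneal to each other over their full length."""
    return reverse_complement(forward) == reverse.upper()


for name, fwd, rev in [("RR/AA", RR_AA_1, RR_AA_2), ("deletion", DEL_1, DEL_2)]:
    status = "complementary" if is_complementary_pair(fwd, rev) else "MISMATCH"
    print(f"{name} primer pair: {status}")
```

A mismatch reported by such a check would point to a transcription error in one of the two sequences rather than to a design feature.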
EXAMPLE 8

PAR4 is a novel component of PML-NBs that colocalizes with THAP1 in vivo
We then wished to determine if PAR4 colocalizes with THAP1 in vivo in order to provide
further evidence for a physiological interaction between THAP1 and PAR4. We first analyzed Par4
subcellular localization in primary human endothelial cells. Confocal immunofluorescence
microscopy using affinity-purified anti-PAR4 antibodies (Sells et al., 1997; Guo et al., 1998) was
performed on HUVEC endothelial cells fixed with methanol/acetone, which makes PML-NB
components accessible to antibodies (Sternsdorf et al., 1997). Cells were fixed in methanol for 5
min at -20°C, followed by incubation in cold acetone at -20°C for 30 sec. Permeabilized cells were
then blocked with PBS-BSA (PBS with 1% bovine serum albumin) for 10 min and then incubated for
2 hr at room temperature with rabbit polyclonal antibodies against human Par4 (1/50, R-334, Santa
Cruz Biotechnology, Santa Cruz, CA, USA) and mouse monoclonal antibody anti-PML (mouse
IgG1, 1/30, mAb PG-M3 from Dako, Glostrup, Denmark). Cells were then washed three times for 5
min at room temperature in PBS-BSA, and incubated for 1 hr with Cy3 (red fluorescence)-conjugated
goat anti-rabbit IgG (1/1000, Amersham Pharmacia Biotech) and FITC-labeled goat
anti-mouse IgG (1/40, Zymed Laboratories Inc., San Francisco, CA, USA) secondary antibodies,
diluted in PBS-BSA. After extensive washing in PBS, samples were air dried and mounted in
Mowiol. Images were collected on a Leica confocal laser scanning microscope. The FITC (green)
and Cy3 (red) fluorescence signals were recorded sequentially for identical image fields to avoid
cross-talk between the channels. This analysis showed an association of PAR4 immunoreactivity
with nuclear dot-like structures, in addition to diffuse nucleoplasmic and cytoplasmic staining.
Double immunostaining with anti-PML antibodies revealed that the PAR4 foci colocalize perfectly
with PML-NBs in cell nuclei. Colocalization of Par4 with GFP-THAP1 in PML-NBs was analyzed
in transfected HUVEC cells expressing ectopic GFP-THAP1. HUVEC were grown in complete
ECGM medium (PromoCell, Heidelberg, Germany), plated on coverslips and transiently
transfected with GFP/THAP1 expression construct (pEGFP.C2-THAP1) in RPMI medium using
GeneJammer transfection reagent according to the manufacturer's instructions (Stratagene, La
Jolla, CA, USA). Analysis of transfected cells by indirect immunofluorescence microscopy 24 h
later, with anti-Par4 rabbit antibodies, revealed that all endogenous PAR4 foci colocalize with
ectopic GFP-THAP1 in PML-NBs, further confirming the association of the THAP1/PAR4 complex
with PML-NBs in vivo.

EXAMPLE 9

PML recruits the THAP1/PAR4 complex to PML-NBs 
Since it has been shown that PML plays a critical role in the assembly of PML-NBs by
recruiting other components, we next wanted to determine whether PML plays a role in the
recruitment of the THAP1/PAR4 complex to PML-NBs. For this purpose, we made use of the
observation that neither endogenous PAR4 nor ectopic GFP-THAP1 accumulates in PML-NBs
in human Hela cells. Expression vectors for GFP-THAP1 and HA-PML (or HA-SP100) were
cotransfected into these cells and the localization of endogenous PAR4, GFP-THAP1 and HA-PML
(or HA-SP100) was analyzed by triple staining confocal microscopy.

Human Hela cells (ATCC) were grown in Dulbecco's Modified Eagle's Medium
supplemented with 10% Fetal Calf Serum and 1% Penicillin-streptomycin (all from Life
Technologies, Grand Island, NY, USA), plated on coverslips, and transiently transfected by the
calcium phosphate method using 2 µg pEGFP.C2-THAP1 and pcDNA.3-HA-PML3 or
pSG5-HA-Sp100 (a gift from Dr Dejean, Institut Pasteur, Paris, France) plasmid DNA.
pcDNA.3-HA-PML3 was constructed by sub-cloning a BglII-BamHI fragment from pGADT7-HA-PML3
into the BamHI site of pcDNA3 expression vector (Invitrogen, San Diego, CA, USA). To generate
pGADT7-HA-PML3, PML3 ORF was amplified by PCR, using pACT2-PML3 (a gift from Dr De
The, Paris, France) as template, with primers

PML-1 (5'-GCGGGATCCCTAAATTAGAAAGGGGTGGGGGTAGCC-3') (SEQ ID NO: 210)
and

PML-2 (5'-GCGGAATTCATGGAGCCTGCACCCGCCCGATC-3') (SEQ ID NO: 211), and

cloned into the EcoRI and BamHI sites of pGADT7.

Hela cells transfected with GFP-tagged and HA-tagged expression constructs were allowed
to grow for 24 h to 48 h on coverslips. Cells were washed twice with PBS, fixed in methanol for 5
min at -20°C, followed by incubation in cold acetone at -20°C for 30 sec. Permeabilized cells were
then blocked with PBS-BSA (PBS with 1% bovine serum albumin) for 10 min and then incubated for
2 hr at room temperature with the following primary antibodies diluted in PBS-BSA: rabbit
polyclonal antibodies against human Par4 (1/50, R-334, Santa Cruz Biotechnology, Santa Cruz,
CA, USA) and mouse monoclonal antibody anti-HA tag (mouse IgG1, 1/1000, mAb 16B12 from
BabCO, Richmond, CA, USA). Cells were then washed three times for 5 min at room temperature in
PBS-BSA, and incubated for 1 hr with Cy3 (red fluorescence)-conjugated goat anti-rabbit IgG
(1/1000, Amersham Pharmacia Biotech) and Alexa Fluor-633 (blue fluorescence) goat anti-mouse
IgG conjugate (1/100, Molecular Probes, Eugene, OR, USA) secondary antibodies, diluted in
PBS-BSA. After extensive washing in PBS, samples were air dried and mounted in Mowiol. Images
were collected on a Leica confocal laser scanning microscope. The GFP (green), Cy3 (red) and
Alexa 633 (blue) fluorescence signals were recorded sequentially for identical image fields to avoid
cross-talk between the channels.

In Hela cells transfected with HA-PML, endogenous PAR4 and GFP-THAP1 were 
recruited to PML-NBs, whereas in cells transfected with HA-SP100, both PAR4 and GFP-THAP1 
exhibited diffuse staining without accumulation in PML-NBs. These findings indicate that 
recruitment of the THAP1/PAR4 complex to PML-NBs depends on PML but not SP100. 

EXAMPLE 10 
THAP1 is an apoptosis inducing polypeptide
THAP1 is a novel proapoptotic factor 
Since PML and PML-NBs have been linked to regulation of cell death and PAR4 is a well
established pro-apoptotic factor, we examined whether THAP1 can modulate cell survival. Mouse
3T3 cells, which have previously been used to analyze the pro-apoptotic activity of PAR4
(Diaz-Meco et al., 1996; Berra et al., 1997), were transfected with expression vectors for GFP-THAP1,
GFP-PAR4 and, as a negative control, GFP-APS kinase-1 (APSK-1), a nuclear enzyme unrelated to
THAP1 and apoptosis (Girard et al., 1998; Besset et al., 2000). We then determined whether ectopic
expression of THAP1 enhances the apoptotic response to serum withdrawal. Transfected cells were
deprived of serum for up to twenty-four hours and cells with apoptotic nuclei, as revealed by DAPI
staining and in situ TUNEL assay, were counted.

Cell death assays: Mouse 3T3-TO fibroblasts were seeded on coverslips in 12-well plates
at 40 to 50% confluency and transiently transfected with GFP or GFP-fusion protein expression
vectors using Lipofectamine Plus reagent (Life Technologies) according to supplier's instructions.
After 6 h at 37°C, the DNA-lipid mixture was removed and the cells were allowed to recover in
complete medium for 24 h. Serum starvation of transiently transfected cells was induced by
changing the medium to 0% serum, and the number of GFP-positive apoptotic cells was assessed 24
h after induction of serum starvation. Cells were fixed in PBS containing 3.7% formaldehyde and
permeabilized with 0.1% Triton-X100 as described under immunofluorescence, and apoptosis was
scored by in situ TUNEL (terminal deoxynucleotidyl transferase-mediated dUTP nick end labeling)
and/or DAPI (4,6-Diamidino-2-phenylindole) staining of apoptotic nuclei exhibiting nuclear
condensation. The TUNEL reaction was performed for 1 hr at 37°C using the in situ cell death
detection kit, TMR red (Roche Diagnostics, Meylan, France). DAPI staining with a final
concentration of 0.2 µg/ml was performed for 10 min at room temperature. At least 100 cells were
scored for each experimental point using a fluorescence microscope.

Basal levels of apoptosis in the presence of serum ranged from 1-3%. Twenty-four hours
after serum withdrawal, apoptosis was found in 18% of untransfected 3T3 cells and in 3T3 cells
overexpressing GFP-APSK-1. Levels of serum withdrawal-induced apoptosis were significantly
increased to about 70% and 65% in cells overexpressing GFP-PAR4 and GFP-THAP1, respectively
(Figure 6A). These results demonstrate that THAP1, similarly to PAR4, is an apoptosis inducing
polypeptide.

TNFα-induced apoptosis assays were performed by incubating transiently transfected cells
in complete medium containing 30 ng/ml of mTNFα (R & D, Minneapolis, MN, USA) for 24 h.
Apoptosis was scored as described for serum withdrawal-induced apoptosis. As shown in
Figure 6B, THAP1 induced apoptosis.

EXAMPLE 11 

The THAP domain is essential for THAP1 pro-apoptotic activity

To determine the role of the amino-terminal THAP domain (amino acids 1 to 89) in the
functional activity of THAP1, we generated a THAP1 mutant that is deleted of the THAP domain
(THAP1ΔTHAP). THAP1ΔTHAP (amino acids 90-213) was amplified by PCR, using
pEGFP.C2-THAP1 as template, with primers 2HMR12
(5'-GCGGAATTCAAAGAAGATCTTCTGGAGCCACAGGAAC-3') (SEQ ID NO: 212) and 2HMR9
(5'-CGCGGATCCTGCTGGTACTTCAACTATTTCAAAGTAGTC-3') (SEQ ID NO: 213),
digested with EcoRI and BamHI, and cloned in pGBKT7 and pEGFP.C2 vectors, to generate
pGBKT7-THAP1ΔTHAP and pEGFP.C2-THAP1ΔTHAP expression vectors. The role of the
THAP domain in PML-NB localization, binding to Par4, or pro-apoptotic activity of THAP1 was
then analyzed.

To analyze the subcellular localization of THAP1ΔTHAP, the GFP/THAP1ΔTHAP
expression construct was transfected into human primary endothelial cells from umbilical vein
(HUVEC, PromoCell, Heidelberg, Germany). HUVEC were grown in complete ECGM medium
(PromoCell, Heidelberg, Germany), plated on coverslips and transiently transfected in RPMI
medium using GeneJammer transfection reagent according to the manufacturer's instructions
(Stratagene, La Jolla, CA, USA). Transfected cells were allowed to grow for 48 h on coverslips.
Cells were then washed twice with PBS, fixed for 15 min at room temperature in PBS containing
3.7% formaldehyde, and washed again with PBS prior to neutralization with 50 mM NH4Cl in PBS
for 5 min at room temperature. Following one more PBS wash, cells were permeabilized for 5 min at
room temperature in PBS containing 0.1% Triton-X100, and washed again with PBS.
Permeabilized cells were then blocked with PBS-BSA (PBS with 1% bovine serum albumin) for
10 min and then incubated for 2 hr at room temperature with mouse monoclonal antibody
anti-PML (mouse IgG1, 1/30, mAb PG-M3 from Dako, Glostrup, Denmark) diluted in PBS-BSA. Cells
were then washed three times for 5 min at room temperature in PBS-BSA, and incubated for 1 hr
with Cy3 (red fluorescence)-conjugated goat anti-mouse IgG (1/1000, Amersham Pharmacia Biotech)
secondary antibodies, diluted in PBS-BSA. After extensive washing in PBS, samples were air dried
and mounted in Mowiol. Images were collected on a Leica confocal laser scanning microscope.
The GFP (green) and Cy3 (red) fluorescence signals were recorded sequentially for identical image
fields to avoid cross-talk between the channels.

This analysis revealed that GFP-THAP1ΔTHAP staining exhibits a complete overlap with
the staining pattern obtained with antibodies directed against PML, indicating the THAP domain is
not required for THAP1 localization to PML-NBs.

To examine the role of the THAP domain in binding to Par4, we performed in vitro GST
pull down assays. Par4DD, expressed as a GST-tagged fusion protein and immobilized on
glutathione sepharose, was incubated with radiolabeled in vitro translated THAP1ΔTHAP. In
vitro-translated THAP1ΔTHAP was generated with the TNT-coupled reticulocyte lysate system
(Promega, Madison, WI, USA) using pGBKT7-THAP1ΔTHAP vector as template. 25 µl of
35S-labelled THAP1ΔTHAP was incubated with immobilized GST-Par4 or GST proteins overnight at
4 °C, in the following binding buffer: 10 mM NaPO4 pH 8.0, 140 mM NaCl, 3 mM MgCl2, 1 mM
dithiothreitol (DTT), 0.05% NP40, 0.2 mM phenylmethyl sulphonyl fluoride (PMSF), 1 mM
Na vanadate, 50 mM β-glycerophosphate, 25 µg/ml chymotrypsin, 5 µg/ml aprotinin and 10 µg/ml
leupeptin. Beads were then washed 5 times in 1 ml binding buffer. Bound proteins were eluted
with 2X Laemmli SDS-PAGE sample buffer, fractionated by 10% SDS-PAGE and visualized by
fluorography using Amplify (Amersham Pharmacia Biotech).

This analysis revealed that THAP1ΔTHAP interacts with GST/Par4DD, indicating that the
THAP domain is not involved in THAP1/Par4 interaction (Figure 7A).

To examine the role of the THAP domain in THAP1 pro-apoptotic activity, we performed
cell death assays in mouse 3T3 cells. Mouse 3T3-TO fibroblasts were seeded on coverslips in
12-well plates at 40 to 50% confluency and transiently transfected with GFP-APSK1, GFP-THAP1 or
GFP-THAP1ΔTHAP fusion protein expression vectors using Lipofectamine Plus reagent (Life
Technologies) according to supplier's instructions. After 6 h at 37°C, the DNA-lipid mixture was
removed and the cells were allowed to recover in complete medium for 24 h. Serum starvation of
transiently transfected cells was induced by changing the medium to 0% serum, and the number of
GFP-positive apoptotic cells was assessed 24 h after induction of serum starvation. Cells were
fixed in PBS containing 3.7% formaldehyde and permeabilized with 0.1% Triton-X100 as
described under immunofluorescence, and apoptosis was scored by in situ TUNEL (terminal
deoxynucleotidyl transferase-mediated dUTP nick end labeling) and/or DAPI
(4,6-Diamidino-2-phenylindole) staining of apoptotic nuclei exhibiting nuclear condensation. The
TUNEL reaction was performed for 1 hr at 37°C using the in situ cell death detection kit, TMR red
(Roche Diagnostics, Meylan, France). DAPI staining with a final concentration of 0.2 µg/ml was
performed for 10 min at room temperature. At least 100 cells were scored for each experimental
point using a fluorescence microscope.

Twenty-four hours after serum withdrawal, apoptosis was found in 18% of untransfected
3T3 cells and in 3T3 cells overexpressing GFP-APSK-1. Levels of serum withdrawal-induced
apoptosis were significantly increased to about 70% in cells overexpressing GFP-THAP1. Deletion
of the THAP domain abrogated most of this effect since serum-withdrawal-induced apoptosis was
reduced to 28% in cells overexpressing GFP-THAP1ΔTHAP (Figure 7B). These results indicate
that the THAP domain, although not required for THAP1 PML-NB localization and Par4 binding,
is essential for THAP1 pro-apoptotic activity.
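
The statement that deletion of the THAP domain removes most of the effect can be made concrete with a short calculation. The sketch below is illustrative only: the three percentages are the values reported above, whereas the function name `fraction_of_effect_lost` and the choice of the 18% control level as baseline are assumptions made for this example.

```python
def fraction_of_effect_lost(control: float, wild_type: float, mutant: float) -> float:
    """Share of the wild-type increase over control that the mutant loses.

    All arguments are percentages of apoptotic cells. The control level is
    used as the baseline (an assumption of this sketch).
    """
    gain = wild_type - control
    if gain <= 0:
        raise ValueError("wild type shows no increase over control")
    return (wild_type - mutant) / gain


# Percent apoptotic cells 24 h after serum withdrawal, from the text.
CONTROL = 18.0      # untransfected or GFP-APSK-1
THAP1_WT = 70.0     # GFP-THAP1
THAP1_DELTA = 28.0  # GFP-THAP1 lacking the THAP domain

lost = fraction_of_effect_lost(CONTROL, THAP1_WT, THAP1_DELTA)
print(f"Fraction of the THAP1-dependent increase lost: {lost:.0%}")
```

With these figures, 42 of the 52 percentage points gained by wild-type THAP1 over control are lost in the deletion mutant, that is, roughly four fifths of the effect.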

EXAMPLE 12 

The THAP domain defines a novel family of proteins, the THAP family 
To discover novel human proteins homologous to THAP1 and/or containing THAP
domains, GenBank non-redundant, human EST and draft human genome databases at the National
Center for Biotechnology Information (www.ncbi.nlm.nih.gov) were searched with both the
nucleotide and amino acid sequences of THAP1, using the programs BLASTN, TBLASTN and
BLASTP (Altschul, S. F., Gish, W., Miller, W., Myers, E. W. and Lipman, D. J. (1990). Basic local
alignment search tool. J Mol Biol 215: 403-410). This initial step enabled us to identify 12 distinct
human THAP-containing proteins (hTHAP0 to hTHAP11; Figure 8). In the case of the partial
length sequences, assembly of overlapping ESTs together with GENESCAN (Burge, C. and Karlin,
S. (1997). Prediction of complete gene structures in human genomic DNA. J Mol Biol 268: 78-94)
and GENEWISE (Jareborg, N., Birney, E. and Durbin, R. (1999). Comparative analysis of
noncoding regions of 77 orthologous mouse and human gene pairs. Genome Res 9: 815-824) gene
predictions on the corresponding genomic DNA clones was used to define the full length human
THAP proteins as well as their corresponding cDNAs and genes. CLUSTALW (Higgins, D. G.,
Thompson, J. D. and Gibson, T. J. (1996). Using CLUSTAL for multiple sequence alignments.
Methods Enzymol 266: 383-402) was used to carry out the alignment of the 12 human THAP
domains with the DNA binding domain of Drosophila P-element transposase (Lee, C. C., Beall, E.
L., and Rio, D. C. (1998) Embo J. 17:4166-74), which was colored using the computer program
Boxshade (www.ch.embnet.org/software/BOX_form.html) (see Figures 9A and 9B). An approach
equivalent to the one described above was used to identify the mouse, rat, pig, and various
other orthologs of the human THAP proteins (Figure 9C). Altogether, the in silico and
experimental approaches led to the discovery of 12 distinct human members (hTHAP0 to
hTHAP11) of the THAP family of pro-apoptotic factors (Figure 8).

EXAMPLE 13 
THAP2 and THAP3 interact with Par-4 
To assess whether THAP2 and THAP3 are able to interact with Par-4, yeast two hybrid
assays using Par-4 wild type bait (Figure 10B) and in vitro GST pull down assays (Figure 10C)
were performed as described above (Examples 4 and 5). As shown in Figures 10B and 10C,
THAP2 and THAP3 are able to interact with Par-4. A sequence alignment showing the comparison
of the THAP domain and the PAR4-binding domain between THAP1, THAP2 and THAP3 is
shown in Figure 10A.

EXAMPLE 14 
THAP2 and THAP3 are able to induce apoptosis 
Serum withdrawal-induced or TNFα-induced apoptosis analyses were performed as described
above (Example 10) in cells transfected with GFP-APSK1, GFP-THAP2 or GFP-THAP3 expression
vectors. Apoptosis was quantified by DAPI staining of apoptotic nuclei 24 hours after serum
withdrawal or addition of TNFα. The results are shown in Figure 11A (serum withdrawal) and
Figure 11B (TNFα). These results indicate that THAP2 and THAP3 induce apoptosis.

EXAMPLE 15 

Identification of the SLC/CCL21 chemokine-binding domain of human THAP1
To identify the SLC/CCL21 chemokine-binding domain of human THAP1, a series of
THAP1 deletion constructs was generated as described in Example 7.

Two-hybrid interaction between THAP1 mutants and chemokine SLC/CCL21 was tested
by cotransformation of AH109 with pGADT7-THAP1-C1, -C2, -C3, -N1, -N2 or -N3 and
pGBKT7-SLC/CCL21 and selection of transformants by His and Ade double auxotrophy according
to manufacturer's instructions (MATCHMAKER two-hybrid system 3, Clontech). The
pGBKT7-SLC/CCL21 vector was generated by subcloning the BamHI SLC/CCL21 fragment from
pGBT9-SLC (see Example 1) into the unique BamHI cloning site of vector pGBKT7 (Clontech).
Positive two-hybrid interaction with chemokine SLC/CCL21 was observed with mutants THAP1-C1,
-C2 and -C3, but not with mutants THAP1-N1, -N2 and -N3, suggesting that the SLC/CCL21
chemokine-binding domain of human THAP1 is found between THAP1 residues 143 and 213
(Figure 12).

EXAMPLE 16 
In vitro THAP1/chemokine SLC-CCL21 interaction assay

To confirm the interaction observed in yeast two-hybrid system, we performed in vitro GST 
pull down assays. THAP1, expressed as a GST-tagged fusion protein and immobilized on 
glutathione sepharose, was incubated with radiolabeled in vitro translated SLC/CCL21. 

To generate the GST-THAP1 expression vector, the full-length coding region of THAP1
(amino acids 1-213) was amplified by PCR from HEVEC cDNA with primers 2HMR8
(5'-CGCGGATCCGTGCAGTCCTGCTCCGCCTACGGC-3') (SEQ ID NO: 214) and 2HMR11
(5'-CCGAATT(^TATGCTGGTACTTCAACTATTTCAAAGTAG-3') (SEQ ID NO: 215), digested
with BamHI and EcoRI, and cloned in frame downstream of the Glutathione S-Transferase ORF,
between the BamHI and EcoRI sites of the pGEX-2T prokaryotic expression vector (Amersham
Pharmacia Biotech, Saclay, France). GST-THAP1 fusion protein encoded by plasmid
pGEX-2T-THAP1 and control GST protein encoded by plasmid pGEX-2T were then expressed in
E. coli DH5α and purified by affinity chromatography with glutathione sepharose according to
supplier's instructions (Amersham Pharmacia Biotech). The yield of proteins used in GST pull-down
assays was determined by SDS-Polyacrylamide Gel Electrophoresis (PAGE) and Coomassie blue
staining analysis.

In vitro-translated SLC/CCL21 was generated with the TNT-coupled reticulocyte lysate
system (Promega, Madison, WI, USA) using as template pGBKT7-SLC/CCL21 vector (see
Example 15). 25 µl of 35S-labelled wild-type SLC/CCL21 was incubated with immobilized
GST-THAP1 or GST proteins overnight at 4 °C, in the following binding buffer: 10 mM NaPO4 pH 8.0,
140 mM NaCl, 3 mM MgCl2, 1 mM dithiothreitol (DTT), 0.05% NP40, 0.2 mM phenylmethyl
sulphonyl fluoride (PMSF), 1 mM Na vanadate, 50 mM β-glycerophosphate, 25 µg/ml
chymotrypsin, 5 µg/ml aprotinin and 10 µg/ml leupeptin. Beads were then washed 5 times in 1 ml
binding buffer. Bound proteins were eluted with 2X Laemmli SDS-PAGE sample buffer,
fractionated by 10% SDS-PAGE and visualized by fluorography using Amplify (Amersham
Pharmacia Biotech). As expected, GST/THAP1 interacted with SLC/CCL21 (Figure 13). In
contrast, SLC/CCL21 failed to interact with GST beads.
EXAMPLE 17

Identification of the THAP1-binding domain of human chemokine SLC/CCL21
To determine the THAP1-binding site on human chemokine SLC/CCL21, a SLC/CCL21 deletion
mutant (SLC/CCL21ΔCOOH) lacking the SLC-specific basic carboxy-terminal extension (amino
acids 102-134; GenBank Accession Number NP_002980) was generated. This
SLC/CCL21ΔCOOH mutant, which retains the CCR7 chemokine receptor binding domain of
SLC/CCL21 (amino acids 24-101), was used both in yeast two-hybrid assays with THAP1 bait and
in in vitro GST-pull down assays with GST-THAP1.

For two-hybrid assays, yeast cells were cotransformed with BD7-THAP1 and
AD7-SLC/CCL21 or AD7-SLC/CCL21ΔCOOH expression vectors. AD7-SLC/CCL21 or
AD7-SLC/CCL21ΔCOOH expression vectors were generated by subcloning a BamHI fragment (encoding
SLC amino acids 24-134) or a BamHI-PstI fragment (encoding SLC amino acids 24-102) from
pGBKT7-SLC/CCL21 (see Example 15) into pGADT7 expression vector (Clontech). Transformants
were selected on media lacking histidine and adenine. Figure 13 shows that both the SLC/CCL21
wild type and the SLC/CCL21ΔCOOH deletion mutant could bind to THAP1. Identical results
were obtained by cotransformation of AD7-THAP1 with BD7-SLC/CCL21 or
BD7-SLC/CCL21ΔCOOH.

GST pull down assays, using in vitro-translated SLC/CCL21ΔCOOH, generated with the
TNT-coupled reticulocyte lysate system (Promega, Madison, WI, USA) using as template
pGBKT7-SLC/CCL21ΔCOOH, were performed as described in Example 16. Figure 13 shows that
both the SLC/CCL21 wild type and the SLC/CCL21ΔCOOH deletion mutant could bind to
THAP1.

EXAMPLE 18 
Preparation of THAP1/Fc Fusion Proteins
This example describes preparation of a fusion protein comprising THAP1 or the
SLC/CCL21 chemokine-binding domain of THAP1 fused to an Fc region polypeptide derived from
an antibody. An expression vector encoding the THAP1/Fc fusion protein is constructed as follows.

Briefly, the full length coding region of human THAP1 (SEQ ID NO: 3; amino acids 1 to
213) or the SLC/CCL21 chemokine-binding domain of human THAP1 (SEQ ID NO: 3; amino
acids 143 to 213) is amplified by PCR. The oligonucleotides employed as 5' primers in the PCR
contain an additional sequence that adds a Not I restriction site upstream. The 3' primer includes an
additional sequence that encodes the first two amino acids of an Fc polypeptide, and a sequence that
adds a Bgl II restriction site downstream of the THAP1 and Fc sequences.

A recombinant vector containing the human THAP1 cDNA is employed as the template in
the PCR, which is conducted according to conventional procedures. The amplified DNA is then
digested with Not I and Bgl II, and the desired fragments are purified by electrophoresis on an
agarose gel.
A DNA fragment encoding the Fc region of a human IgG1 antibody is isolated by digesting
a vector containing cloned Fc-encoding DNA with Bgl II and Not I. Bgl II cleaves at a unique Bgl
II site introduced near the 5' end of the Fc-encoding sequence, such that the Bgl II site encompasses
the codons for amino acids three and four of the Fc polypeptide. Not I cleaves downstream of the
Fc-encoding sequence. The nucleotide sequence of cDNA encoding the Fc polypeptide, along with
the encoded amino acid sequence, can be found in International Publication No: WO93/10151.

In a three-way ligation, the above-described THAP1 (or SLC/CCL21 chemokine-binding
domain of THAP1)-encoding DNA and Fc-encoding DNA are inserted into an expression vector
that has been digested with Not I and treated with a phosphatase to minimize recircularization of
any vector DNA without an insert. An example of a vector which can be used is pDC406
(described in McMahan et al., EMBO J. 10:2821, 1991), which is a mammalian expression vector
that is also capable of replication in E. coli.

E. coli cells are then transfected with the ligation mixture, and the desired recombinant
vectors are isolated. The vectors encode amino acids 1 to 213 of the THAP1 sequence (SEQ ID
NO: 3) or amino acids 143 to 213 of the THAP1 sequence (SEQ ID NO: 3), fused to the
N-terminus of the Fc polypeptide. The encoded Fc polypeptide extends from the N-terminal hinge
region to the native C-terminus, i.e., is an essentially full-length antibody Fc region.

CV-1/EBNA-1 cells are then transfected with the desired recombinant vectors isolated from E. coli.
CV-1/EBNA-1 cells (ATCC CRL 10478) can be transfected with the recombinant vectors by
conventional procedures. The CV-1/EBNA-1 cell line was derived from the African Green Monkey
kidney cell line CV-1 (ATCC CCL 70), as described by McMahan et al. (1991) EMBO J. 10:2821.
The transfected cells are cultured to allow transient expression of the THAP1/Fc or SLC/CCL21
chemokine-binding domain of THAP1/Fc fusion proteins, which are secreted into the culture
medium. The secreted proteins contain the mature form of THAP1 or the SLC/CCL21
chemokine-binding domain of THAP1, fused to the Fc polypeptide. The THAP1/Fc and SLC/CCL21
chemokine-binding domain of THAP1/Fc fusion proteins are believed to form dimers, wherein two
such fusion proteins are joined by disulfide bonds that form between the Fc moieties thereof. The
THAP1/Fc and SLC/CCL21 chemokine-binding domain of THAP1/Fc fusion proteins can be
recovered from the culture medium by affinity chromatography on a Protein A-bearing
chromatography column.

EXAMPLE 19 
The THAP domain defines a family of nuclear factors 
To determine the subcellular localization of the different human THAP proteins, a series of
GFP-THAP expression constructs were transfected into primary human endothelial cells. In
agreement with the possible functions of THAP proteins as DNA-binding factors, we found that all
the human THAP proteins analyzed (THAP0, 1, 2, 3, 6, 7, 8, 10, 11) localize preferentially to the
cell nucleus (Figure 14). In addition to their diffuse nuclear localization, some of the THAP
proteins also exhibited association with distinct subnuclear structures: the nucleolus for THAP2 and
THAP3, and punctate nuclear bodies for THAP7, THAP8 and THAP11. Indirect
immunofluorescence microscopy with anti-PML antibodies revealed that the THAP8 and THAP11
nuclear bodies colocalize with PML-NBs. Although the THAP7 nuclear bodies often appeared in
close association with the PML-NBs, they never colocalized.

Analysis of the subcellular localization of the GFP-THAP fusion proteins was performed as
described above (Example 3). The GFP-THAP constructs were generated as follows. The human
THAP0 coding region was amplified by PCR from HEVEC cDNA with primers THAP0-1
(5'-GCCGAATTCATGCCGAACTTCTGCGCTGCCCCC-3') (SEQ ID NO: 216) and THAP0-2
(5'-CGCGGATCCTTAGGTTATTTTCCACAGTTTCGGAATTATC-3') (SEQ ID NO: 217), digested
with EcoRI and BamHI, and cloned in the same sites of the pEGFP-C2 vector, to generate
pEGFPC2-THAP0. The coding regions of human THAP2, 3, 7, 6 and 8 were amplified by PCR,
respectively, from Image clone No: 3606376 with primers THAP2-1
(5'-GCGCTGCAGCAAGCTAAATTTAAATGAAGGTACTCTTGG-3') (SEQ ID NO: 218) and
THAP2-2 (5'-GCGAGATCTGGGAAATGCCGACCAATTGCGCTGCG-3') (SEQ ID NO: 219),
digested with BglII and PstI; from Image clones No: 4813302 and No: 3633743 with primers
THAP3-1 (5'-AGAGGATCCTTAGCTCTGCTGCTCTGGCCCAAGTC-3') (SEQ ID NO: 220) and
THAP3-2 (5'-AGAGAATTCATGCCGAAGTCGTGCGCGGCCCG-3') (SEQ ID NO: 221), and
primers THAP7-1 (5'-GCGGAATTCATGCCGCGTCACTGCTCCGCCGC-3') (SEQ ID NO: 222) and
THAP7-2 (5'-GCGGGATCCTCAGGCCATGCTGCTGCTCAGCTGC-3') (SEQ ID NO: 223),
digested with EcoRI and BamHI; from Image clone No: 757753 with primers THAP6-1
(5'-GCGAGATCTCGATGGTGAAATGCTGCTCCGCCATTGGA-3') (SEQ ID NO: 224) and
THAP6-2 (5'-GCGGGATCCTCATGAAATATAGTCCTGTTCTATGCTCTC-3') (SEQ ID NO:
225), digested with BglII and BamHI; and from Image clone No: 4819178 with primers THAP8-1
(5'-GCGAGATCTCGATGCCCAAGTACTGCAGGGCGCCG-3') (SEQ ID NO: 226) and
THAP8-2 (5'-GCGGAATTCTTATGCACTGGGGATCCGAGTGTCCAGG-3') (SEQ ID NO:
227), digested with BglII and EcoRI. Each product was cloned in frame downstream of the Enhanced Green
Fluorescent Protein (EGFP) ORF in pEGFPC2 vector (Clontech) digested with the same enzymes,
to generate pEGFPC2-THAP2, -THAP3, -THAP7, -THAP6 and -THAP8. The human THAP10 and
THAP11 coding regions were amplified by PCR from HeLa cDNA, respectively, with primers
THAP10-1 (5'-GCGGAATTCATGCCGGCCCGTTGTGTGGCCGC-3') (SEQ ID NO: 228) and
THAP10-2 (5'-GCGGGATCCTTAACATGTTTCTTCTTTCACCTGTACAGC-3') (SEQ ID NO:
229), digested with EcoRI and BamHI, and with primers THAP11-1
(5'-GCGAGATCTCGATGCCTGGCTTTACGTGCTGCGTGC-3') (SEQ ID NO: 230) and THAP11-2
(5'-GCGGAATTCTCACATTCCGTGCTTCTTGCGGATGAC-3') (SEQ ID NO: 231), digested
with BglII and EcoRI, and cloned in the same sites of the pEGFP-C2 vector, to generate
pEGFPC2-THAP10 and -THAP11.
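The restriction sites carried by the cloning primers above can be checked with a short script. The following is a minimal sketch in Python, not part of the described method: the function name `find_sites` and the choice of four primers are illustrative assumptions, and the primer strings are copied from this example with spacing removed. Only the well-known recognition sequences of EcoRI, BamHI, BglII and PstI are used.

```python
# Recognition sequences of the enzymes named in the cloning steps above.
SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "BglII": "AGATCT",
    "PstI": "CTGCAG",
}

# A few primers from this example, written 5'->3'.
PRIMERS = {
    "THAP0-1": "GCCGAATTCATGCCGAACTTCTGCGCTGCCCCC",
    "THAP0-2": "CGCGGATCCTTAGGTTATTTTCCACAGTTTCGGAATTATC",
    "THAP8-1": "GCGAGATCTCGATGCCCAAGTACTGCAGGGCGCCG",
    "THAP8-2": "GCGGAATTCTTATGCACTGGGGATCCGAGTGTCCAGG",
}


def find_sites(seq):
    """Return {enzyme: [0-based start positions]} for each site found in seq."""
    seq = seq.upper()
    hits = {}
    for enzyme, site in SITES.items():
        positions = [
            i for i in range(len(seq) - len(site) + 1)
            if seq[i:i + len(site)] == site
        ]
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, find_sites(seq))
```

Such a check only reports where recognition sequences occur in a primer; it says nothing about whether a site is also present inside the amplified coding region, which would need the full template sequence.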



EXAMPLE 20
The THAP domain shares structural similarities with 
the DNA-binding domain of nuclear hormone receptors 
In an effort to model the three-dimensional structure of the THAP domain, we searched the
PDB crystallographic database. As sequence homology detection is more sensitive and selective
when aided by secondary structure information, structural homologs of the THAP domain of human
THAP1 were searched using the SeqFold threading program (Olszewski et al. (1999) Theor. Chem.
Acc. 101, 57-61), which combines sequence and secondary structure alignment. The
crystallographic structure of the thyroid hormone receptor β DBD (PDB code: 2NLL) gave the best
score of the search, and we used the resulting structural alignment, displayed in Figure 15A, to
derive a homology-based model of the THAP domain from human THAP1 (Figure 15B). Note that
the distribution of Cys residues in the THAP domain does not fully match that of the thyroid
hormone receptor β DBD (Figure 15A) and hence cannot allow the formation of the two
characteristic 'C4-type' Zn-fingers (red color-coding in Figure 15A). However, a network of
stacking interactions between aromatic/hydrophobic residues or aliphatic parts of lysine side-chains
ensures the stability of the structure of the THAP domain (cyan color-coding in Figures 15A and
15B). Interestingly, the same threading method applied independently to the Drosophila P-element
transposase DBD identified the crystallographic structure of the glucocorticoid receptor DBD (PDB
code: 1GLU) as giving the best score. In the same way, we used the resulting structural alignment,
displayed in Figure 15D, to build a model of the transposase DBD (Figure 15C). Note the presence
of a hydrophobic core equivalent to that of the THAP domain (cyan color-coding in Figures 15C
and 15D). All the DNA-binding domains of the nuclear receptors fold into a typical pattern which is
mainly based on two interacting α-helices, the first one inserting into the target DNA major groove.
Our threading and modeling results indicate that the THAP domain and the D. melanogaster
P-element transposase DBD likely share a common topology which is similar to that of the DBD of
nuclear receptors.

Molecular modeling was performed using the InsightII, SeqFold, Homology and Discover
modules from the Accelrys (San Diego, CA) molecular modeling software (version 98), run on a
Silicon Graphics O2 workstation. Optimal secondary structure prediction of the query protein
domains was ensured by the DSC method within SeqFold. The threading-derived secondary
structure alignment was used as input for homology modeling, which was performed according to
a previously described protocol (Manival et al. (2001) Nucleic Acids Res 29:2223-2233). The
validity of the models was checked both by Ramachandran analysis and folding consistency
verification as previously reported (Manival et al. (2001) Nucleic Acids Res 29:2223-2233).

EXAMPLE 21
Homodimerization domain of human THAP1
To identify the sequences mediating homodimerization of THAP1, a series of THAP1
deletion constructs was generated as described in Example 7.

Two-hybrid interaction between THAP1 mutants and THAP1 wild type was tested by
cotransformation of AH109 with pGADT7-THAP1-C1, -C2, -C3, -N1, -N2 or -N3 and
pGBKT7-THAP1 wild-type and selection of transformants by His and Ade double auxotrophy according to
manufacturer's instructions (MATCHMAKER two-hybrid system 3, Clontech). Positive two-hybrid
interaction with THAP1 wild type was observed with mutants THAP1-C1, -C2, -C3 and -N3 but
not with mutants THAP1-N1 and -N2, suggesting the THAP1 homodimerization domain is found
between THAP1 residues 143 and 192 (Figure 16A).

To confirm the results obtained in yeast, THAP1 mutants were also tested in in vitro GST
pull-down assays. Wild-type THAP1, expressed as a GST-tagged fusion protein and immobilized on
glutathione sepharose (as described in Example 16), was incubated with radiolabeled in vitro-translated
THAP1 mutants. In vitro-translated THAP1 mutants were generated with the TNT-coupled
reticulocyte lysate system (Promega, Madison, WI, USA) using pGADT7-THAP1-C1,
-C2, -C3, -N1, -N2 or -N3 vector as template. 25 µl of each 35S-labelled THAP1 mutant was
incubated with immobilized GST or GST-THAP1 wild-type protein overnight at 4 °C, in the
following binding buffer: 10 mM NaPO4 pH 8.0, 140 mM NaCl, 3 mM MgCl2, 1 mM
dithiothreitol (DTT), 0.05% NP40, 0.2 mM phenylmethyl sulphonyl fluoride (PMSF), 1 mM
Na vanadate, 50 mM β-glycerophosphate, 25 µg/ml chymotrypsin, 5 µg/ml aprotinin and 10 µg/ml
leupeptin. Beads were then washed 5 times in 1 ml binding buffer. Bound proteins were eluted
with 2X Laemmli SDS-PAGE sample buffer, fractionated by 10% SDS-PAGE and visualized by
fluorography using Amplify (Amersham Pharmacia Biotech). As expected, THAP1-C1, -C2, -C3
and -N3 interacted with GST/THAP1 (Figure 16B). In contrast, THAP1-N1 and -N2 failed to
interact with GST/THAP1 beads. Therefore, essentially identical results were obtained with the two
THAP1/THAP1 interaction assays: the homodimerization domain of THAP1 is found
between residues 143 and 192 of human THAP1.

EXAMPLE 22 
Alternatively spliced isoform of human THAP1 
Two distinct THAP1 cDNAs, THAP1a and THAP1b, have been discovered (Figure
17A). These splice variants were amplified by PCR from HEVEC cDNA with primers 2HMR10
(5'-CCGAATTCAGGATGGTGCAGTCCTGCTCCGCCT-3') (SEQ ID NO: 232) and 2HMR9
(5'-CGCGGATCCTGCTGGTACTTCAACTATTTCAAAGTAGTC-3') (SEQ ID NO: 233), digested
with EcoRI and BamHI, and cloned in frame upstream of the Enhanced Green Fluorescent Protein
(EGFP) ORF in pEGFP.N3 vector (Clontech) to generate pEGFP.N3-THAP1a and
pEGFP-THAP1b. DNA sequencing revealed that the THAP1b cDNA isoform lacks exon 2 (nucleotides 273-
468) of the human THAP1 gene (Figure 17B). This alternatively spliced isoform of human THAP1
(~2 kb mRNA) was also observed in many other tissues by Northern blot analysis (see Figure 2).
The THAP1a/GFP and THAP1b/GFP expression constructs were then transfected into COS 7 cells
(ATCC) and expression of the fusion proteins was analyzed by western blotting with anti-GFP
antibodies. The results are shown in Figure 17C, which demonstrates that the second isoform of
human THAP1 (THAP1b) encodes a truncated THAP1 protein (THAP1 C3) lacking a substantial
portion of the amino terminus (amino acids 1-142 of SEQ ID NO: 3).

EXAMPLE 23 

High throughput screening assay for modulators of THAP-family
polypeptide pro-apoptotic activity
A high throughput screening assay for molecules that abrogate or stimulate THAP-family
polypeptide pro-apoptotic activity was developed, based on serum-withdrawal induced apoptosis in
a 3T3 cell line with tetracycline-regulated expression of a THAP-family polypeptide.

In a preferred example, the THAP1 cDNA with an in-frame myc tag sequence was
amplified by PCR using pGBKT7-THAP1 as a template with primers myc.BD1
(5'-GCGCTCTAGAGCCATCATGGAGGAGCAGAAGCTGATC-3') (SEQ ID NO: 234) and
2HMR15 (5'-GCGCTCTAGATTATGCTGGTACTTCAACTATTTCAAAGTAG-3') (SEQ ID
NO: 235), and cloned downstream of a tetracycline-regulated promoter in plasmid vector pTRE
(Clontech, Palo Alto, CA), using the XbaI restriction site, to generate plasmid pTRE-mycTHAP1. To
establish 3T3-TO-mycTHAP1 stable cell lines, mouse 3T3-TO fibroblasts (Clontech) were seeded
at 40 to 50% confluency and co-transfected with the pREP4 plasmid (Invitrogen), which contains a
hygromycin B resistance gene, and the mycTHAP1 expression vector (pTRE-mycTHAP1) at a 1:10
ratio, using Lipofectamine Plus reagent (Life Technologies) according to supplier's instructions.
Transfected cells were selected in medium containing hygromycin B (250 U/ml; Calbiochem) and
tetracycline (2 µg/ml; Sigma). Several resistant colonies were picked and analyzed for the
expression of mycTHAP1 by indirect immunofluorescence using anti-myc epitope monoclonal
antibody (mouse IgG1, 1/200, Clontech). A stable 3T3-TO cell line expressing mycTHAP1
(3T3-TO-mycTHAP1) was selected and grown in Dulbecco's Modified Eagle's Medium supplemented
with 10% Fetal Calf Serum, 1% penicillin-streptomycin (all from Life Technologies, Grand Island,
NY, USA) and tetracycline (2 µg/ml; Sigma). Induction of THAP1 expression in this
3T3-TO-mycTHAP1 cell line was obtained 48 h after removal of tetracycline from the complete medium.
A drug screening assay using the 3T3-TO-mycTHAP1 cell line can be carried out as
follows. 3T3-TO-mycTHAP1 cells are plated in 96- or 384-well microplates and THAP1
expression is induced by removal of tetracycline from the complete medium. 48 h later, the apoptotic
response to serum withdrawal is assayed in the presence of a test compound, allowing the
identification of test compounds that either enhance or inhibit the ability of THAP1 polypeptide to
induce apoptosis. Serum starvation of 3T3-TO-mycTHAP1 cells is induced by changing the
medium to 0% serum, and the number of cells with apoptotic nuclei is assessed 24 h after induction
of serum starvation by TUNEL labeling in 96- or 384-well microplates. Cells are fixed in PBS
containing 3.7% formaldehyde and permeabilized with 0.1% Triton X-100, and apoptosis is scored
by in situ TUNEL (terminal deoxynucleotidyl transferase-mediated dUTP nick end labeling)
staining of apoptotic nuclei for 1 hr at 37°C using the in situ cell death detection kit, TMR red
(Roche Diagnostics, Meylan, France). The intensity of TMR red fluorescence in each well is then
quantified to identify test compounds that modify the fluorescence signal and thus either enhance or
inhibit THAP1 pro-apoptotic activity.

EXAMPLE 24 

High throughput two-hybrid screening assay for drugs that modulate THAP-family
polypeptide/THAP-family target protein interaction
To identify drugs that modulate THAP1/Par4 or THAP1/SLC interactions, a two-hybrid
based high throughput screening assay can be used.

As described in Example 17, AH109 yeast cells (Clontech) cotransformed with plasmids
pGBKT7-THAP1 and pGADT7-Par4 or pGADT7-SLC can be grown in 384-well plates in
selective media lacking histidine and adenine, according to manufacturer's instructions
(MATCHMAKER two-hybrid system 3, Clontech).

Growth of the transformants on media lacking histidine and adenine is absolutely
dependent on the THAP1/Par4 or THAP1/SLC two-hybrid interaction, and drugs that disrupt
THAP1/Par4 or THAP1/SLC binding will therefore inhibit yeast cell growth.

Small molecules (5 mg/ml in DMSO; Chembridge) are added by using plastic 384-pin
arrays (Genetix). The plates are incubated for 4 to 5 days at 30 °C, and small molecules which
inhibit growth of yeast cells by disrupting the THAP1/Par4 or THAP1/SLC two-hybrid interaction
are selected for further analysis.

EXAMPLE 25 
High throughput in vitro assay to identify inhibitors of
THAP-family polypeptide/THAP-family protein target interaction

To identify small molecule modulators of THAP function, a high-throughput screen based
on fluorescence polarization (FP) is used to monitor the displacement of a fluorescently labelled
THAP1 protein from a recombinant glutathione-S-transferase (GST)-THAP binding domain of
Par4 (Par4DD) fusion protein or a recombinant GST-SLC/CCL21 fusion protein.
Assays are carried out essentially as in Degterev et al., Nature Cell Biol. 3: 173-182 (2001)
and Dandliker et al., Methods Enzymol. 74: 3-28 (1981). The assay can be calibrated by titrating a
THAP1 peptide labelled with Oregon Green with increasing amounts of GST-Par4DD or
GST-SLC/CCL21 proteins. Binding of the peptide is accompanied by an increase in polarization
(mP, millipolarization).
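The polarization readout mentioned above can be sketched as follows. This is a minimal illustration in Python using the standard definition of polarization, P = (I_par - G*I_perp)/(I_par + G*I_perp), reported in mP (P x 1000). The function name and the instrument correction factor `g_factor` are assumptions introduced here; they are not taken from the cited protocols, and plate readers normally perform this calculation internally.

```python
def millipolarization(i_parallel, i_perpendicular, g_factor=1.0):
    """Return fluorescence polarization in mP.

    i_parallel and i_perpendicular are background-subtracted emission
    intensities measured parallel and perpendicular to the excitation
    polarization. g_factor is an assumed instrument correction factor.
    """
    perp = g_factor * i_perpendicular
    total = i_parallel + perp
    if total <= 0:
        raise ValueError("total intensity must be positive")
    return 1000.0 * (i_parallel - perp) / total


if __name__ == "__main__":
    # Hypothetical readings: free peptide versus peptide bound to GST fusion.
    free = millipolarization(1100.0, 1000.0)
    bound = millipolarization(1400.0, 1000.0)
    print(free, bound, bound > free)
```

In a displacement screen of this kind, a compound that releases the labelled peptide from the GST fusion would be expected to lower the mP value back toward that of the free peptide.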

THAP1 and PAR4 polypeptides and GST-fusions can be produced as previously
described. The THAP1 peptide was expressed and purified using a QIAexpressionist kit (Qiagen)
according to the manufacturer's instructions. Briefly, the entire THAP1 coding sequence was
amplified by PCR using pGBKT7-THAP1 as a template with primers 2HMR8
(5'-CGCGGATCCGTGCAGTCCTGCTCCGCCTACGGC-3') (SEQ ID NO: 236) and 2HMR9
(5'-CGCGGATCCTGCTGGTACTTCAACTATTTCAAAGTAGTC-3') (SEQ ID NO: 237), and
cloned into the BamHI site of pQE30 vector (Qiagen). The resulting pQE30-HisTHAP1 plasmid
was transformed in E. coli strain M15 (Qiagen). 6xHis-tagged THAP1 protein was purified from
inclusion bodies on a Ni-Agarose column (Qiagen) under denaturing conditions, and the eluate was
used for in vitro interaction assays. To produce GST-Par4DD fusion protein, Par4DD (amino acids
250-342) was amplified by PCR with primers Par4.10
(5'-GCCGGATCCGGGTTCCCTAGATATAACAGGGATGCAA-3') (SEQ ID NO: 238) and Par4.5
(5'-GCGGGATCCCTCTACCTGGTCAGCTGACCCACAAC-3') (SEQ ID NO: 239), and cloned
in frame downstream of the Glutathione S-Transferase (GST) ORF, into the BamHI site of the
pGEX-2T prokaryotic expression vector (Amersham Pharmacia Biotech, Saclay, France).
Similarly, to produce GST-SLC/CCL21 fusion protein, the mature form of human SLC/CCL21
(amino acids 24-134) was amplified by PCR with primers hSLCbam.5'
(5'-GCGGGATCCAGTGATGGAGGGGCTCAGGACTGTTG-3') (SEQ ID NO: 240) and
hSLCbam.3' (5'-GCGGGATCCCTATGGCCCTTTAGGGGTCTGTGACC-3') (SEQ ID NO:
241), digested with BamHI and inserted into the BamHI cloning site of the pGEX-2T vector.
GST-Par4DD (amino acids 250-342) and GST-SLC/CCL21 (amino acids 24-134) fusion proteins were
expressed in E. coli DH5α (supE44, ΔlacU169 (φ80lacZΔM15), hsdR17, recA1, endA1,
gyrA96, thi-1, relA1) and purified by affinity chromatography with glutathione sepharose according
to supplier's instructions (Amersham Pharmacia Biotech).

For screening small molecules, THAP1 peptide is labelled with succinimidyl Oregon Green
(Molecular Probes, Eugene, Oregon) and purified by HPLC. 33 nM labelled THAP1 peptide, 2 µM
GST-Par4DD or GST-SLC/CCL21 protein, 0.1% bovine gamma-globulin (Sigma) and 1 mM
dithiothreitol mixed with PBS, pH 7.2 (Gibco), are added to 384-well black plates (Lab Systems)
with Multidrop (Lab Systems). Small molecules (5 mg/ml in DMSO; Chembridge) are transferred
by using plastic 384-pin arrays (Genetix). The plates are incubated for 1-2 hours at 25 °C, and FP
values are determined with an Analyst plate reader (LJL Biosystems).

EXAMPLE 26 

High throughput chip assay to identify inhibitors of
THAP-family polypeptide/THAP-family protein target interaction
A chip-based binding assay (Degterev et al. (2001) Nature Cell Biol. 3: 173-182) using
unlabeled THAP and THAP-family target protein may be used to identify molecules capable of
interfering with THAP-family and THAP-family target interactions, providing high sensitivity and
avoiding potential interference from label moieties. In this example, the THAP1 binding domain of
Par4 protein (Par4DD) or SLC/CCL21 is covalently attached to a surface-enhanced laser
desorption/ionization (SELDI) chip, and binding of unlabeled THAP1 protein to immobilized
protein in the presence of a test compound is monitored by mass spectrometry.

Recombinant THAP1 protein, GST-Par4DD and GST-SLC/CCL21 fusion proteins are
prepared as described in Example 25. Purified recombinant GST-Par4DD or GST-SLC/CCL21
protein is coupled through its primary amine to SELDI chip surfaces derivatized with
carbonyldiimidazole (Ciphergen). THAP1 protein is incubated in a total volume of 1 µl for 12
hours at 4 °C in a humidified chamber to allow binding to each spot of the SELDI chip, then
washed with alternating high-pH and low-pH buffers (0.1 M sodium acetate containing 0.5 M NaCl,
followed by 0.01 M HEPES, pH 7.3). The samples are embedded in an
alpha-cyano-4-hydroxycinnamic acid matrix and analyzed for mass by matrix-assisted laser desorption ionization
time-of-flight (MALDI-TOF) mass spectrometry. Averages of 100 laser shots at a constant setting
are collected over 20 spots in each sample.

EXAMPLE 27 
High throughput cell assay to identify inhibitors of
THAP-family polypeptide/THAP-family protein target interaction

A fluorescence resonance energy transfer (FRET) assay is carried out between THAP-1
and PAR4 or SLC/CCL21 proteins fused with fluorescent proteins. Assays can be carried out as in
Mahajan et al., Nature Biotechnology 16: 547-552 (1998) and Degterev et al., Nature Cell Biol. 3:
173-182 (2001).

THAP-1 protein is fused to cyan fluorescent protein (CFP) and PAR4 or SLC/CCL21
protein is fused to yellow fluorescent protein (YFP). Vectors containing THAP-family and
THAP-family target proteins can be constructed essentially as in Mahajan et al. (1998). A THAP-1-CFP
expression vector is generated by subcloning a THAP-1 cDNA into the pECFP-N1 vector
(Clontech). PAR4-YFP or SLC/CCL21-YFP expression vectors are generated by subcloning a
PAR4 or an SLC/CCL21 cDNA into the pEYFP-N1 vector (Clontech).

Vectors are cotransfected into HEK-293 cells and cells are treated with test compounds.
HEK-293 cells are transfected with THAP-1-CFP and PAR4-YFP or SLC/CCL21-YFP expression
vectors using LipofectAMINE Plus (Gibco) or TransLT-1 (PanVera). 24 hours later, cells are
treated with test compounds and incubated for various time periods, preferably up to 48 hours.
Cells are harvested in PBS, optionally supplemented with test compound, and fluorescence is
determined with a C-60 fluorimeter (PTI) or a Wallac plate reader. Fluorescence in the samples
separately expressing THAP-1-CFP and PAR4-YFP or SLC/CCL21-YFP is added together and
used to estimate the FRET value in the absence of THAP-1/PAR4 or THAP-1/SLC/CCL21 binding.
The extent of FRET between CFP and YFP is determined as the ratio between the
fluorescence at 527 nm and that at 475 nm after excitation at 433 nm. The cotransfection of THAP-1
protein and PAR4 or SLC/CCL21 protein results in an increase of FRET ratio over a reference
FRET ratio of 1.0 (determined using samples expressing the proteins separately). A change in the
FRET ratio upon treatment with a test compound (over that observed after cotransfection in the
absence of a test compound) indicates a compound capable of modulating the interaction of the
THAP-1 protein and the PAR4 or the SLC/CCL21 protein.
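The FRET ratio calculation described above can be sketched as follows. This is a minimal illustration in Python under stated assumptions: readings are background-subtracted emission intensities after excitation at 433 nm, each reading is passed as a hypothetical `(f475, f527)` tuple, and the function names are invented for this sketch. The numeric values in the usage section are made up for illustration and are not experimental data.

```python
def fret_ratio(f475, f527):
    """Raw FRET ratio: emission at 527 nm divided by emission at 475 nm."""
    if f475 <= 0:
        raise ValueError("475 nm emission must be positive")
    return f527 / f475


def reference_ratio(cfp_only, yfp_only):
    """Ratio expected without binding.

    The fluorescence of the two separately transfected samples is added
    together channel by channel, then the 527/475 ratio is taken.
    """
    f475 = cfp_only[0] + yfp_only[0]
    f527 = cfp_only[1] + yfp_only[1]
    return fret_ratio(f475, f527)


def normalized_fret(sample, cfp_only, yfp_only):
    """Sample ratio scaled so that the no-binding reference equals 1.0."""
    return fret_ratio(*sample) / reference_ratio(cfp_only, yfp_only)


if __name__ == "__main__":
    cfp_only = (100.0, 40.0)      # hypothetical CFP fusion alone
    yfp_only = (5.0, 10.0)        # hypothetical YFP fusion alone
    cotransfected = (80.0, 60.0)  # hypothetical cotransfected sample
    treated = (95.0, 52.0)        # hypothetical sample plus test compound
    print(normalized_fret(cotransfected, cfp_only, yfp_only))
    print(normalized_fret(treated, cfp_only, yfp_only))
```

A normalized value above 1.0 for the cotransfected sample corresponds to the increase in FRET ratio described in the text, and a shift of that value after compound treatment is the readout used to flag candidate modulators.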

EXAMPLE 28 

In vitro assay to identify THAP-family polypeptide DNA targets
DNA binding specificity of THAP1 was determined using a random oligonucleotide
selection method allowing unbiased analysis of binding sites selected by the THAP domain of the
THAP1 protein from a random pool of possible sites. The method was carried out essentially as
described in Bouvet (2001) Methods Mol. Biol. 148:603-10. Also, see Pollack and Treisman (1990)
Nuc. Acid Res. 18:6197-6204; Blackwell and Weintraub (1990) Science 250:1104-1110; Ko and
Engel (1993) Mol. Cell. Biol. 13:4011-4022; Merika and Orkin (1993) Mol. Cell. Biol. 13:3999-
4010; and Krueger and Morimoto (1994) Mol. Cell. Biol. 14:7592-7603.
Recombinant THAP domain expression and purification 

A cDNA fragment encoding the THAP domain of human THAP-1 (amino acids 1-90, SEQ
ID NO: 3) was cloned by PCR using as a template pGADT7-THAP-1 (see Example 4) with the
following primers: 5'-GCGCATATGGTGCAGTCCTGCTCCGCCTACGGC-3' (SEQ ID NO: 242)
and 5'-GCGCTCGAGTTTCTTGTCATGTGGCTCAGTACAAAG-3' (SEQ ID NO: 243). The
PCR product was cloned as an NdeI-XhoI fragment into the pET-21c prokaryotic expression vector
(Novagen) in frame with a sequence encoding a carboxy-terminal His tag, to generate
pET-21c-THAP.

For the expression of THAP-His6, pET-21c-THAP was transformed into Escherichia coli
strain BL-21 pLysS. Bacteria were grown at 37°C to an optical density at 600 nm of 0.6,
expression of the protein was induced by adding isopropyl-β-D-thiogalactoside (Sigma) at a final
concentration of 1 mM, and incubation was continued for 4 hours.

The cells were collected by centrifugation and resuspended in ice-cold buffer A (50 mM
sodium phosphate pH 7.5, 300 mM NaCl, 0.1% β-mercaptoethanol, 10 mM imidazole). Cells were
lysed by sonication and the lysate was cleared by centrifugation at 12000 g for 45 min. The
supernatant was loaded onto a Ni-NTA agarose column (Qiagen) equilibrated in buffer A. After
washing with buffer A and buffer A with 40 mM imidazole, the protein was eluted with buffer B
(same as A with 0.05% β-mercaptoethanol and 250 mM imidazole).

Fractions containing THAP-His6 were pooled and applied to a Superdex 75 gel filtration
column equilibrated in buffer C (50 mM Tris-HCl pH 7.5, 150 mM NaCl, 1 mM DTT). Fractions
containing the THAP-His6 protein were pooled, concentrated with YM-3 Amicon filter devices
and stored at 4°C or frozen at -80°C in buffer C containing 20% glycerol. The purity of the sample
was assessed by SDS-polyacrylamide gel electrophoresis (PAGE) and Coomassie blue staining
analysis. The structural integrity of the protein preparation was checked by ESI mass spectrometry
and peptide mass mapping using a MALDI-TOF mass spectrometer. The protein concentration was
determined with the Bradford Protein Assay.
Random Oligonucleotide Selection 

According to the SELEX protocol described in Bouvet (2001) Methods Mol. Biol. 148:603-10,
a 62 bp oligonucleotide having the following sequence was synthesized:
5'-TGGGCACTATTTATATCAAC-N25-AATGTCGTTGGTGGCCC-3' (SEQ ID NO: 244),
where N is any nucleotide, together with primers complementary to each end. Primer P is
5'-ACCGCAAGCTTGGGCACTATTTATATCAAC-3' (SEQ ID NO: 245), and primer R is
5'-GGTCTAGAGGGCCACCAACGCATT-3' (SEQ ID NO: 246).

The 62-mer oligonucleotide is made double stranded by PCR using the P and R primers, generating
an 80 bp random pool.
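The layout of the random pool described above can be illustrated with a short Python sketch. It is an illustration only, not part of the SELEX protocol: the function names are hypothetical, and the 8-nt 5' tail of primer R (`R_TAIL`) is an assumption made here so that the arithmetic reproduces the stated 80 bp product. The sketch also checks that the primers carry the Hind III (AAGCTT) and Xba I (TCTAGA) recognition sequences.

```python
import random

LEFT_ARM = "TGGGCACTATTTATATCAAC"   # fixed 5' arm of the 62-mer
RIGHT_ARM = "AATGTCGTTGGTGGCCC"     # fixed 3' arm of the 62-mer
PRIMER_P = "ACCGCAAGCTTGGGCACTATTTATATCAAC"
PRIMER_R = "GGTCTAGAGGGCCACCAACGCATT"
R_TAIL = "GGTCTAGA"  # assumption: non-annealing 5' tail of primer R


def random_template(rng, n_random=25):
    """Return one library member: left arm + N25 core + right arm."""
    core = "".join(rng.choice("ACGT") for _ in range(n_random))
    return LEFT_ARM + core + RIGHT_ARM


def amplicon_length(n_random=25):
    """Length of the double-stranded product after PCR with P and R."""
    p_tail = len(PRIMER_P) - len(LEFT_ARM)  # 5' extension added by primer P
    template = len(LEFT_ARM) + n_random + len(RIGHT_ARM)
    return p_tail + template + len(R_TAIL)


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so the example is reproducible
    oligo = random_template(rng)
    print(len(oligo), amplicon_length())
    print("AAGCTT" in PRIMER_P, "TCTAGA" in PRIMER_R)
```

With 25 random positions the theoretical diversity of such a library is 4^25 sequences, far more than are present in the few micrograms of DNA used per round, which is why repeated rounds of selection and amplification are needed.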

About 250 ng of THAP-His6 was incubated with Ni-NTA magnetic beads in NT2 buffer
(20 mM Tris-HCl pH 7.5, 100 mM NaCl, 0.05% NP-40) for 30 min at 4°C on a roller. The beads
were washed 2 times with 500 µl of NT2 buffer to remove unbound protein. The immobilized
THAP-His6 was incubated with the random pool of double-stranded 80 bp DNA (2 to 5 µg) in 100
µl of binding buffer (20 mM Tris-HCl pH 7.5, 100 mM NaCl, 0.05% NP-40, 0.5 mM EDTA, 100
µg/ml BSA, and 20 to 50 µg of poly(dI-dC)) for 10 minutes at room temperature. The beads were
then washed 6 times with 500 µl of NT2 buffer. The protein/DNA complexes were then subjected to
extraction with phenol/chloroform and precipitation with ethanol using 10 µg of glycogen as a
carrier. About one fifth of the recovered DNA was then amplified by 15 to 20 cycles of PCR and
used for the next round of selection. After 8 rounds of selection, the NaCl concentration was
progressively increased to 150 mM.

After 12 rounds of selection by THAP-His6, pools of amplified oligonucleotides were
digested with Xba I and Hind III and cloned into pBluescript II KS- (Stratagene), and individual
clones were sequenced using the Big Dye terminator kit (Applied Biosystems).

The results of the sequence analysis show that the THAP domain of human THAP1 is a
site-specific DNA binding domain. Two consensus sequences were deduced from the alignment of
two sets of nucleotide sequences obtained from the above SELEX procedure (each set containing 9
nucleic acid sequences). In particular, it was found that the THAP domain recognizes GGGCAA or
TGGCAA DNA target sequences preferentially organized as direct repeats with 5 nucleotide
spacing (DR-5 motifs), the consensus sequence being GGGCAAnnnnnTGGCAA (SEQ ID NO:
149). Additionally, THAP recognizes everted repeats with 11 nucleotide spacing (ER-11 motifs)
having a consensus sequence of TTGCCAnnnnnnnnnnnGGGCAA (SEQ ID NO: 159). Although
GGGCAA and TGGCAA sequences constitute the preferential THAP domain DNA binding sites,
GGGCAT, GGGCAG and TGGCAG sequences are also DNA target sequences recognized by the
THAP domain.
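The DR-5 and ER-11 arrangements described above can be searched for in a DNA sequence with regular expressions. The following Python sketch is illustrative only: the function names are hypothetical, only the two preferential half-sites (GGGCAA and TGGCAA) are used, only the given strand is scanned, and the ER-11 pattern is written as the reverse complement of a half-site, an 11-nucleotide spacer, then a half-site, which is an interpretation of the consensus TTGCCAnnnnnnnnnnnGGGCAA.

```python
import re

PREFERRED_HALF_SITES = ("GGGCAA", "TGGCAA")
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


_HALF = "(?:" + "|".join(PREFERRED_HALF_SITES) + ")"
_HALF_RC = "(?:" + "|".join(revcomp(s) for s in PREFERRED_HALF_SITES) + ")"

# Lookahead groups allow overlapping occurrences to be reported.
DR5 = re.compile("(?=(" + _HALF + "[ACGT]{5}" + _HALF + "))")
ER11 = re.compile("(?=(" + _HALF_RC + "[ACGT]{11}" + _HALF + "))")


def find_motifs(seq):
    """Return 0-based start positions of DR-5 and ER-11 motifs in seq."""
    seq = seq.upper()
    return {
        "DR-5": [m.start() for m in DR5.finditer(seq)],
        "ER-11": [m.start() for m in ER11.finditer(seq)],
    }


if __name__ == "__main__":
    dr5_example = "AAGGGCAACCCCCTGGCAATT"
    er11_example = "TTGCCA" + "A" * 11 + "GGGCAA"
    print(find_motifs(dr5_example))
    print(find_motifs(er11_example))
```

To include the weaker sites named in the text (GGGCAT, GGGCAG and TGGCAG), they could be appended to `PREFERRED_HALF_SITES`; scanning the opposite strand would require running the same search on `revcomp(seq)`.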

EXAMPLE 28B 
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The THAP domain is a zinc-dependent sequence specific DNA-binding domain 
To confirm that the THAP domain is a novel sequence specific DNA-binding domain, 
electrophoretic mobility shift assays (EMSA) were carried out using wild-type or mutant THAP 
domain responsive elements (THRE) determined by SELEX (see Example 28 and Figures 18 and 
24). Double-stranded probes used in EMSA experiments were purified on 12% polyacrylamide
gels, 32P end-labeled with T4 polynucleotide kinase and quantified by Cerenkov counting. About
20 ng of purified THAP domain from human THAP1 (prepared as described in Example 28) was
incubated with 30000 cpm of the appropriate probe (about 2 ng). Binding reactions were carried
out for 10 minutes at room temperature in 20 µl binding buffer (20 mM Tris-HCl pH 7.5, 100 mM
KCl, 0.1% NP-40, 100 µg/ml BSA, 2.5 mM DTT, 5% glycerol, 200 ng poly(dI-dC)).
Electrophoresis was performed on 8% (29:1) polyacrylamide gels containing 5% glycerol. Gels
were run in 0.25X TBE at 150 V and 4°C, dried and exposed on a phosphorimager screen (Molecular
Dynamics). Sequences of wild-type and mutant THRE oligonucleotides used in EMSA
experiments were as follows (mutations are indicated in bold): wild-type probe 3,
5'-AGCAAGTAAGGGCAAACTACTTCAT-3' (SEQ ID NO: 313); mutant probe 3mut1,
5'-AGCAAGTAATTTCAAACTACTTCAT-3' (SEQ ID NO: 314); mutant probe 3mut3,
5'-AGCAAGTAAGGTCAAACTACTTCAT-3' (SEQ ID NO: 319); mutant probe 3mut4,
5'-AGCAAGTAAGTGCAAACTACTTCAT-3' (SEQ ID NO: 320); mutant probe 3mut14,
5'-AGCAAGTAAGGGCCAACTACTTCAT-3' (SEQ ID NO: 321); mutant probe 3mut5,
5'-AGCAAGTAAGGGAAAACTACTTCAT-3' (SEQ ID NO: 322).
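
Because the bold highlighting of the mutated bases does not survive in plain text, the following Python sketch recovers the mutated positions by comparing each mutant probe with wild-type probe 3. It assumes the probe sequences are read without internal spaces and that the probe names are 3mut1, 3mut3, 3mut4, 3mut14 and 3mut5; the function name `mutated_positions` is hypothetical.

```python
WILD_TYPE = "AGCAAGTAAGGGCAAACTACTTCAT"  # probe 3 (SEQ ID NO: 313)

MUTANTS = {
    "3mut1": "AGCAAGTAATTTCAAACTACTTCAT",   # SEQ ID NO: 314
    "3mut3": "AGCAAGTAAGGTCAAACTACTTCAT",   # SEQ ID NO: 319
    "3mut4": "AGCAAGTAAGTGCAAACTACTTCAT",   # SEQ ID NO: 320
    "3mut14": "AGCAAGTAAGGGCCAACTACTTCAT",  # SEQ ID NO: 321
    "3mut5": "AGCAAGTAAGGGAAAACTACTTCAT",   # SEQ ID NO: 322
}


def mutated_positions(wild_type, mutant):
    """Return (1-based position, wild-type base, mutant base) for each mismatch."""
    if len(wild_type) != len(mutant):
        raise ValueError("probes must have the same length")
    return [
        (i + 1, wt_base, mut_base)
        for i, (wt_base, mut_base) in enumerate(zip(wild_type, mutant))
        if wt_base != mut_base
    ]


if __name__ == "__main__":
    for name, seq in MUTANTS.items():
        print(name, mutated_positions(WILD_TYPE, seq))
```

Under these assumptions every substitution falls within positions 10 to 14 of the probe, that is, inside the GGGCAA core that spans positions 10 to 15.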

These EMSA assays revealed that the THAP domain recognizes wild-type (probe 3) but not
mutant THRE oligonucleotides (probes 3mut1, 3mut3, 3mut4, 3mut14, 3mut5) (Figure 25A). For
competition experiments, 50-, 150-, and 250-fold molar excess of unlabelled wild-type (THRE
competitor, probe 3) or mutant (non-specific competitor, probe 3mut1) THRE oligonucleotides
were added to the reaction mixture just before the addition of the probe. This analysis revealed that
the DNA-binding activity of the THAP domain is abrogated by increasing amounts of the THRE
competitor but not affected by the non-specific competitor (Figure 25B). Together, these
experiments demonstrated that the THAP domain is a novel sequence-specific DNA-binding
domain.

Since the THAP domain is characterized by a C2-CH conserved motif that may function as
a Zn-binding site, we then determined whether DNA-binding activity of the THAP domain is
Zn-dependent. For metal chelation experiments, EDTA (5 mM or 50 mM) or 1,10-phenanthroline
(Sigma, 1 mM or 5 mM in methanol vehicle) was preincubated with the THAP domain in binding
buffer for 20 minutes at room temperature, before the EMSA assay (Figure 26A). To reconstitute
DNA-binding activity of the THAP domain in the presence of 1,10-phenanthroline (+ Phe, 5 mM),
Zn or Mg, as indicated, was added at 100 or 500 µM final concentration in binding buffer (Figure
26B). Reactions were allowed to equilibrate for 10 minutes at room temperature before the
addition of the EMSA THRE probe (probe 3). These analyses revealed that the DNA-binding
activity of the THAP domain is abrogated by the metal chelator 1,10-phenanthroline (Figure 26A)
but specifically restored by the addition of zinc (Figure 26B), indicating that the THAP domain is a
novel zinc-dependent sequence-specific DNA-binding domain.

EXAMPLE 29 
High throughput in vitro assay to identify inhibitors of
THAP-family polypeptide or THAP-family interactions with nonspecific DNA targets
High throughput assays for the detection and quantification of THAP1-nonspecific DNA
binding are carried out using a scintillation proximity assay. Materials are available from Amersham
(Piscataway, NJ) and assays can be carried out according to Gal S. et al. (6th Ann. Conf. Soc.
Biomol. Screening, 6-9 Sept 2000, Vancouver, B.C.).

Random double stranded DNA probes are prepared and labeled using [3H]TTP and
terminal transferase to a suitable specific activity (e.g. approx. 420 Ci/mmol). THAP1 protein or a
portion thereof is prepared and the quantity of THAP1 protein or a portion thereof is determined via
ELISA. For assay development purposes, electrophoretic mobility shift assays (EMSA) can be
carried out to select suitable assay parameters. For the high throughput assay, 3H-labeled DNA,
anti-THAP1 monoclonal antibody and THAP1 in binding buffer (Hepes, pH 7.5; EDTA; DTT;
10 mM ammonium sulfate; KCl and Tween-20) are combined. The assay is configured in a
standard 96-well plate and incubated at room temperature for 5 to 30 minutes, followed by the
addition of 0.5 to 2 mg of PVT protein A SPA beads in 50-100 µl binding buffer. The radioactivity
bound to the SPA beads is measured using a TopCount™ Microplate Counter (Packard Biosciences,
Meriden, CT).

EXAMPLE 30 
High throughput in vitro assay to identify inhibitors of
THAP-family polypeptide or THAP-family interactions with specific DNA targets

High throughput assays for the detection and quantification of THAP1 specific DNA
binding are carried out using a scintillation proximity assay. Materials are available from Amersham
(Piscataway, NJ) and assays can be carried out according to Gal S. et al. (6th Ann. Conf. Soc.
Biomol. Screening, 6-9 Sept 2000, Vancouver, B.C.).
THAP1-specific double stranded DNA probes corresponding to THAP1 DNA binding
sequences obtained according to Example 28 are prepared. The probes are labeled using [3H]TTP
and terminal transferase to a suitable specific activity (e.g. approx. 420 Ci/mmol). THAP1 protein or
a portion thereof is prepared and the quantity of THAP1 protein or a portion thereof is determined
via ELISA. For assay development purposes, electrophoretic mobility shift assays (EMSA) can be
carried out to select suitable assay parameters. For the high throughput assay, 3H-labeled DNA,
anti-THAP1 monoclonal antibody, 1 µg non-specific DNA (double or single stranded poly-dAdT)
and THAP1 protein or a portion thereof in binding buffer (Hepes, pH 7.5; EDTA; DTT; 10 mM
ammonium sulfate; KCl and Tween-20) are combined. The assay is configured in a standard
96-well plate and incubated at room temperature for 5 to 30 minutes, followed by the addition of 0.5 to
2 mg of PVT protein A SPA beads in 50-100 µl binding buffer. The radioactivity bound to the SPA
beads is measured using a TopCount™ Microplate Counter (Packard Biosciences, Meriden, CT).

EXAMPLE 31 
Preparation of antibody compositions 
Substantially pure THAP1 protein or a portion thereof is obtained. The concentration of 
protein in the final preparation is adjusted, for example, by concentration on an Amicon filter
device, to the level of a few micrograms per ml. Monoclonal or polyclonal antibodies to the protein
can then be prepared as follows:

Monoclonal Antibody Production by Hybridoma Fusion

Monoclonal antibody to epitopes in the THAP1 protein or a portion thereof can be prepared from
murine hybridomas according to the classical method of Kohler and Milstein (Nature, 256: 495,
1975) or derivative methods thereof (see Harlow and Lane, Antibodies A Laboratory Manual, Cold
Spring Harbor Laboratory, pp. 53-242, 1988).

Briefly, a mouse is repetitively inoculated with a few micrograms of the THAP1 protein or
a portion thereof over a period of a few weeks. The mouse is then sacrificed, and the antibody
producing cells of the spleen isolated. The spleen cells are fused by means of polyethylene glycol
with mouse myeloma cells, and the excess unfused cells destroyed by growth of the system on
selective media comprising aminopterin (HAT media). The successfully fused cells are diluted and
aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is
continued. Antibody-producing clones are identified by detection of antibody in the supernatant
fluid of the wells by immunoassay procedures, such as ELISA, as originally described by Engvall,
E., Meth. Enzymol. 70: 419 (1980). Selected positive clones can be expanded and their monoclonal
antibody product harvested for use. Detailed procedures for monoclonal antibody production are
described in Davis, L. et al. Basic Methods in Molecular Biology, Elsevier, New York,
Section 21-2.

Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes in the THAP1 protein
or a portion thereof can be prepared by immunizing a suitable non-human animal with the THAP1
protein or a portion thereof, which can be unmodified or modified to enhance immunogenicity. A
suitable non-human animal, preferably a non-human mammal, is selected. For example, the animal
may be a mouse, rat, rabbit, goat, or horse. Alternatively, a crude protein preparation which has
been enriched for THAP1 or a portion thereof can be used to generate antibodies. Such proteins,
fragments or preparations are introduced into the non-human mammal in the presence of an
appropriate adjuvant (e.g. aluminum hydroxide, RIBI, etc.) which is known in the art. In addition,
the protein, fragment or preparation can be pretreated with an agent which will increase
antigenicity; such agents are known in the art and include, for example, methylated bovine serum
albumin (mBSA), bovine serum albumin (BSA), Hepatitis B surface antigen, and keyhole limpet
hemocyanin (KLH). Serum from the immunized animal is collected, treated and tested according to
known procedures. If the serum contains polyclonal antibodies to undesired epitopes, the
polyclonal antibodies can be purified by immunoaffinity chromatography.

Effective polyclonal antibody production is affected by many factors related both to the 
antigen and the host species. Also, host animals vary in response to site of inoculations and dose,
with both inadequate and excessive doses of antigen resulting in low titer antisera. Small doses (ng
level) of antigen administered at multiple intradermal sites appear to be most reliable. Techniques
for producing and processing polyclonal antisera are known in the art, see for example, Mayer and
Walker (1987). An effective immunization protocol for rabbits can be found in Vaitukaitis, J. et al.
J. Clin. Endocrinol. Metab. 33: 988-991 (1971). Booster injections can be given at regular
intervals, and antiserum harvested when antibody titer thereof, as determined semi-quantitatively,
for example, by double immunodiffusion in agar against known concentrations of the antigen,
begins to fall. See, for example, Ouchterlony, O. et al., Chap. 19 in: Handbook of Experimental
Immunology, D. Weir (ed.), Blackwell (1973). Plateau concentration of antibody is usually in the
range of 0.1 to 0.2 mg/ml of serum (about 12 µM). Affinity of the antisera for the antigen is
determined by preparing competitive binding curves, as described, for example, by Fisher, D., 
Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman, Eds.) Amer. Soc. For 
Microbiol., Washington, D. C. (1980). 

Antibody preparations prepared according to either the monoclonal or the polyclonal 
protocol are useful in quantitative immunoassays which determine concentrations of antigen- 
bearing substances in biological samples; or they are also used semi-quantitatively or qualitatively 
to identify the presence of antigen in a biological sample. The antibodies may also be used in 
therapeutic compositions for killing cells expressing the protein or reducing the levels of the protein 
in the body. 

EXAMPLE 32 
Two-Hybrid THAP1/Chemokine Interaction Assay
Two-hybrid interaction between THAP1 and chemokines CCL21, CCL19, CXCL9,
CXCL10, CXCL11 or cytokine IFNγ was tested by cotransformation of AH109 with
pGADT7-THAP1 and pGBKT7-CCL21, -CCL19, -CXCL9, -CXCL10, -CXCL11 and -IFNγ plasmids and
selection of transformants by His and Ade double auxotrophy according to manufacturer's
instructions (MATCHMAKER two-hybrid system 3, Clontech). pGBKT7-chemokine vectors were
generated using cDNAs encoding the mature forms of human chemokines CCL21 (see Example 15)
(SLC polypeptide SEQ ID NO: 271, SLC cDNA SEQ ID NO: 272); CCL19 (amino acids 22-98 of
GenBank Accession No. NM_006274) (CCL19 polypeptide SEQ ID NO: 273, CCL19 cDNA SEQ
ID NO: 274); CXCL9 (amino acids 23-125 of GenBank Accession No. NM_002416) (CXCL9
polypeptide SEQ ID NO: 275, CXCL9 cDNA SEQ ID NO: 276); CXCL10 (amino acids 22-98 of
GenBank Accession No. NM_001565) (CXCL10 polypeptide SEQ ID NO: 277, CXCL10 cDNA
SEQ ID NO: 278); CXCL11 (amino acids 22-94 of GenBank Accession No. NM_005409)
(CXCL11 polypeptide SEQ ID NO: 323, CXCL11 cDNA SEQ ID NO: 324) or cytokine
IFNγ (amino acids 21-166 of GenBank Accession No. NM_000619) (IFNγ polypeptide SEQ ID
NO: 279, IFNγ cDNA SEQ ID NO: 280), amplified by PCR, respectively, from Image clones No.
1707527 (hCCL19) with primers CCL19-1
(5'-GCGGAATCATGGGCACCAATGATGCTGAAGACTG-3') (SEQ ID NO: 281) and CCL19-2
(5'-GCGGGATCCTTAACTGCTGCGGCGCTTCATCTTG-3') (SEQ ID NO: 282), No. 5228247
(hCXCL9) with primers CXCL9-1
(5'-GCCGAATTCACCCCAGTAGTGAGAAAGGGTCGCTG-3') (SEQ ID NO: 283) and CXCL9-2
(5'-CGCGGATCCTTATGTAGTCTTCTTTTGACGAGAACGTTG-3') (SEQ ID NO: 284), No.
4274617 (hCXCL10) with primers CXCL10-1
(5'-GCCGAATTCGTACCTCTCTCTAGAACCGTACGCTGT-3') (SEQ ID NO: 285) and CXCL10-2
(5'-GCGGGATCCTTAAGGAGATCTTTTAGACATTTCCTTGCTA-3') (SEQ ID NO: 286), No.
3934139 (hCXCL11) with primers CXCL11-1
(5'-GGGGAATTCTTCCCCATGTTCAAAAGAGGAC-3') (SEQ ID NO: 325) and CXCL11-2
(5'-GGGGATCCTTAAAAATTCTTTCTTTCAAC-3') (SEQ ID NO: 326), No. 2403734 (hIFNγ) with
primers IFN-1 (5'-GCGGAATCATGTGTTACTGCCAGGACCCATATG-3') (SEQ ID NO: 287)
and IFN-2 (5'-GCGGGATCCTTACTGGGATGCTCTTCGACCTTG-3') (SEQ ID NO: 288). The
PCR products were digested with EcoRI and BamHI, and cloned between EcoRI and BamHI
cloning sites of vector pGBKT7 (Clontech). Positive two-hybrid interaction of THAP1 was
observed with chemokines CCL21, CCL19, CXCL9 and CXCL11 while chemokine CXCL10 gave
an intermediate result (+/-) in this two-hybrid assay (see Figure 19). The negative cytokine control,
IFNγ, did not have a positive interaction.

It will be appreciated that the above-described methods can be used to determine whether 
any particular chemokine binds to any THAP-family polypeptide. For example, cDNAs encoding 
THAP-family members THAP1 to THAP11 as well as THAP0 from humans and other species can
be cloned into a first component vector of a two hybrid system. cDNAs encoding chemokines can
be cloned into a second component vector of a two hybrid system. The two vectors can be
transformed into an appropriate yeast strain, wherein the polypeptides are expressed and tested for
interaction. For example, chemokine CCL5 (polypeptide SEQ ID NO: 289, cDNA SEQ ID NO:
290) can be tested for interaction with full-length THAP1 or particular portions of THAP1 such as
a nested deletion series. Chemokines which can be tested for interaction with THAP-family
proteins include, but are not limited to, XCL1, XCL2, CCL1, CCL2, CCL3, CCL3L1, SCYA3L2,
CCL4, CCL4L, CCL5, CCL6, CCL7, CCL8, SCYA9, SCYA10, CCL11, SCYA12, CCL13,
CCL14, CCL15, CCL16, CCL17, CCL18, CCL19, CCL20, CCL21, CCL22, CCL23, CCL24,
CCL25, CCL26, CCL27, CCL28, clone 391, CARP CC-1, CCL1, CK-1, regakine-1, K203,
CXCL1, CXCL1P, CXCL2, CXCL3, PF4, PF4V1, CXCL5, CXCL6, PPBP, SPBPBP, IL8,
CXCL9, CXCL10, CXCL11, CXCL12, CXCL14, CXCL15, CXCL16, NAP-4, LFCA-1, Scyba,
JSC, VHSV-induced protein, CX3CL1 and fCLl.

EXAMPLE 33 
In Vitro THAP1/Chemokine Interaction Assay
To confirm the interaction observed in the yeast two-hybrid system, we performed in vitro GST
pull-down assays. THAP1, expressed as a GST-tagged fusion protein and immobilized on
glutathione sepharose, was incubated with radiolabeled chemokines that were translated in vitro.

To generate the GST-THAP1 expression vector, the full-length coding region of THAP1 (a
nucleic acid encoding amino acids 1-213 of THAP1) was amplified by PCR from HEVEC cDNA
with primers 2HMR8 (5'-CGCGGATCCGTGCAGTCCTGCTCCGCCTACGGC-3') (SEQ ID NO:
291) and 2HMR1 / (5'-CCGAATTCTTATGCTGGTACTTCAACTATTTCAAAGTAG-3') (SEQ
ID NO: 292), digested with BamHI and EcoRI, and cloned in frame downstream of the Glutathione
S-Transferase ORF, between the BamHI and EcoRI sites of the pGEX-2T prokaryotic expression
vector (Amersham Pharmacia Biotech, Saclay, France). The GST-THAP1 fusion protein encoded
by plasmid pGEX-2T-THAP1 and the control GST protein encoded by plasmid pGEX-2T were
then expressed in E. coli DH5α and purified by affinity chromatography with glutathione
sepharose according to supplier's instructions (Amersham Pharmacia Biotech). The yield of
proteins used in GST pull-down assays was determined by SDS-Polyacrylamide Gel
Electrophoresis (PAGE) and Coomassie blue staining analysis.

In vitro-translated chemokines were generated with the TNT-coupled reticulocyte lysate
system (Promega, Madison, WI, USA) using as templates pGBKT7-CCL21, -CCL19, -CXCL9,
-CXCL10 and -CXCL11 chemokine vectors (see Example 32) or pCMV-SPORT6-CCL5 plasmid
(Image clone No. 4185200). In vitro-translated IFNγ cytokine was generated similarly using as
template plasmid pGBKT7-IFNγ. A 25 µl volume of radiolabelled chemokine was incubated with
immobilized GST-THAP1 or GST proteins overnight at 4°C, in the following binding buffer: 10
mM NaPO4 pH 8.0, 140 mM NaCl, 3 mM MgCl2, 1 mM dithiothreitol (DTT), 0.05% NP40, and
0.2 mM phenylmethyl sulphonyl fluoride (PMSF), 1 mM Na vanadate, 50 mM
β-glycerophosphate, 25 µg/ml chymotrypsine, 5 µg/ml aprotinin, and 10 ng/ml leupeptin. Beads were
then washed 5 times in 1 ml binding buffer. Bound proteins were eluted with 2X Laemmli
SDS-PAGE sample buffer, fractionated by 10% SDS-PAGE and visualized by fluorography using
Amplify (Amersham Pharmacia Biotech). GST-THAP1 specifically bound to chemokines CCL21,
CCL19, CCL5, CXCL9, CXCL10 and CXCL11 but not cytokine IFNγ (Figures 19 and 20). Figure
19 shows that CCL21, CCL19, CCL5, CXCL9 and CXCL11 all strongly bound to immobilized
GST-THAP1 (indicated by +++ in the column labelled "In vitro binding to GST-THAP1").
CXCL10 also bound to immobilized GST-THAP1 (indicated by ++ in the column labelled "In vitro
binding to GST-THAP1"). The cytokine IFNγ did not bind to immobilized GST-THAP1 (indicated
by - in the column labelled "In vitro binding to GST-THAP1"). Chemokines CCL21, CCL19,
CCL5, CXCL9, CXCL10 and CXCL11 failed to interact with GST beads (negative control).
Figure 20a shows that equivalent amounts of chemokine or cytokine were tested in the in vitro
GST-THAP1 binding assays. Figure 20b shows that neither the cytokine, IFNγ, nor any of the
chemokines bound to immobilized GST alone. Figure 20c shows that chemokines CXCL10,
CXCL9 and CCL19, but not the cytokine IFNγ, bound to immobilized GST-THAP1 fusions.

It will be appreciated that the above-described methods can be used to determine whether 
any particular chemokine binds to any THAP-family polypeptide. For example, cDNAs encoding
THAP-family members THAP1 to THAP11 as well as THAP0 from humans and other species can
be cloned and expressed as a GST fusion protein and immobilized to a solid support. cDNAs 
encoding chemokines can be translated in vitro and the resulting proteins incubated with the 
immobilized GST-THAP family fusions. Furthermore, a nested deletion series and/or point 
mutants of the THAP-family polypeptides can also be generated as GST-fusions and tested to
determine the exact location of the chemokine binding domain for any THAP-family polypeptide
with respect to any chemokine. Chemokines which can be tested for binding to THAP-family 
proteins include, but are not limited to, XCL1, XCL2, CCL1, CCL2, CCL3, CCL3L1, SCYA3L2, 
CCL4, CCL4L, CCL5, CCL6, CCL7, CCL8, SCYA9, SCYA10, CCL11, SCYA12, CCL13, 
CCL14, CCL15, CCL16, CCL17, CCL18, CCL19, CCL20, CCL21, CCL22, CCL23, CCL24,
CCL25, CCL26, CCL27, CCL28, clone 391, CARP CC-1, CCL1, CK-1, regakine-1, K203,
CXCL1, CXCL1P, CXCL2, CXCL3, PF4, PF4V1, CXCL5, CXCL6, PPBP, SPBPBP, IL8, 
CXCL9, CXCL10, CXCL11, CXCL12, CXCL14, CXCL15, CXCL16, NAP-4, LFCA-1, Scyba, 
JSC, VHSV-induced protein, CX3CL1 and fCLl. 

EXAMPLE 34 

Chemotaxis Bioassay: Inhibition of CCL21/CCL19-Induced
Chemotaxis by THAP1 Oligomeric Forms
To demonstrate inhibition of CCL21/CCL19-induced chemotaxis by THAP1 oligomers, 
fresh lymphocytes and a human cell line, each of which expresses the CCL21/CCL19 receptor 
CCR7, are assayed for a chemotactic response to chemokines in the presence or absence of 
oligomeric THAP1. Lymphocytes are purified from fresh heparinized human blood or mouse
lymph nodes and grown in RPMI 1640 glutamax I (Invitrogen GIBCO). HuT78 cells are obtained
from the American Type Culture Collection (Accession Number TIB-161) and grown in
Iscove's modified Dulbecco's medium with 4 mM L-Glutamine adjusted to contain 1.5 g/l sodium
bicarbonate (Invitrogen GIBCO). Recombinant CCL21 and CCL19 human chemokines are
obtained from commercial suppliers (for example, R&D or Chemicon).

Chemokine CCL21 or CCL19 is diluted in the appropriate culture medium without serum
at 20 ng/ml and 75 µl of this solution is transferred into the appropriate wells of a 96-well microplate.
Recombinant human THAP1 oligomers (GST-THAP1 or Fc-THAP1 chimera) are serially diluted
starting at 500 nM and 25 µl of the different dilutions are transferred into the appropriate wells.
Transwells are set carefully on each well and 100 µl of a cell suspension at 8 x 10^6 cells/ml is added in
the upper chamber. Following a 4-hour incubation at 37°C and 5% CO2, the cells which have
migrated to the lower chamber are quantified using the Celltiter Glo system (Promega). A staining
of the filter is also performed to verify that no cells adhered to the lower side of the filter after the
migration. The degree of CCL21/CCL19-induced chemotaxis inhibition by THAP1 oligomeric
forms is calculated by comparing the number of cells which have migrated in the presence or
absence of the THAP1 oligomeric forms.
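
The comparison described above can be expressed numerically. The Python sketch below is one plausible way to do so, not the procedure of this example: it assumes that inhibition is reported as a percentage of the migration seen without THAP1, and that the serial dilution uses a constant factor (the text states only the 500 nM starting concentration, so the factor of 2 is purely illustrative). Both function names are hypothetical.

```python
def serial_dilution(start_nm=500.0, factor=2.0, steps=8):
    """Concentrations (nM) of a serial dilution beginning at start_nm.

    The dilution factor and number of steps are illustrative assumptions.
    """
    if factor <= 1 or steps < 1:
        raise ValueError("factor must exceed 1 and steps must be at least 1")
    return [start_nm / factor**i for i in range(steps)]


def percent_inhibition(migrated_with_thap1, migrated_without_thap1):
    """Percent reduction in migrated cells relative to the no-THAP1 wells."""
    if migrated_without_thap1 <= 0:
        raise ValueError("control migration must be positive")
    return 100.0 * (1.0 - migrated_with_thap1 / migrated_without_thap1)


if __name__ == "__main__":
    print(serial_dilution(steps=4))
    print(percent_inhibition(250, 1000))
```

A background correction (cells migrating without chemokine) could be subtracted from both counts before the ratio is taken; it is omitted here because the text does not describe one.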

EXAMPLE 35

Inhibition of CCL21/CCL19-Induced Lymphocyte
Adhesion to Endothelial Cells In Vivo by THAP1 Oligomeric Forms
The capacity of THAP1 oligomeric forms to block the activity of CCL21/CCL19 in vivo,
including CCL21/CCL19-induced lymphocyte adhesion to endothelial cells, is assessed by
measuring the 'rolling/sticking phenotype' of lymphocytes in mouse lymph node HEVs (High
endothelial venules) using intravital microscopy (microscopy on live animals) as described (von
Andrian (1996) Microcirculation (3):287-300; and von Andrian UH, M'Rini C. (1998) Cell Adhes
Commun. 6(2-3):85-96). The rolling/sticking assay is performed as follows. In brief, the different
steps of lymphocyte migration through HEVs (tethering, rolling, sticking, transendothelial
migration) are analyzed by intravital microscopy in mice treated with recombinant human THAP1
oligomers (GST-THAP1 or Fc-THAP1 chimera). For observation of lymph node HEV vessels
(and adhesion processes occurring in these vessels) by intravital microscopy, a microsurgical
exposure of the subiliac (superficial inguinal) lymph node is made on an anaesthetized mouse.
Briefly, BALB/c mice (Charles River, St Germain sur l'Arbresle, France) are anesthetized by
intraperitoneal injection of 5 mg/mL ketamine and 1 mg/mL xylazine (10 mL/kg) and surgically
prepared under a stereomicroscope (Leica Microsystems SA, Rueil-Malmaison, France) to allow
exposure of the node vessels. A catheter is inserted in the contralateral femoral artery to permit
subsequent retrograde injections of fluorescent cell suspensions or recombinant THAP1 oligomeric
forms (GST-THAP1 or Fc-THAP1, 10-100 µg in 250 µl saline injected and allowed to bind for
5-30 min before injection of fluorescent cell suspensions). The mouse is then transferred to an
intravital microscope (INM 100; Leica Microsystems SA). Body temperature is maintained at 37°C
using a padding heater. Lymph node vessels and fluorescent cells are observed through 10x or 20x
water immersion objectives (Leica Microsystems SA) by transillumination or epifluorescence
illumination. Transilluminated and fluorescent events are visualized using a silicon-intensified target
camera (Hamamatsu Photonics, Massy, France) and recorded for later off-line analysis (DSR-11
Sony, IEC-ASV, Toulouse). Lymphocyte behavior in lymph node vessels is analyzed off-line as
previously described (von Andrian (1996) Microcirculation (3):287-300; and von Andrian UH,
M'Rini C. (1998) Cell Adhes Commun. 6(2-3):85-96). Briefly, the rolling fraction is determined in
every visible lymph node HEV as the percentage of lymphocytes interacting with the endothelial
lining over the total cell number entering the venule during an observation period. Rolling cells that
become subsequently adherent are included in the rolling fraction. The sticking fraction is
determined as the percentage of rollers that become firmly adherent in HEVs for more than
20 seconds. Only vessels with more than 10 rolling cells are included. The degree of inhibition of
CCL21/CCL19-induced lymphocyte adhesion by THAP1 oligomers in vivo is calculated by
comparing the number of lymphocytes sticking to endothelial cells (sticking fractions) in the
presence or absence of the THAP1 oligomeric forms.
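
The rolling and sticking fractions defined above reduce to simple ratios per venule. The following Python sketch shows one way to compute them from cell counts; the `VenuleCounts` record, the function names, and the choice to average the sticking fraction across qualifying vessels are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class VenuleCounts:
    """Cell counts for one HEV over an observation period (hypothetical record)."""
    total_cells: int     # all cells entering the venule
    rolling_cells: int   # cells interacting with the endothelium, including
                         # those that subsequently become adherent
    sticking_cells: int  # rollers firmly adherent for more than 20 seconds


def rolling_fraction(v):
    """Percentage of entering cells that interact with the endothelial lining."""
    return 100.0 * v.rolling_cells / v.total_cells


def sticking_fraction(v):
    """Percentage of rollers that become firmly adherent."""
    return 100.0 * v.sticking_cells / v.rolling_cells


def mean_sticking_fraction(venules, min_rollers=10):
    """Average sticking fraction over vessels with more than min_rollers rollers.

    Returns None when no vessel qualifies.
    """
    eligible = [v for v in venules if v.rolling_cells > min_rollers]
    if not eligible:
        return None
    return sum(sticking_fraction(v) for v in eligible) / len(eligible)
```

For example, a venule with 200 entering cells, 50 rollers and 10 firmly adherent cells has a rolling fraction of 25% and a sticking fraction of 20%; a venule with only 8 rollers would be excluded from the average.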

EXAMPLE 36

Use of THAP1 Oligomeric Forms to Antagonize 
Chemokines in a Mouse Model of Rheumatoid Arthritis 
This experiment is designed to test the effect of antagonizing chemokines with THAP1
oligomeric forms in a mouse model of rheumatoid arthritis, the well-known collagen-induced
arthritis model. In each experiment, male DBA/1 mice are immunized with collagen on day -21 and
are boosted on day 0. Mice are treated daily from days 0-14 with IP injections of THAP1
oligomeric forms (GST-THAP1 or THAP1-Fc chimera) at 150, 50, 15, and 5 µg/day, and compared
to mice treated with control proteins (GST or human IgG1), at 150 µg/day (n=15/group in each
experiment). The incidence and severity of arthritis are monitored in a blind study. Each paw is
assigned a score from 0 to 4 as follows: 0=normal; 1=swelling in 1 to 3 digits; 2=mild swelling in
ankles, forepaws, or more than 3 digits; 3=moderate swelling in multiple joints; 4=severe swelling
with loss of function. The paw scores are totaled for a cumulative score per mouse. The cumulative
scores are then totaled for mice in each group for a mean clinical score. Groups of 15 mice are treated with
the indicated doses of THAP1-Fc or with 150 µg/day of human IgG1. The capacity of THAP1
oligomeric forms (GST-THAP1 or THAP1-Fc chimera) to reduce the disease incidence and
severity of arthritis is determined by comparison with the control group.
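
The scoring scheme above can be summarized in a few lines of Python. This is a minimal sketch under stated assumptions: four paws are scored per mouse, the mean clinical score is the group total divided by the number of mice, and incidence is taken as the fraction of mice with a cumulative score above zero (the text does not define incidence explicitly). All function names are hypothetical.

```python
def mouse_score(paw_scores):
    """Cumulative score for one mouse: sum of four paw scores, each 0 to 4."""
    if len(paw_scores) != 4:
        raise ValueError("expected one score per paw (4 paws)")
    if any(s not in (0, 1, 2, 3, 4) for s in paw_scores):
        raise ValueError("paw scores must be integers from 0 to 4")
    return sum(paw_scores)


def mean_clinical_score(group):
    """Mean of the cumulative scores across all mice in a group."""
    return sum(mouse_score(paws) for paws in group) / len(group)


def incidence(group):
    """Fraction of mice with any arthritis (assumed: cumulative score > 0)."""
    return sum(1 for paws in group if mouse_score(paws) > 0) / len(group)


if __name__ == "__main__":
    example_group = [[0, 0, 0, 0], [1, 1, 1, 1], [4, 4, 0, 0]]
    print(mean_clinical_score(example_group), incidence(example_group))
```

With this scheme the maximum cumulative score per mouse is 16, and treated groups can be compared with the control group on both the mean clinical score and the incidence.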

EXAMPLE 37 
Use of THAP1 Oligomeric Forms to Antagonize 
Chemokines in a Mouse Model of Inflammatory Bowel Disease

The effect of blocking chemokines with THAP1-Fc chimera is analyzed in an
experimentally induced model of Inflammatory Bowel Disease (IBD). One of the most widely used
models of IBD is the DSS model (dextran sulphate salt). In this model, dextran sulphate salt (M.W.
typically about 40,000 but molecular weights from 40,000 to 500,000 can be used) is given to mice
(or other small mammals) in their drinking water at 2% to 8%. In some studies, C57BL/6 mice are
given 2% DSS from day 0 to day 7 (D0 - D7), wherein the number of mice per group equals four
(n=4). The study groups are divided as follows: No DSS + human IgG1 (250 µg/day/mouse D0 -
D7); 2% DSS + THAP1-Fc (100 µg/day/mouse D0 - D7); 2% DSS + THAP1-Fc (250
µg/day/mouse D0 - D7); 2% DSS + THAP1-Fc (500 µg/day/mouse D0 - D7); 2% DSS + human
IgG1 (250 µg/day/mouse D0 - D7). Mice are weighed daily. Weight loss is a clinical sign of the
disease. Tissues are harvested at day 8 (D8). Histopathology is performed on the following tissues:
small intestine, large intestine and mesenteric lymph nodes (MLN). The capacity of the THAP1-Fc
chimera to attenuate some of the weight loss associated with DSS-induced colitis, and to reduce
inflammation in the large intestine, is determined by comparing mice treated with THAP1-Fc to
mice treated with control human IgG1.

EXAMPLE 38 
THAP1 expression is linked to cell proliferation 

To investigate the subcellular localization of endogenous THAP1, human primary
endothelial cells from umbilical vein (HUVEC, PromoCell, Heidelberg, Germany) were analyzed
by indirect immunofluorescence with anti-THAP1 specific antibodies. Anti-THAP1 antibodies
(anti-THAP antisera) were generated in rabbits against a peptide derived from the THAP domain of
human THAP1, AVRRKNFKPTKYSSIC (amino acids 39-54 in SEQ ID NO: 3), and affinity-purified
against the corresponding peptide (Custom polyclonal antibody production, Eurogentec).

Endothelial cells were allowed to grow for 24 h to 48 h on coverslips. Cells were fixed in
methanol for 5 min at -20°C, followed by incubation in cold acetone at -20°C for 30 sec. Cells
were then blocked with PBS-BSA (PBS with 1% bovine serum albumin) for 10 min and then
incubated 2 hr at room temperature with the rabbit polyclonal anti-THAP antibodies diluted in
PBS-BSA (1/40). Cells were then washed three times 5 min at room temperature in PBS-BSA, and
incubated for 1 hr with Cy3 (red fluorescence)-conjugated goat anti-rabbit IgG (1/1000, Amersham
Pharmacia Biotech) secondary antibodies, diluted in PBS-BSA. Nuclei were revealed by staining
with DAPI (4,6-diamidino-2-phenylindole; 0.2 µg/ml, 10 min at room temperature). After
extensive washing in PBS, samples were air dried and mounted in Mowiol. Images were collected
on a Leica confocal laser scanning microscope. To verify the specificity of the immunostaining, in
some experiments, the anti-THAP antibodies were pre-incubated with 2.5 µg/ml of the THAP
antigenic peptide AVRRKNFKPTKYSSIC (SEQ ID NO: 293) or 2.5 µg/ml of a control peptide
QVEKLRKKLKTAQQRC (SEQ ID NO: 294).

This analysis revealed that expression of the endogenous THAP1 protein is linked to cell
proliferation with very low or no expression in quiescent cells and high levels of expression in
mitotic cells. Specifically, the micrographs showed that in primary human endothelial cells,
expression of THAP1 was linked to the proliferation status of the cells and was preferentially
observed in mitotic dividing cells. This immunostaining of mitotic cells with anti-THAP antibodies
was specific since it was also observed in the presence of a control peptide but not after blocking
with the THAP antigenic peptide.

EXAMPLE 38B 
Cell cycle specific expression of THAP1 in S/G2-M phases
To investigate the subcellular localization of endogenous THAP1 during the cell cycle,
human U2OS osteosarcoma cells (ATCC) were analyzed by indirect immunofluorescence with
anti-THAP1 specific antibodies (see Example 38).

U2OS cells were allowed to grow for 24 hours on coverslips, then synchronized in different
phases of the cell cycle by treatment with cell cycle inhibitors, aphidicolin (G1/S phase block, 1
µg/ml for 24h, Sigma ref A0781) or nocodazole (M phase block, 100 ng/ml for 24h, Sigma ref
M1404). Cells in G1 phase were obtained 14h after release from the nocodazole block while cells
in S and G2 phases were obtained 3h or 6h, respectively, after release from the aphidicolin block.
Cells were fixed in methanol for 5 min at -20°C, followed by incubation in cold acetone at -20°C
for 30 sec. Cells were then blocked with PBS-BSA (PBS with 1% bovine serum albumin) for 10
min and then incubated 2 hr at room temperature with the rabbit polyclonal anti-THAP antibodies
diluted in PBS-BSA (1/40). Cells were then washed three times 5 min at room temperature in
PBS-BSA, and incubated for 1 hr with Cy3 (red fluorescence)-conjugated goat anti-rabbit IgG (1/1000,
Amersham Pharmacia Biotech) secondary antibodies, diluted in PBS-BSA. Nuclei were revealed
by staining with DAPI (4,6-diamidino-2-phenylindole; 0.2 µg/ml, 10 min at room temperature).
After extensive washing in PBS, samples were air dried and mounted in Mowiol. Images were
collected on a Leica confocal laser scanning microscope. To verify the specificity of the
immunostaining, in some experiments, the anti-THAP antibodies were pre-incubated with 2.5 µg/ml
of the THAP antigenic peptide AVRRKNFKPTKYSSIC (SEQ ID NO: 293) or 2.5 µg/ml of a
control peptide QVEKLRKKLKTAQQRC (SEQ ID NO: 294).

This analysis revealed that expression of the endogenous THAP1 protein in the nucleus is
cell cycle dependent. Specifically, the micrographs showed that in human U2OS osteosarcoma
tumor cells, expression of THAP1 was linked to the proliferation status of the cells and was
specifically observed in S/G2-M phases of the cell cycle. This immunostaining of cells in S/G2-M
phases of the cell cycle with anti-THAP antibodies was specific since it was also observed in the
presence of a control peptide but not after blocking with the THAP antigenic peptide.

EXAMPLE 39 
THAP1 modulates transcription 
To analyze the effects of THAP1 in transcriptional regulation, Gal4-luciferase reporter 
assays were performed. The method was carried out essentially as described in Vandel et al. (2001) 
Mol Cell Biol 21:6484-6494, and Vaute et al. (2002) Nucleic Acids Res 30:475-481. The full-length 
coding region of THAP1 (amino acids 1-213) was amplified by PCR from HEVEC cDNA 
with primers THAP1-Gal4.1 (5'-CCGAATTCAGGATGGTGCAGTCCTGCTCCGCCT-3') (SEQ ID 
NO: 295) and THAP1-Gal4.2 (5'-GCGCTCTAGATTATGCTGGTACTTCAACTATTTCAAAGTAG-3') 
(SEQ ID NO: 296), digested with EcoRI and XbaI, and cloned in frame downstream of the Gal4 
DNA-binding domain (DBD), between the EcoRI and XbaI sites of the pCMV-Gal4 expression 
vector (Vandel et al. (2001) Mol Cell Biol 21:6484-6494; Vaute et al. (2002) Nucleic Acids Res 
30:475-481), to generate the pCMV-Gal4/THAP1 expression vector. Increasing amounts of the 
pCMV-Gal4/THAP1 or pCMV-Gal4 expression vectors (0.025 mg, 0.05 mg, 0.1 mg, 0.2 mg, 0.5 mg, 
1 mg of plasmid DNA) were co-transfected in COS7 cells, together with a pBS-luciferase reporter 
plasmid (plasmid Gal4-M2-luc, 2 mg) containing four Gal4-UAS upstream of the luciferase reporter 
gene, and a pCMV-lacZ plasmid (0.5 mg) coding for β-galactosidase. A pCMV-Gal4/Suv39H1 
plasmid, encoding the transcriptional repressor Suv39H1 (Vandel et al. (2001) Mol Cell Biol 
21:6484-6494; Vaute et al. (2002) Nucleic Acids Res 30:475-481), was used as a control for 
transcriptional repression. Forty-eight hours after transfection, luciferase activity was measured 
using a luciferase reporter assay kit (Roche). Dosage of β-galactosidase was used to standardize 
transfection efficiencies. 
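
As an illustration of the standardization step, the following minimal Python sketch divides each luciferase reading by the β-galactosidase reading from the same transfection and expresses the result relative to a reference condition. The function names and the numerical readings are hypothetical; they are not data from these experiments, and the exact calculation used in the assays is not stated here.

```python
def normalize_luciferase(luciferase, beta_gal):
    """Return luciferase activity corrected for transfection efficiency."""
    if beta_gal <= 0:
        raise ValueError("beta-galactosidase reading must be positive")
    return luciferase / beta_gal


def relative_activity(sample, reference):
    """Express a normalized sample as a fraction of a normalized reference.

    Both arguments are (luciferase, beta_gal) tuples from one transfection.
    """
    return normalize_luciferase(*sample) / normalize_luciferase(*reference)


# Hypothetical raw readings in arbitrary units, for illustration only.
gal4_alone = (12000.0, 2.0)  # reference: Gal4 DBD without THAP1
gal4_thap1 = (3000.0, 1.5)   # Gal4/THAP1 fusion

ratio = relative_activity(gal4_thap1, gal4_alone)
print(f"Relative reporter activity: {ratio:.2f}")
```

A ratio below 1 in such a comparison would be read as repression relative to the Gal4 DNA-binding domain alone.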

These Gal4-luciferase reporter assays revealed that THAP1 is able to modulate 
transcription (Figures 21A and 21B) and exhibits transcriptional repressor properties similar to 
those of the transcriptional repressor Suv39H1 (Vandel et al. (2001) Mol Cell Biol 21:6484-6494; 
Vaute et al. (2002) Nucleic Acids Res 30:475-481). 

EXAMPLE 40 

Analysis of the subcellular localization of chemokine SLC/CCL21 
To analyze the subcellular localization of the SLC/CCL21 protein, the cDNA encoding the 
mature form of human SLC/CCL21 (amino acids 24-134 of SEQ ID NO: 119) (GenBank Accession 
Number NP_002980) is cloned in frame downstream of the Enhanced Green Fluorescent Protein 
(EGFP) ORF in pEGFP.C2 vector (Clontech). The pEGFP.C2-SLC/CCL21 vector is generated by 
subcloning the BamHI SLC/CCL21 fragment from pGBKT7-SLC/CCL21 (see Example 15) into 
the unique BamHI cloning site of vector pEGFPC2 (Clontech). The GFP-SLC/CCL21 expression 
construct is then transfected into human primary endothelial cells from umbilical vein (HUVEC, 
PromoCell, Heidelberg, Germany). HUVEC are grown in complete ECGM medium (PromoCell, 
Heidelberg, Germany), plated on coverslips and transiently transfected in RPMI medium using 
GeneJammer transfection reagent according to manufacturer instructions (Stratagene, La Jolla, CA, 
USA). Analysis by fluorescence microscopy 24 hours later reveals that the GFP-SLC/CCL21 fusion 
protein localizes in the nucleus while GFP alone exhibits only a diffuse staining over the entire cell. 

To investigate the subcellular localization of endogenous SLC/CCL21, 
immunohistochemistry with anti-SLC/CCL21 antibodies is performed on human tissue sections. 
Tissue specimens of fresh palatine tonsils are embedded in OCT compound (TissueTek, Elkhart, 
IN) and then snap-frozen in liquid nitrogen. Cryosections (6 µm) are air-dried overnight, and 
acetone fixed (10 min, -20°C). Following one PBS wash, sections are treated for 5 min at room 
temperature in PBS containing 0.1% Triton X-100, and washed again with PBS. The tissue sections 
are then incubated with a mixture of rabbit polyclonal antibodies against human SLC/CCL21 
(1/100, Chemicon, USA) followed by a mixture of Cy3-conjugated goat anti-rabbit IgG (1/1000, 
Amersham Pharmacia Biotech) secondary antibodies, diluted in PBS-BSA. Nuclei are revealed by 
staining with DAPI (4',6-diamidino-2-phenylindole; 0.2 µg/ml, 10 min at room temperature). After 
extensive washing in PBS, samples are air dried and mounted in Mowiol. Microscopy is performed 
with a Nikon Eclipse TE300 fluorescence microscope equipped with a Nikon digital camera 
DXM1200 (Nikon Corp., Tokyo, Japan). 

Experiments similar to those described above were performed in HeLa cells and GFP-SLC 
was shown to localize to the nucleus. Figure 27A shows areas of localization of GFP-SLC which 
correspond to nuclei as shown by DAPI counterstain (Figure 27B). 

EXAMPLE 40B 

Analysis of the subcellular localization of chemokine MIG/CXCL9 
To analyze the subcellular localization of the MIG/CXCL9 protein, the cDNA encoding 
the mature form of human MIG/CXCL9 (amino acids 23-125 of GenBank Accession No. 
NM_002416) (CXCL9 polypeptide SEQ ID NO: 275, CXCL9 cDNA SEQ ID NO: 276) is cloned 
in frame downstream of the Enhanced Green Fluorescent Protein (EGFP) ORF in pEGFP.C2 vector 
(Clontech). The pEGFP.C2-MIG/CXCL9 vector is generated by subcloning the EcoRI-BamHI 
MIG/CXCL9 fragment from pGBKT7-MIG/CXCL9 (see Example 32) between the EcoRI-BamHI 
cloning sites of vector pEGFPC2 (Clontech). The GFP-MIG/CXCL9 expression construct is then 
transfected into human primary endothelial cells from umbilical vein (HUVEC, PromoCell, 
Heidelberg, Germany) or human immortalized HeLa cells. HUVEC are grown in complete ECGM 
medium (PromoCell, Heidelberg, Germany), plated on coverslips and transiently transfected in 
RPMI medium using GeneJammer transfection reagent according to manufacturer instructions 
(Stratagene, La Jolla, CA, USA). Human HeLa cells (ATCC) were grown in Dulbecco's Modified 
Eagle's Medium supplemented with 10% Fetal Calf Serum and 1% Penicillin-streptomycin (all 
from Life Technologies, Grand Island, NY, USA), plated on coverslips, and transiently transfected 
with the calcium phosphate method using 2 µg pEGFPC2-MIG/CXCL9 plasmid. Analysis of 
transfected HUVEC or HeLa cells by fluorescence microscopy 24 hours later revealed that the 
GFP-MIG/CXCL9 fusion protein, similarly to GFP-SLC/CCL21, localizes in the nucleus while GFP 
alone exhibits only a diffuse staining over the entire cell. 

Experiments similar to those described above were performed in HeLa cells and GFP-MIG 
was shown to localize to the nucleus. Figure 27C shows areas of localization of GFP-MIG which 
correspond to nuclei as shown by DAPI counterstain (Figure 27D). 

EXAMPLE 40C 

CXCR3-dependent nuclear translocation of chemokine MIG/CXCL9 
To demonstrate the capacity of secreted chemokine MIG/CXCL9 to undergo CXCR3-dependent 
nuclear translocation, the cDNA encoding the full length form of human MIG/CXCL9 (amino acids 
1-125 of GenBank Accession No. NM_002416) (CXCL9 polypeptide SEQ ID NO: 275, CXCL9 
cDNA SEQ ID NO: 276) was amplified by PCR from Image clone No. 5228247 with primers 
CXCL9-3 (5'-CCGAATTCCCACCATGAAGAAAAGTGGTGTTCTT-3') (SEQ ID NO: 327) 
and CXCL9-4 (5'-CCGGATCCTGTAGTCTTCTTTTGACGAGAACGTTG-3') (SEQ ID NO: 
328), digested with EcoRI and BamHI, and cloned between the EcoRI and BamHI cloning sites of 
vector pFLAG-CMV-5a (Sigma) to generate the phMIG-Flag expression vector. The CXCR3 
expression vector pEF-CXCR3 was generated by cloning the full length CXCR3 cDNA (amino 
acids 1-368 of GenBank Accession No. NM_001504) (CXCR3 polypeptide SEQ ID NO: 304, 
CXCR3 cDNA SEQ ID NO: 305), amplified by PCR from Image clone No. 5176136 with primers 
5'Xba-CXCR3 (5'-CCTCTAGACCACCATGGTCCTTGAGGTGAGTGAC-3') (SEQ ID NO: 
329) and 3'Not-CXCR3 (5'-CCCGCGGCCGCTCACAAGCCCGAGTAGGAGGC-3') (SEQ ID 
NO: 330), between the XbaI and NotI sites of the pEF-BOS expression vector (Mizushima and 
Nagata, Nucleic Acids Research, 18:5322, 1990). The phMIG-Flag expression construct was then 
transfected into human U2OS osteosarcoma cancer cells. Human U2OS cells (ATCC) were grown 
in Dulbecco's Modified Eagle's Medium supplemented with 10% Fetal Calf Serum and 1% 
Penicillin-streptomycin (all from Life Technologies, Grand Island, NY, USA), plated on coverslips, 
and transiently transfected with the calcium phosphate method using 2 µg phMIG-Flag plasmid 
together with pEF-CXCR3 or pEF-Bos control vector. U2OS cells co-transfected with phMIG-Flag 
and pEF-CXCR3 or pEF-Bos expression vectors were analysed 48 h later by fluorescence 
microscopy. Cells were washed twice with PBS, fixed for 15 min at room temperature in PBS 
containing 3.7% formaldehyde, and washed again with PBS prior to neutralization with 50 mM 
NH4Cl in PBS for 5 min at room temperature. Following one more PBS wash, cells were 
permeabilized for 5 min at room temperature in PBS containing 0.1% Triton X-100, and washed again 
with PBS. Permeabilized cells were then blocked with PBS-BSA (PBS with 1% bovine serum 
albumin) for 10 min and then incubated for 2 hr at room temperature with rabbit polyclonal antibodies 
anti-Flag epitope (1/200, Sigma) and mouse monoclonal antibody anti-CXCR3 (mouse IgG1, 
1/200, R&D) diluted in PBS-BSA. Cells were then washed three times for 5 min at room temperature 
in PBS-BSA, and incubated for 1 hr with Cy3 (red fluorescence)-conjugated goat anti-rabbit IgG 
(1/1000, Amersham Pharmacia Biotech) and FITC-labeled goat anti-mouse IgG (1/40, Zymed 
Laboratories Inc., San Francisco, CA, USA) secondary antibodies, diluted in PBS-BSA. After 
extensive washing in PBS, samples were air dried and mounted in Mowiol. Images were collected 
on a Leica confocal laser scanning microscope. The FITC (green) and Cy3 (red) fluorescence 
signals were recorded sequentially for identical image fields to avoid cross-talk between the 
channels. 

In cells co-transfected with phMIG-Flag and pEF-CXCR3 expression vectors, hMIG-Flag 
was found to accumulate in the nucleus of the majority of transfected cells (Figures 28A-D and 
29A-C). Nuclear localization of MIG-Flag was specifically observed in CXCR3-positive cells 
(Figure 29A-C) and was not found in cells co-transfected with the pEF-Bos control vector (Figure 
28A-D). These results demonstrated that chemokine receptor CXCR3 plays an essential role in 
nuclear translocation of secreted chemokine MIG. 

EXAMPLE 41 

The THAP1/SLC-CCL21 complex modulates transcription 
To analyze the effects of SLC/CCL21 and the THAP1/SLC-CCL21 complex in 
transcriptional regulation, Gal4-luciferase reporter assays are performed essentially as described in 
Example 39. The SLC/CCL21 expression vector used in these transcription assays (pEF-SLC/CCL21) 
is generated by PCR. A cDNA encoding the mature form of human SLC/CCL21 
(amino acids 24-134 of SEQ ID NO: 119) (GenBank Accession Number NP_002980) is amplified 
by PCR from HEVEC RNA with primers hSLC.Xba (5'- 
GCGTCTAGAATGAGTGATGGAGGGGCTCAGGACTGTTG-3') (SEQ ID NO: 297) and 
hSLCNot (5'-GGGCGGCCGCCTATGGCCCTTTAGGGGTCTGTGACCGC-3') (SEQ ID NO: 
298), digested with XbaI and NotI, and cloned into the XbaI and NotI sites of the pEF-BOS 
expression vector (Mizushima and Nagata, Nucleic Acids Research, 18:5322, 1990). 

Increasing amounts of the pEF-SLC/CCL21 plasmid (0.025 mg, 0.05 mg, 0.1 mg, 0.2 mg, 
0.5 mg, 1 mg of plasmid DNA) are co-transfected in COS7 cells, together with pCMV-Gal4/THAP1 
or pCMV-Gal4 expression vectors (0.5 mg), a pBS-luciferase reporter plasmid 
(plasmid Gal4-M2-luc, 2 mg) containing four Gal4-UAS upstream of the luciferase reporter gene, 
and a pCMV-lacZ plasmid (0.5 mg) coding for β-galactosidase. Forty-eight hours after transfection, 
luciferase activity is measured using a luciferase reporter assay kit (Roche). Dosage of 
β-galactosidase is used to standardize transfection efficiencies. 

These Gal4-luciferase reporter assays should reveal that SLC/CCL21 is able to modulate 
transcriptional activity of THAP1, indicating a role for the THAP1/SLC-CCL21 complex in 
transcriptional regulation (Figure 22A). 

Similar to other cytokines such as IFNγ (Bader and Wietzerbin (1994) PNAS 91:11831-11835; 
Subramaniam et al. (1999) J Biol Chem 274:403-407) and growth factors such as 
FGF2 (Baldin et al. (1990) EMBO J 9:1511-1517), the basic SLC/CCL21 chemokine may be 
internalized and translocated to the nucleus, where it may associate with THAP1 and modulate 
(stimulate or inhibit) transcription of specific target genes. Target genes of THAP1 and the 
THAP1/SLC complex can include genes involved in cell proliferation and cell differentiation, 
particularly endothelial cell differentiation and endothelial or cancer cell proliferation. 

It will be appreciated that the above-described methods can be used to determine whether 
any particular THAP1/chemokine complex or THAP-family polypeptide/chemokine complex has 
the ability to modulate transcription. For example, cDNAs encoding THAP-family members 
THAP1 to THAP11 as well as THAP0 from humans and other species can be cloned in an 
expression vector such as pCMV-Gal4, the desired chemokine is cloned into the expression vector 
pEF-BOS and the expression constructs are then either transfected separately or cotransfected into 
COS7 cells comprising a pBS-luciferase reporter plasmid. Luciferase assays are performed as 
described above. 

Chemokines which can be tested in combination with THAP1 or other THAP-family 
polypeptides for their ability to modulate transcription include, but are not limited to, XCL1, XCL2, 
CCL1, CCL2, CCL3, CCL3L1, SCYA3L2, CCL4, CCL4L, CCL5, CCL6, CCL7, CCL8, SCYA9, 
SCYA10, CCL11, SCYA12, CCL13, CCL14, CCL15, CCL16, CCL17, CCL18, CCL19, CCL20, 
CCL21, CCL22, CCL23, CCL24, CCL25, CCL26, CCL27, CCL28, clone 391, CARP CC-1, 
CCL1, CK-1, regakine-1, K203, CXCL1, CXCL1P, CXCL2, CXCL3, PF4, PF4V1, CXCL5, 
CXCL6, PPBP, SPBPBP, IL8, CXCL9, CXCL10, CXCL11, CXCL12, CXCL14, CXCL15, 
CXCL16, NAP-4, LFCA-1, Scyba, JSC, VHSV-induced protein, CX3CL1 and fCL1. 

In experiments conducted using MIG and THAP1, it was shown that MIG/THAP1 
complexes could modulate gene transcription (see Figure 22B and Example 47). 

EXAMPLE 42 

Fucosyltransferase TVII is a target gene of THAP1 and/or the THAP1/SLC-CCL21 complex 
Since chemokine SLC/CCL21 has been shown to induce the high endothelial venule 
phenotype in endothelial cells (Fan et al. (2000) J Immunol 164:3955-3959; Grant et al. (2002) Am 
J Pathol 160:1445-55; Yoneyama et al. (2001) J Exp Med 193:35-49), we searched for target 
genes of the THAP1/SLC-CCL21 complex among the few high endothelial venule-specific genes 
that have been described. This analysis led to the identification of many THAP domain recognition 
sequences in the promoter of the human Fucosyltransferase TVII gene (Figure 23), one of the key 
high endothelial venule enzymes for lymphocyte recruitment (Smith et al. (1996) J Biol Chem 
271:8250-8259; Maly et al. (1996) Cell 86:643-653). 

To confirm that the Fucosyltransferase TVII promoter is a target of THAP1 and/or the 
THAP1/SLC-CCL21 complex, transcription assays are performed with luciferase reporter genes 
under the control of the FucTVII promoter. The FucTVII promoter (nucleotides 650-950, GenBank 
Accession Number AB012668) is amplified by PCR from human genomic DNA with primers 
FucTVII-1 (5'-GCGCTCGAGCTGCACCTGGGCCTTCTCTGCCCTGG-3') (SEQ ID NO: 299) 
and FucTVII-2 (5'-CGAAGCTTACTGTGCTCCTTTTATCTCTGCCCAAG-3') (SEQ ID NO: 
300), digested with XhoI and HindIII, and cloned in the same sites of the pGL3-Basic luciferase 
reporter plasmid (Promega) to generate pGL3-proFucTVII-luc. 

To analyze the effects of SLC/CCL21 and the THAP1/SLC-CCL21 complex on the 
FucTVII promoter, luciferase reporter assays are performed essentially as described in Example 39. 
Increasing amounts of the pEF-SLC/CCL21 and/or pEGFPC2-THAP1 plasmid (0.025 mg, 0.05 mg, 
0.1 mg, 0.2 mg, 0.5 mg, 1 mg of plasmid DNA) are co-transfected in COS7 cells, together with the 
pGL3-proFucTVII-luciferase reporter plasmid, and pCMV-lacZ (0.5 mg) coding for 
β-galactosidase. Forty-eight hours after transfection, luciferase activity is measured using a luciferase 
reporter assay kit (Roche). Dosage of β-galactosidase is used to standardize transfection 
efficiencies. 

These luciferase reporter assays with the pGL3-proFucTVII-luciferase reporter plasmid 
reveal that THAP1, SLC/CCL21 and the THAP1/SLC-CCL21 complex are able to modulate 
transcriptional activity of the FucTVII promoter, indicating that the human Fucosyltransferase TVII 
gene is a target of THAP1 and the THAP1/SLC-CCL21 complex, further confirming the role of the 
THAP1/SLC-CCL21 complex in transcriptional regulation. 

EXAMPLE 43 
Retrovirus mediated expression of THAP1 and 
chemokines SLC/CCL21 and MIG/CXCL9 in primary human endothelial cells 
Background: The method described below uses retroviral derived vectors to transduce at 
high efficiency human primary umbilical vein endothelial cells (HUVEC) with THAP1, 
SLC/CCL21, MIG/CXCL9 or any other gene of interest. This retroviral packaging system includes 
retroviral packaging plasmids and packageable vector transcripts that are produced from high 
expression plasmids after transient tri-transfection in mammalian cells. High titers of recombinant 
retroviruses are produced in these transfected mammalian cells and can then transduce a 
mammalian target cell, here HUVEC, by fresh supernatant infection at high efficiency. This 
method is useful for the rapid production of high titer viral supernatants to transduce with high 
efficiency cells that are refractory to transduction by conventional means such as simple 
transfection. The transduction protocol in primary HUVEC has been optimized with an MLV- 
derived vector encoding enhanced green fluorescent protein (eGFP) to determine the optimal 
infection conditions. 

The retroviral constructs are packaging plasmids consisting of at least one retroviral helper 
DNA sequence derived from a replication-incompetent retroviral genome encoding in trans all 
virion proteins required to package a replication-incompetent retroviral vector, and for producing 
virion proteins capable of packaging the replication-incompetent retroviral vector at high titer, 
without the production of replication-competent helper virus. The retroviral DNA sequence lacks 
the region encoding the native enhancer and/or promoter of the viral 5' LTR of the virus, and lacks 
both the psi function sequence responsible for packaging helper genome and the 3' LTR, but 
encodes the foreign β-globin polyadenylation site. The retrovirus is a leukemia virus, the Moloney 
Murine Leukemia Virus (MMLV). The foreign enhancer and promoter is the human 
cytomegalovirus (HCMV) immediate early (IE) enhancer and promoter. The retroviral packaging 
plasmid consists of two retroviral helper DNA sequences encoded by plasmid based expression 
vectors, for example where a first helper sequence contains a cDNA encoding the gag and pol 
proteins of ecotropic MMLV and a second helper sequence contains a cDNA encoding the env 
protein. The Env gene, which determines the host range, is derived from the Vesicular Stomatitis 
Virus (VSV) G protein. 

Plasmid constructions: MLV retroviral vectors were based on a MoMLV-derived vector 
called pCFB, from Stratagene, in which the U3 region of the 5' LTR was replaced by the 
enhancer/promoter of the cytomegalovirus immediate early (CMV IE) gene. The multi-cloning site 
was modified by incorporation of synthetic oligonucleotides 5'- 
GGCATTCAATTGCTCGAGTTTAAACGCGGCCGCG-3' (SEQ ID NO: 331) and 5'- 
AATCCGCGGCCGCGTTTAAACTCGAGCAATTGAATGCC-3' (SEQ ID NO: 332) containing 
the NaeI and MfeI restriction sites and replacing nucleotides from position 1742 to 2244 of the 
pCFB plasmid. The modified vector was called pMLV-MCS. The pVSVG plasmid encoding the 
VSVG envelope and the pGAGPOL plasmid encoding gag and pol genes have been constructed as 
follows: VSVG and GAG-POL DNA fragments were amplified from pVPack-VSV-G 
and pVPack-GP plasmids, respectively, as templates (Stratagene) and cloned into the CMV-β-globin 
intron-MCS-β-globin polyA expression cassette following conventional cloning procedures. Primers 
used to amplify the vsvg and gagpol fragments were, respectively, VSVG-5' (5'- 
ATGAAGTGCCTTTTGTACTTAGCCTT-3') (SEQ ID NO: 333) and VSVG-3' (5'- 
TCATAAAAATTAAAAACTCAAATATAATTGAGG-3') (SEQ ID NO: 334), and GAGPOL-5' 
(5'-ATGGGCCAGACTGTTACCACTC-3') (SEQ ID NO: 335) and GAGPOL-3' (5'- 
TTAGGGGGCCTCGCGG-3') (SEQ ID NO: 336). 
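
As a hedged illustration of how a synthetic multi-cloning-site oligonucleotide can be checked for restriction sites, the Python sketch below scans the first oligonucleotide (SEQ ID NO: 331) for a small set of recognition sequences. Only MfeI is taken from the text above; XhoI, PmeI and NotI are included as an assumption about which additional sites may be of interest, using their standard recognition sequences. The helper names are hypothetical.

```python
# Standard recognition sequences; only MfeI is named in the text above.
SITES = {
    "MfeI": "CAATTG",
    "XhoI": "CTCGAG",
    "PmeI": "GTTTAAAC",
    "NotI": "GCGGCCGC",
}

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def find_sites(seq, sites=SITES):
    """Map each enzyme name to the 0-based start positions of its site."""
    seq = seq.upper()
    hits = {}
    for name, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)
        hits[name] = positions
    return hits


MCS_OLIGO = "GGCATTCAATTGCTCGAGTTTAAACGCGGCCGCG"  # SEQ ID NO: 331

for enzyme, positions in find_sites(MCS_OLIGO).items():
    print(enzyme, positions)
```

Because each of these recognition sequences is palindromic, scanning one strand is sufficient; for non-palindromic sites the reverse complement would need to be scanned as well.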

The full length coding region of human THAP1 (SEQ ID NO: 3; amino acids 1 to 213) 
was amplified by PCR according to standard procedures with primers THAP1-5' (5'- 
ATGGTGCAGTCCTGCTCCGC-3') (SEQ ID NO: 337) and THAP1-MfeI-3' (5'- 
GCCAATTGTTATGCTGGTACTTCAACTATTT-3') (SEQ ID NO: 338) using a recombinant 
vector containing the human THAP1 cDNA as template. The reverse primer contains an MfeI 
restriction site at its end to generate a 3' overhang compatible with the 5' end of the cleaved vector. 
The amplified DNA was then digested with MfeI and purified by electrophoresis on an agarose gel, 
and the desired fragments were then cloned into the vector pMLV-MCS digested with NaeI 
and MfeI restriction enzymes. 

Coding regions of human SLC/CCL21 (GenBank NP) and human MIG/CXCL9 
(NM_002416) were amplified by PCR in such a way that the amplified fragments did not contain 
the signal peptide, located from nucleotide 4 to nucleotide 66 of the full length open 
reading frame of both sequences. By deleting signal peptide signatures, SLC/CCL21 and 
MIG/CXCL9 proteins are localized into the nucleus of the cell after protein expression in the 
cytoplasm. Primers used were SLC-5' (5'-ATGAGTGATGGAGGGGCTCAGG-3') (SEQ ID NO: 
339) and SLC-EcoRI-3' (5'-GGAATTCCTATGGCCCTTTAGGG-3') (SEQ ID NO: 340), and MIG-5' 
(5'-ATGACCCCAGTAGTGAGAAAGGGTC-3') (SEQ ID NO: 341) and MIG-EcoRI-3' (5'- 
GGAATTCTTATGTAGTCTTCTTTTGACGAGA-3') (SEQ ID NO: 342) for SLC/CCL21 and 
MIG/CXCL9, respectively. Both reverse primers contain an EcoRI restriction site at their end to 
generate a 3' overhang compatible with the 5' end of the cleaved vector. The amplified DNAs were 
then digested with EcoRI and purified by electrophoresis on an agarose gel, and the desired fragments 
were then cloned into the vector pMLV-MCS digested with NaeI and EcoRI restriction 
enzymes. The recombinant vectors thus obtained, pMLV-THAP1, pMLV-SLC/CCL21 and 
pMLV-MIG/CXCL9, encode amino acids 2 to 213 of the THAP1 sequence, amino acids 24 to 134 of 
the maturated SLC/CCL21 sequence, or amino acids 23 to 125 of the maturated MIG/CXCL9 
sequence. 
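
The signal peptide deletion described above (removal of nucleotides 4 to 66 of the open reading frame while keeping the ATG start codon) can be sketched as a simple string operation. The Python example below is illustrative only: the function name is hypothetical and the input is a toy sequence, not the SLC/CCL21 or MIG/CXCL9 open reading frame.

```python
def delete_signal_peptide(orf, first_nt=4, last_nt=66):
    """Remove nucleotides first_nt..last_nt (1-based, inclusive) from an ORF.

    With the defaults, the ATG start codon (nucleotides 1-3) is kept and the
    21 codons that follow it are dropped, so the reading frame is preserved.
    """
    removed = last_nt - first_nt + 1
    if removed % 3 != 0:
        raise ValueError("deletion must span a whole number of codons")
    if len(orf) < last_nt:
        raise ValueError("ORF is shorter than the requested deletion")
    return orf[: first_nt - 1] + orf[last_nt:]


# Toy ORF: ATG, 21 placeholder codons standing in for a signal peptide,
# a short placeholder "mature" region, and a stop codon.
toy_orf = "ATG" + "GCT" * 21 + "AGTGATGGAGGG" + "TAG"

mature_orf = delete_signal_peptide(toy_orf)
print(mature_orf)
```

In practice the same result is obtained experimentally by placing the forward primer so that an ATG codon is fused directly to the first codon of the mature sequence, as the SLC-5' and MIG-5' primers do.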

Transfection, virus harvest, and retroviral infection of cells: Retroviral vectors carrying 
either THAP1 or SLC/CCL21 or MIG and driven by the Moloney murine leukemia virus LTR were 
produced by transient transfection in 293T cells (ATCC No. CRL11268, ATCC, Rockville, 
Md) with the following plasmids: the packaging plasmid (pGAGPOL), the envelope plasmid 
coding for the vesicular stomatitis virus G protein (pVSV-G), and one of the transducing vectors 
pMLV-THAP1, pMLV-SLC, pMLV-MIG, pMLV-MCS or pMLV-EGFP. 293T cells were 
transfected via the calcium phosphate precipitation method (Pear et al., 1993). Briefly, cells were 
plated at a density of 4 x 10^6 cells per 75 cm2 dish one day prior to transfection. DNA-calcium 
phosphate complexes consisting of pVSVg, pGAGPOL and one of the transducing vectors 
pMLV-THAP1, pMLV-SLC, pMLV-MIG, pMLV-MCS or pMLV-EGFP were diluted in an equal volume 
of HBS2x buffer and incubated with cells for 16 hours. After media removal, cells were 
replenished with fresh medium and further incubated for 24 hours. Cell supernatants containing 
viral particles were harvested every 8-12 hours, clarified of cell debris using low-speed 
centrifugation and filtered on 0.45 µm filters. 

Cell Transduction: A total of 10^6 HUVEC were transduced in a 75 cm2 plate with 10 ml 
of viral supernatant in the presence of 8 µg/ml of polybrene (Sigma) as previously described (Yu, H. 
et al., Gene Therapy, 6, 1876-1883, 1999). After 4 hours, the supernatant was replaced by fresh 
endothelial cell medium consisting of MCDB131 (Gibco BRL) supplemented with 20% heat- 
inactivated serum, endothelial cell growth factor (ECGS, Sigma Chemical Co.) and 5 U/ml sodium 
heparin. When applicable, a second transduction was performed using the same protocol a day after 
the first transduction. Forty-eight hours after the second infection, cells were trypsinized and 
pelleted for RNA preparation. Total RNA was isolated from 10^6 cells with the Absolutely RNA 
miniprep kit according to the manufacturer's instructions (Stratagene, La Jolla, CA, USA). 

EXAMPLE 44 
Identification of THAP1 target genes by DNA microarrays 
and real-time Polymerase Chain Reaction (PCR) 

To better understand the function of THAP1 as a nuclear factor in vasculature, we globally 
profiled THAP1 target genes either in primary human endothelial cells or in primary endothelial 
cells constitutively expressing chemokines in the nucleus using retroviral gene transfer and Agilent 
oligonucleotide-based microarray technology. We quantitated the THAP1 mediated changes in 
expression of more than 17,000 mRNAs by transducing human vascular endothelial cells with the 
following set of viral particles: MCS as the negative infection control, THAP1, SLC/CCL21 and 
MIG/CXCL9. In addition, SLC/CCL21 and MIG/CXCL9 infected endothelial cells were 
re-infected a day after with viral particles containing either MCS or THAP1. After 50 and 120 hours 
of the second infection, HUVEC cells were pelleted, washed and lysed to prepare total RNA and 
protein extracts. Over-expression of THAP1, SLC/CCL21 and MIG/CXCL9 in HUVECs was 
verified both at RNA and protein levels with standard quantitative PCR and Western blotting 
procedures. 

Oligonucleotide Array Expression Analysis 

Total RNA quality control was performed by running 25-50 ng on an RNA 6000 Nano 
Assay (Agilent) using a Bio-analyser 2100. For labelling, 500 ng of good quality total RNA was 
reverse-transcribed with an oligo-dT-T7 and double stranded cDNA was generated with the 
superscript double stranded cDNA synthesis kit (Invitrogen). In an in vitro transcription step with 
T7 RNA polymerase, the cDNA was linearly amplified and labeled with fluorescent nucleotides 
(low RNA input fluorescent linear amplification kit, Agilent). Ten micrograms of labeled and 
fragmented cRNA was then hybridized onto a Human genome 1A expression array (G4110A, 
Agilent) for 16 hours at 45°C. Post-hybridization staining and washing were performed according 
to manufacturer instructions. Finally, chips were scanned with an Agilent DNA microarray scanner 
at the Microarray Facility. Data acquisition and analysis were performed with the Agilent Feature 
Extraction and Analysis software using the Rosetta Resolver data analysis system. 

Real-Time Polymerase Chain Reaction (PCR) 

Real-time PCR was performed on cDNA synthesized from RNA isolated from HUVEC
cells infected with THAP1, MCS, SLC, MIG, SLC and MCS, SLC and THAP1, MIG and MCS or
MIG and THAP1 retroviral constructs using the ABI7700 Prism SDS Real-Time PCR Detection
System (Applied Biosystems). The ABI7700 Prism was formatted for 96-well plates containing 25
µl PCR reactions. Each 25 µl reaction contained 2 µl DNA template
(dilution 1:4), 12.5 µl SYBR Green PCR Master Mix (Applied Biosystems, Foster City, CA,
USA) and 0.8 µM forward and reverse gene primers. PCR conditions consisted of an initial
denaturing step for 10 minutes at 95°C, followed by 40 cycles of a 2-segment step consisting of
denaturation for 30 seconds at 95°C and annealing and elongation for 60 seconds at 60°C. After
real-time analysis, a melting curve was established for all samples to ensure specific amplification. A
negative control, in which no template DNA was used, was run on each plate, as was a comparison
of GAPDH between all samples. GAPDH served to equilibrate the starting material between the
two experimental conditions. All unknown samples as well as controls were run in duplicate on the
same plate (except for the negative control). Reactions were recorded and analysed with the
ABI7700 Prism SDS sequence detection system. The threshold cycle (Ct) for each sample run in
duplicate was determined and fold differences were calculated as detailed previously (Van Trappen
et al., 2001).
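
The fold-difference calculation is not spelled out in this section. As a minimal sketch, assuming the widely used comparative threshold-cycle (2^-ddCt) approach with GAPDH as the reference gene and an amplification efficiency of 100%, the computation could look as follows. The function names and all Ct values are hypothetical and are given for illustration only; the cited procedure may differ in detail.

```python
def mean(values):
    """Arithmetic mean of replicate threshold-cycle (Ct) values."""
    return sum(values) / len(values)


def fold_change(ct_target_test, ct_ref_test, ct_target_ctrl, ct_ref_ctrl):
    """Relative expression of a target gene (test versus control sample).

    Each argument is a list of replicate Ct values. The reference gene
    (here assumed to be GAPDH) normalizes for the amount of starting material.
    Assumes perfect doubling per cycle (100% amplification efficiency).
    """
    d_ct_test = mean(ct_target_test) - mean(ct_ref_test)   # normalized test sample
    d_ct_ctrl = mean(ct_target_ctrl) - mean(ct_ref_ctrl)   # normalized control sample
    dd_ct = d_ct_test - d_ct_ctrl
    return 2.0 ** (-dd_ct)


# Hypothetical duplicate Ct values: target gene and GAPDH in a test sample
# and in a control sample.
fc = fold_change([26.0, 26.2], [18.0, 18.2], [24.0, 24.2], [18.0, 18.2])
print(round(fc, 3))
```

A value below 1 indicates lower expression of the target gene in the test sample than in the control sample.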
EXAMPLE 45
THAP1 regulates cell cycle specific genes and modulates
proliferation of both primary endothelial cells and immortalized cancer cells
We combined data from independent microarray analyses of the effects of THAP1 on gene
expression in primary human endothelial cells to identify THAP1-target genes (THAP responsive
genes).

Table 2A lists certain genes regulated by THAP1, as revealed by two independent
microarray experiments with human primary endothelial cells transduced with THAP1 (THAP) or
control (MCS) retrovirus expression vectors. The GenBank and Swiss-Prot accession numbers are
indicated for each gene as well as the fold changes, p-values and signal intensities obtained in the
two microarray experiments.

Table 2B lists database accession numbers for each gene and corresponding polypeptide
listed in Table 2A. Table 2B also shows the Agilent oligonucleotide which corresponds to each
gene listed in Table 2A.
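
The selection criteria used to call a gene THAP1-regulated are not stated here. As a hedged sketch, genes that are consistently downregulated in both independent experiments could be selected from per-gene fold changes and p-values as follows. The thresholds, the sign convention (negative fold change means downregulated), the function name and the placeholder gene names are all assumptions made for illustration.

```python
def consistent_downregulated(exp1, exp2, max_fold=-1.5, max_p=0.01):
    """Return genes downregulated in both experiments.

    exp1, exp2: dict mapping gene name -> (fold_change, p_value), where a
    negative fold change denotes downregulation (assumed convention).
    max_fold, max_p: hypothetical cut-offs, not taken from the source.
    """
    hits = []
    for gene in sorted(set(exp1) & set(exp2)):
        fold1, p1 = exp1[gene]
        fold2, p2 = exp2[gene]
        if fold1 <= max_fold and fold2 <= max_fold and p1 <= max_p and p2 <= max_p:
            hits.append(gene)
    return hits


# Placeholder data; GENE_A..GENE_C are not real measurements.
experiment_1 = {"GENE_A": (-2.4, 0.001), "GENE_B": (-1.8, 0.2), "GENE_C": (1.1, 0.5)}
experiment_2 = {"GENE_A": (-2.1, 0.004), "GENE_B": (-1.9, 0.003), "GENE_C": (1.0, 0.9)}
print(consistent_downregulated(experiment_1, experiment_2))
```

Requiring agreement between both experiments trades sensitivity for a lower rate of false positives, which is useful when only a few dozen candidates are expected among many thousands of probes.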
Out of 17000 genes examined in these microarray experiments, we identified 23 candidate
THAP1-target genes that are downregulated in THAP1-overexpressing cells. One of the genes
identified corresponds to THAP1 itself (FLJ10477), suggesting auto-regulation. Nine genes
correspond to predicted proteins with unknown functions. Strikingly, at least 10 of the remaining
13 genes downregulated by THAP1 (see Table 2A) correspond to proteins previously linked to cell
cycle/cell proliferation (CKS1, Survivin, PTTG1/Securin, PTTG2/Securin2, PTTG3/Securin3,
MAD2L1, USP16, HMMR, KIAA0008, CDCA7). Many of these genes share common
characteristics.

1) role in mitosis/chromosome segregation: Survivin (polypeptide sequence SEQ ID NO:
343, nucleotide sequence SEQ ID NO: 344) (Li et al. (1998) Nature 396:580-584; Li et al.
(1999) Nature Cell Biol 1:461-466; Lens et al. (2003) EMBO J 22:2934-2947),
PTTG1/Securin (polypeptide sequence SEQ ID NO: 345, nucleotide sequence SEQ ID
NO: 346) (Zou et al. (1999) Science 285:418-422; Wang et al. (2001) Mol Endocrinol
15:1870-1879), CKS1 (polypeptide sequence SEQ ID NO: 347, nucleotide sequence SEQ
ID NO: 348) (Pines (1996) Curr Biol 6:1399-1402; Hixon et al. (2000) J Biol Chem
275:40434-40442), MAD2L1 (polypeptide sequence SEQ ID NO: 349, nucleotide
sequence SEQ ID NO: 350) (Dobles et al. (2000) Cell 101:635-645; Michel et al. (2001)
Nature 409:355-359), USP16/Ubp-M (polypeptide sequence SEQ ID NO: 351, nucleotide
sequence SEQ ID NO: 352) (Cai et al. (1999) PNAS 96:2828-2833), HMMR/RHAMM
(polypeptide sequence of isoform A, SEQ ID NO: 353, nucleotide sequence of transcript
variant 1, SEQ ID NO: 354) (polypeptide sequence (gi/7108351) SEQ ID NO: 365,
nucleotide sequence of transcript variant 2, SEQ ID NO: 366) (Maxwell et al. (2003) Mol
Biol Cell 14:2262-2276; Tolg et al. (2003) Oncogene 22:6873-6882), KIAA0008/HURP
(polypeptide sequence SEQ ID NO: 355, nucleotide sequence SEQ ID NO: 356) (Tsou et
al. (2003) Oncogene 22:298-307);

2) specific mRNA expression in S/G2-M: CKS1 (Richardson et al. (1990) Genes Dev
4:1332-1344), Survivin (Li et al. (1998) Nature 396:580-584), PTTG1/Securin (Zou et al.
(1999) Science 285:418-422; Yu et al. (2000) Mol Endocrinol 14:1137-1146),
KIAA0008/HURP (Bassal et al. (2001) Genomics 77:5-7; Tsou et al. (2003) Oncogene
22:298-307);

3) upregulation in human tumors: CKS1 (Inui et al. (2003) BBRC 303:978-984), Survivin
(Ambrosini et al. (1997) Nature Med 3:917-921), PTTG1/Securin (Heaney et al. (2000)
Lancet 355:716-719; Zou et al. (1999) Science 285:418-422), PTTG2/Securin2
(polypeptide sequence SEQ ID NO: 357, nucleotide sequence SEQ ID NO: 358) (Chen et
al. (2000) Gene 248:41-50), PTTG3/Securin3 (polypeptide sequence SEQ ID NO: 359,
nucleotide sequence SEQ ID NO: 360) (Chen et al. (2000) Gene 248:41-50),
HMMR/RHAMM (Tolg et al. (2003) Oncogene 22:6873-6882), KIAA0008/HURP
(Bassal et al. (2001) Genomics 77:5-7; Tsou et al. (2003) Oncogene 22:298-307),
CDCA7/JP01 (polypeptide sequence of variant 1, SEQ ID NO: 361, nucleotide sequence
of variant 1, SEQ ID NO: 362; polypeptide sequence of isoform 2, SEQ ID NO: 363,
nucleotide sequence of transcript variant 2, SEQ ID NO: 364) (Prescott et al. (2001) J Biol
Chem 276:48276-48284);



4) negative regulation by the p53 tumor suppressor: Survivin (Hoffman et al. (2002) J Biol
Chem 277:3247-3257; Mirza et al. (2002) Oncogene 21:2613-2622), PTTG1/Securin
(Zhou et al. (2003) J Biol Chem 278:462-470);

5) stimulation of angiogenesis: Survivin (O'Connor et al. (2000) Am J Path 156:393-398;
Papapetropoulos et al. (2000) J Biol Chem 275:9102-9105; Mesri et al. (2001) Am J Path
158:1757-1765), PTTG1/Securin (Ishikawa et al. (2001) J Clin Endocrinol Metab
86:867-874; McCabe et al. (2002) J Clin Endocrinol Metab 87:4238-4244).

In addition, survivin has been shown to be a critical anti-apoptotic factor at the interface 
between cell cycle/mitosis and apoptosis (Li et al. (1998) Nature 396:580-584; Li et al. (1999) 
Nature Cell Biol 1:461-466), which plays an important role in the control of endothelial cell 
apoptosis (O'Connor et al. (2000) Am J Path 156:393-398; Papapetropoulos et al. (2000) J Biol 
Chem 275:9102-9105; Mesri et al. (2001) Am J Path 158:1757-1765). Downregulation of survivin 
expression by THAP1 may therefore contribute to its pro-apoptotic activity (see Example 10). 
Simultaneous downregulation by THAP1 of all these genes critical for cell cycle/cell proliferation
and/or apoptosis (CKS1, Survivin, PTTG1/Securin, MAD2L1, USP16, HMMR) is expected to
result in cell cycle block and inhibition of cell proliferation. Accordingly, we found that
overexpression of THAP1 in primary human endothelial cells or human U2OS osteosarcoma cancer
cells resulted in inhibition of cell proliferation after a few days, followed by apoptosis.

EXAMPLE 46 

THAP1 responsive elements in cell cycle-specific THAP1 target genes 
We searched the promoters of the THAP1-target genes for the presence of THAP1-responsive
elements. This analysis allowed us to identify candidate DR-5 or THRE motifs that may
mediate direct binding of THAP1 to the promoters of its target genes. A candidate DR5-type
THAP1 responsive element (GGGCAAnnnnnGGGCAC) (SEQ ID NO: 316) located in the
antisense orientation close to the AUG codon of the human survivin/BIRC5 gene is shown in Figure
30. A candidate THRE-type THAP1 responsive element (AGTGTGGGCAT) (SEQ ID NO: 318)
located in the antisense orientation near the TATA box of the Ubiquitin specific protease 16 gene is
shown in Figure 31.
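
The tool used for the promoter search is not described. A minimal sketch of such a search, scanning both strands of a sequence for the two motifs given above, is shown here. The helper names and the short test sequences are hypothetical and synthetic; they are not taken from the survivin or USP16 promoters.

```python
import re

# Motifs as given in the text; N stands for any nucleotide.
DR5 = "GGGCAANNNNNGGGCAC"
THRE = "AGTGTGGGCAT"

_REGEX_FOR_BASE = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "[ACGT]"}
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (N is kept as N)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_motif(sequence, motif):
    """Return sorted (start, strand) pairs for every occurrence of the motif.

    start is the 0-based position on the given (sense) strand; strand is
    '+' for the sense orientation and '-' for the antisense orientation.
    """
    sequence = sequence.upper()
    hits = []
    for strand, oriented in (("+", motif.upper()), ("-", reverse_complement(motif))):
        body = "".join(_REGEX_FOR_BASE[base] for base in oriented)
        # A lookahead is used so that overlapping occurrences are all reported.
        for match in re.finditer("(?=(" + body + "))", sequence):
            hits.append((match.start(), strand))
    return sorted(hits)


# Synthetic test sequence carrying one DR5 site in the antisense orientation.
promoter = "TTAC" + reverse_complement("GGGCAAACGTAGGGCAC") + "GGAT"
print(find_motif(promoter, DR5))
print(find_motif("AAAGTGTGGGCATTT", THRE))
```

Searching for the reverse complement on the sense strand is equivalent to searching the antisense strand directly, and keeps all reported coordinates in one frame of reference.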

EXAMPLE 47 
Chemokine SLC/CCL21 modulates transcription of
cell-cycle specific genes in a THAP1-dependent manner
To examine the effects of the nuclear SLC/THAP1 complex on global expression profiles
in human primary endothelial cells (HUVEC), we performed microarray experiments with cells
successively transduced with SLC/CCL21 chemokine and THAP1 (SLC/THAP) retrovirus
expression vectors or control cells transduced with MCS/THAP1 or SLC/MCS retrovirus
expression vectors. A hierarchical cluster analysis was performed based on similarity of expression
patterns of genes.
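
The clustering parameters are not given. As an illustrative sketch only, hierarchical clustering by similarity of expression pattern can be carried out with a correlation-based distance and average linkage. Both choices, the function names and the placeholder profiles below are assumptions; the analysis software used may apply different settings.

```python
import math


def pearson_distance(x, y):
    """1 - Pearson correlation of two expression profiles (0 = same shape).

    Profiles must not be constant, otherwise the correlation is undefined.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return 1.0 - cov / (sd_x * sd_y)


def average_linkage(profiles, n_clusters):
    """Agglomerative clustering of genes, merging until n_clusters remain.

    profiles: dict mapping gene name -> list of log ratios, one per condition.
    """
    clusters = [[gene] for gene in sorted(profiles)]

    def linkage(c1, c2):
        # Average of all pairwise distances between the two clusters.
        total = sum(pearson_distance(profiles[a], profiles[b]) for a in c1 for b in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > n_clusters:
        pairs = [(i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))]
        i, j = min(pairs, key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        clusters[i] = sorted(clusters[i] + clusters[j])
        del clusters[j]  # j > i, so index i is unaffected
    return sorted(clusters)


# Placeholder log ratios across three hypothetical conditions.
profiles = {
    "GENE_A": [0.0, -1.0, -2.0],
    "GENE_B": [0.1, -0.8, -2.2],
    "GENE_C": [0.0, 1.0, 2.1],
    "GENE_D": [0.1, 0.9, 2.0],
}
print(average_linkage(profiles, 2))
```

A correlation-based distance groups genes by the shape of their response across conditions rather than by its magnitude, which is what "similarity of expression patterns" usually refers to.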

Table 3A lists the genes downregulated by the SLC/THAP1 complex in human primary
endothelial cells, as revealed by the above-described microarray experiments. For each gene, the
fold changes, p values and signal intensities obtained in the three microarray experiments are
indicated.

Table 3B lists database accession numbers and SEQ ID NOs. for each gene and
corresponding polypeptide listed in Table 3A.
Experiment Name

Sequence Description

and response to oxidative stress

Ubiquitin-specific protease 15, a member of the ubiquitin-specific cysteine (thiol) protease family,
cleaves ubiquitin from ubiquitin-conjugated protein substrates, may play a role in growth
regulation

Protein with strong similarity to dihydrofolate reductase (mouse Dhfr), which catalyses
NADPH-dependent reduction of dihydrofolate to tetrahydrofolate, member of the dihydrofolate
reductase family

DNA (cytosine-5-)-methyltransferase, may carry out both de novo and maintenance
DNA methylation, deregulated expression may promote cellular transformation
No cluster of upregulated genes was found in SLC/THAP1-expressing cells. In contrast,
several clusters of genes downregulated by the SLC/THAP1 complex were discerned, which were
not affected when the chemokine was expressed alone (Table 3A). Most of these genes were also
downregulated by THAP1 without chemokine; however, the chemokine greatly enhanced their
downregulation (co-repressor effect).

We identified ~120 candidate target genes (out of 17000 genes on the microarrays) that are
downregulated in SLC/THAP1-overexpressing cells. One of these genes corresponds to THAP1
itself (FLJ10477), and many other genes correspond to predicted proteins with unknown functions.
Strikingly, most of the genes encoding proteins with known functions (60 genes) that are
downregulated by the SLC/THAP1 complex (Tables 3A and 4) correspond to genes encoding
proteins previously linked to cell cycle/cell proliferation (Ishida et al. (2001) Mol Cell Biol
21:4684-4699; Whitfield et al. (2002) Mol Biol Cell 13:1977-2000): G2/M phase specific genes
involved in mitosis (38 genes) and S phase specific genes involved in DNA replication or DNA
repair (22 genes). Interestingly, many of these cell-cycle specific genes (26 genes, indicated in
italics in Table 4) have previously been shown to be regulated positively by the cell-cycle specific
transcription factor E2F (Ishida et al. (2001) Mol Cell Biol 21:4684-4699; Ren et al. (2002) Genes
Dev 16:245-256), suggesting that the SLC/THAP1 complex interferes in some way with E2F-mediated
activities. In addition to the cell cycle specific genes, genes encoding splicing factors (5 genes) and
anti-apoptotic factors (2 genes including survivin) were also identified as target genes
downregulated by the SLC/THAP1 complex (Table 4). Together, these results indicate that the nuclear
chemokine SLC/THAP1 complex modulates transcription profiles in human primary endothelial
cells and appears to be a critical regulator of cell cycle/cell proliferation and/or survival.
EXAMPLE 48
Chemokines SLC/CCL21 and MIG/CXCL9 modulate
transcription of pro-inflammatory chemokine genes
To examine the expression of nuclear chemokines SLC/CCL21 and MIG/CXCL9, we
performed DNA microarray analysis of HUVEC cells transduced with SLC/CCL21 or
MIG/CXCL9 retrovirus vectors or MCS control vector. Cluster analysis was performed based on
similarity of expression patterns of genes.

Table 5A lists 5 genes encoding pro-inflammatory chemokines that are downregulated by
chemokines SLC/CCL21 and MIG/CXCL9 in human primary endothelial cells, as revealed by the
above-described microarray experiments. For each chemokine gene, the fold changes, p values and signal
intensities obtained in the two microarray experiments are indicated.

Table 5B lists database accession numbers and SEQ ID NOs. for each gene and
corresponding polypeptide listed in Table 5A.
The chemokines SLC/CCL21 or MIG/CXCL9 expressed alone induced changes in the
HUVEC gene expression profile characterized by distinct clusters of genes upregulated or
downregulated. Interestingly, the main cluster of genes downregulated by both SLC/CCL21 and
MIG/CXCL9 corresponded to genes encoding pro-inflammatory chemokines GRO1/CXCL1,
GRO2/CXCL2, GRO3/CXCL3, IL8/CXCL8 and MCP1/CCL2 (Table 5A). Together, these results
indicate that nuclear chemokines SLC/CCL21 and MIG/CXCL9 are able to modulate transcription
profiles in human primary endothelial cells and may have anti-inflammatory effects by inhibiting
expression of pro-inflammatory chemokines.

EXAMPLE 49 

Construction of Adenovirus Vectors for Expressing THAP-Family Polypeptides and Chemokines
This example illustrates the construction of adenovirus vectors comprising nucleic acids 
encoding THAP1, SLC and MIG. It will be appreciated that these methods can be applied to other 
THAP-family polypeptides, chemokines and/or chemokine receptors as desired. 

The full-length cDNA encoding human THAP1 (SEQ ID NO: 160) is amplified from
human cDNA. Similarly, mature forms (forms lacking a signal peptide) of the chemokines SLC
and MIG can be amplified from human cDNA. The resulting PCR products are purified from an
agarose gel and then ligated into a TA-cloning vector, such as pCR2.1 (Invitrogen, Carlsbad,
Calif.). Once the cDNA insert sequence is verified by sequence analysis, the plasmid containing
the insert of interest is digested to remove the cDNA insert, which is then blunt-ended with T4
DNA polymerase, gel purified and ligated into the EcoRV site of the adenoviral shuttle vector
pAvS6a to form pAvS6a-THAP1, pAvS6a-SLC or pAvS6a-MIG. Finally, a fragment which
contains the cDNA insert of interest is removed from each of the pAvS6a recombinant vectors
using appropriate restriction enzymes and then subcloned into pAvS6alx (a shuttle vector
containing a lox site, Genetic Therapy, Inc., Gaithersburg, Md.) to generate, for example,
pAvhTHAP1lx. The expression cassettes thereby generated include the gene of interest, a
constitutive RSV promoter, a 198 bp fragment containing the adeno-tripartite leader sequence, a lox
recombination sequence, and an SV40 early polyadenylation signal.

The recombinant adenoviruses encoding human THAP1 (Av3hTHAP1), SLC (Av3hSLC) or
MIG (Av3hMIG) are constructed by a rapid vector generation protocol using Cre recombinase-mediated
recombination of two lox-site containing plasmids, pSQ3 (containing the right hand
portion of the adenoviral vector genome), and the adenoviral shuttle plasmid pAvhTHAP1lx
(containing the left end of the viral genome and the hTHAP1 expression cassette), pAvhSLClx or
pAvhMIGlx. The pSQ3 (digested with ClaI), pAvhTHAP1lx, pAvhSLClx or pAvhMIGlx
(linearized with NotI), and the Cre-encoding plasmid, pC-Cre3.1, are cotransfected using CaPO4
(Promega's Profection kit) into S8 cells (A549 cells stably transfected with E1/E2a regions under
dexamethasone inducible promoters (Gorziglia et al., J. Virol. 70:4173-4178, 1996)). Following
treatment with dexamethasone the plasmids are joined by Cre-mediated recombination, generating
the adenovirus encoding THAP1 (Av3hTHAP1), SLC (Av3hSLC) or MIG (Av3hMIG). A control
vector, Av3Null, is generated in a similar manner, but lacks a transgene.

To amplify the virus, the S8 cells are harvested a week after transfection and passaged until
a cytopathic effect (CPE) is observed. For the passage, cells are freeze/thawed to obtain a crude
viral lysate (CVL), which is centrifuged to remove the cell debris and then used to infect fresh S8
cells. Cells are harvested when CPE is observed (typically after one week). DNA is isolated from
the CVL and the appropriate cre-lox mediated recombination event is confirmed by restriction
digest. For purification of the vector, cell pellets are freeze/thawed and the cell debris is pelleted
by centrifugation. The supernatant is loaded on a discontinuous cesium chloride gradient (1.25
g/ml CsCl and 1.4 g/ml CsCl) and centrifuged for 1 hr at 28,000 rpm (in a SW28 swing bucket
rotor). The bottom viral band is pulled from the gradient and centrifuged on a CsCl continuous
gradient (1.33 g/ml CsCl) overnight at 60,000 rpm (in an NVT-65 rotor). The purified viral band is
pulled from the gradient, glycerol is added to a final concentration of 10% and the mixture is then
dialyzed in 200 mM Tris pH 8.0, 50 mM Hepes, 10% glycerol. The concentration of vector can be
determined by spectrophotometric analysis (Mittereder et al., J. Virol. 70:7498-7509, 1996).
Purified vector is then aliquoted and stored at -70°C.
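
The spectrophotometric calculation is not detailed here. As a sketch, assuming the commonly used conversion of roughly 1.1 x 10^12 adenoviral particles per ml per absorbance unit at 260 nm, the particle concentration of a purified stock and the volume needed for a given particle dose can be estimated as follows. The conversion factor should be checked against the cited method; the function names and the numerical inputs are hypothetical.

```python
# Assumed conversion factor: particles per ml for one A260 unit.
PARTICLES_PER_ML_PER_A260 = 1.1e12


def particles_per_ml(a260, dilution_factor):
    """Estimate the particle concentration of the undiluted vector stock.

    a260: absorbance at 260 nm measured on the diluted sample.
    dilution_factor: e.g. 20 for a 1:20 dilution.
    """
    return a260 * dilution_factor * PARTICLES_PER_ML_PER_A260


def volume_for_dose_ul(dose_particles, stock_particles_per_ml):
    """Volume of stock (in microlitres) that contains the requested dose."""
    return dose_particles / stock_particles_per_ml * 1000.0


# Hypothetical reading: A260 = 0.25 on a 1:20 dilution of the stock.
stock = particles_per_ml(0.25, 20)
print("%.2e particles/ml" % stock)
print("%.1f ul for 1e11 particles" % volume_for_dose_ul(1e11, stock))
```

Particle counts obtained this way include non-infectious particles, so they are not interchangeable with infectious titers.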

Av3hTHAP1, Av3hSLC and Av3hMIG vector expression is examined in HUVEC cells.
The cells are treated for 1 hour with varying multiplicities of infection of Av3hTHAP1, Av3hSLC,
Av3hMIG or Av3Null or left untreated. Two days following treatment, cell extracts are prepared
and Western blot analysis is performed using an antibody specific for THAP1, SLC or MIG. The
biological activity of the expressed THAP1 protein is confirmed using the serum starvation assays
as described in Examples 10 and 11. Alternatively, the effect of THAP1, SLC, MIG, or
combinations of these polypeptides on gene transcription can be determined by comparing
transcriptional activities of cells transfected with one or more of Av3hTHAP1, Av3hSLC and
Av3hMIG with the transcriptional activities of cells transfected with Av3Null. Assays for
determining gene expression as well as several genes modified by THAP1 and THAP1/chemokine
complexes have been described in Examples 44-47.

It will be appreciated by one of ordinary skill in the art that vectors which express both a
chemokine and a THAP-family polypeptide or biologically active fragment thereof can also
be constructed using the methods described above. Additionally, a skilled artisan will recognize
that vectors other than adenovirus vectors can be used to generate constructs capable of expressing a
chemokine and/or a THAP-family polypeptide or a biologically active fragment thereof. Such
vectors include, but are not limited to, adeno-associated virus vectors, lentivirus vectors and
retrovirus vectors. Additionally, non-viral vectors may be used.
EXAMPLE 50
Expression of THAP-Family Polypeptides and
Chemokines in a Mouse Model of Rheumatoid Arthritis
This example illustrates the use of adenovirus vectors to deliver nucleic acids encoding
THAP1, SLC, MIG or combinations of these polypeptides to inflamed tissue in a mouse model for
rheumatoid arthritis, the well-known collagen-induced arthritis model.

Male DBA/1 mice are prepared as in Example 36 above. For viral dosing of mice, the
DBA/1 mice are administered recombinant adenoviruses via tail vein injection using a 0.5 ml
tuberculin syringe at doses of 0.6-1.2 x 10^11 viral particles/animal. Four groups of animals
(n=5-15/group) are treated with either Av3hTHAP1, Av3hSLC, Av3hMIG, combinations of these
recombinant viruses, Av3Null or buffer only.

The incidence and severity of arthritis are monitored in a blind study. Each paw is assigned
a score from 0 to 4 as follows: 0=normal; 1=swelling in 1 to 3 digits; 2=mild swelling in ankles,
forepaws, or more than 3 digits; 3=moderate swelling in multiple joints; 4=severe swelling with
loss of function. The paw scores are totaled for a cumulative score/mouse. The cumulative scores are then
totaled for mice in each group for a mean clinical score. The capacity for THAP1 or
THAP1/chemokine combinations to reduce the disease incidence and severity of arthritis is
determined by comparison of the treatment groups to the control groups.
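
The scoring scheme above can be sketched as follows. The per-paw scale and the per-mouse and per-group totals follow the text; the definition of incidence as the fraction of mice with a cumulative score above zero, the function names and the example scores are assumptions made for illustration.

```python
def mouse_score(paw_scores):
    """Cumulative score for one mouse: the sum of four paw scores (each 0 to 4)."""
    if len(paw_scores) != 4 or any(score not in range(5) for score in paw_scores):
        raise ValueError("expected four integer paw scores between 0 and 4")
    return sum(paw_scores)


def group_summary(group):
    """Summarize one treatment group.

    group: list of per-mouse paw-score lists.
    Returns (mean clinical score, incidence), where incidence is taken here
    to be the fraction of mice with a cumulative score above zero (assumption).
    """
    totals = [mouse_score(paws) for paws in group]
    mean_score = sum(totals) / len(totals)
    incidence = sum(1 for total in totals if total > 0) / len(totals)
    return mean_score, incidence


# Hypothetical scores for three mice in one group.
example_group = [[0, 0, 0, 0], [1, 2, 0, 0], [4, 3, 2, 1]]
mean_score, incidence = group_summary(example_group)
print(round(mean_score, 2), round(incidence, 2))
```

Computing the same summary for each treatment group and for the Av3Null and buffer-only groups gives the values that are compared between treatment and control.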

It will be appreciated by one of ordinary skill in the art that expression of a chemokine
and/or a THAP-family polypeptide or a biologically active fragment thereof can be used to
ameliorate the symptoms associated with any THAP-related condition. In some embodiments such
expression can be the result of gene therapy.

The methods, compositions, and devices described herein are presently representative of
preferred embodiments and are exemplary and are not intended as limitations on the scope of the
invention. Changes therein and other uses will occur to those skilled in the art which are
encompassed within the spirit of the invention and are defined by the scope of the disclosure.
Accordingly, it will be apparent to one skilled in the art that varying substitutions and modifications
may be made to the invention disclosed herein without departing from the scope and spirit of the
invention.

As used in the claims below and throughout this disclosure, by the phrase "consisting
essentially of" is meant including any elements listed after the phrase, and limited to other elements
that do not interfere with or contribute to the activity or action specified in the disclosure for the
listed elements. Thus, the phrase "consisting essentially of" indicates that the listed elements are
required or mandatory, but that other elements are optional and may or may not be present
depending upon whether or not they affect the activity or action of the listed elements.
WHAT IS CLAIMED IS:

1. A method of modulating expression of a THAP responsive gene, said method
comprising modulating the interaction of a THAP-family polypeptide or a biologically active
fragment thereof with a nucleic acid, thereby enhancing or repressing expression of said THAP
responsive gene.

2. The method of Claim 1, wherein said THAP-family polypeptide is THAP1. 

3. The method of Claim 1, wherein said nucleic acid is a THAP responsive promoter.

4. The method of Claim 3, wherein said THAP responsive promoter comprises a 
THAP responsive element. 

5. The method of Claim 4, wherein said THAP responsive element is a DR-5 element.

6. The method of Claim 4, wherein said THAP responsive element is an ER-11 
element. 

7. The method of Claim 4, wherein said THAP responsive element is THRE. 

8. The method of Claim 3, wherein said THAP responsive promoter does not
comprise a THAP responsive element.

9. The method of Claim 8, wherein said THAP responsive promoter is modulated by a 
product of a gene that is under the control of a promoter which comprises a THAP responsive 
element. 

10. The method of Claim 1, wherein said THAP responsive gene is selected from the
group consisting of Survivin, PTTG1/Securin, PTTG2/Securin, PTTG3/Securin, CKS1, MAD2L1,
USP16/Ubp-M, HMMR/RHAMM, KIAA0008/HURP, CDCA7/JP01 and THAP1.

11. The method of Claim 1, wherein said THAP responsive gene encodes a polypeptide 
involved in the G2 or M phase of the cell cycle. 

12. The method of Claim 1, wherein said THAP responsive gene encodes a polypeptide
involved in the S phase of the cell cycle.

13. The method of Claim 12, wherein said THAP responsive gene encodes a 
polypeptide involved in DNA replication. 

14. The method of Claim 12, wherein said THAP responsive gene encodes a 
polypeptide involved in DNA repair. 

15. The method of Claim 1, wherein said THAP responsive gene encodes a polypeptide
involved in RNA splicing.

16. The method of Claim 1, wherein said THAP responsive gene encodes a polypeptide 
involved in apoptosis. 

17. The method of Claim 1, wherein said THAP responsive gene encodes a polypeptide
involved in angiogenesis.

18. The method of Claim 1, wherein said THAP responsive gene encodes a polypeptide 
involved in the proliferation of cancer cells. 
19. The method of Claim 1, wherein said THAP responsive gene encodes a polypeptide
involved in inflammatory disease.

20. A method of modulating the expression of a gene responsive to a THAP/chemokine
complex, said method comprising modulating the interaction of a chemokine with a THAP-family
polypeptide or a biologically active fragment thereof, thereby enhancing or repressing expression of
said gene.

21. The method of Claim 20, wherein said THAP-family polypeptide is THAP1. 

22. The method of Claim 20, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

23. The method of Claim 20, wherein said chemokine is SLC.

24. The method of Claim 20, wherein said chemokine is CXCL9.

25. The method of Claim 20, wherein the interaction between said chemokine and said 
THAP-family polypeptide is modulated by providing a THAP-type chemokine-binding agent. 

26. The method of Claim 25, wherein said THAP-type chemokine-binding agent
comprises a polypeptide selected from the group consisting of a THAP1 polypeptide, a
chemokine-binding domain of a THAP1 polypeptide, a THAP1 polypeptide oligomer, an oligomer
comprising a THAP1 chemokine-binding domain, a THAP1 polypeptide-immunoglobulin fusion, a
THAP1 chemokine-binding domain-immunoglobulin fusion and polypeptide homologs of any one
of the aforementioned polypeptides.

27. The method of Claim 26, wherein said chemokine-binding domain is an SLC-binding
domain.

28. The method of Claim 26, wherein said chemokine-binding domain is a CXCL9- 
binding domain. 

29. The method of Claim 20, wherein said gene encodes a polypeptide involved in the
G2 or M phase of the cell cycle.

30. The method of Claim 20, wherein said gene encodes a polypeptide involved in the 
S phase of the cell cycle. 

31. The method of Claim 30, wherein said gene encodes a polypeptide involved in 
DNA replication. 

32. The method of Claim 30, wherein said gene encodes a polypeptide involved in
DNA repair.

33. The method of Claim 20, wherein said gene encodes a polypeptide involved in 
RNA splicing. 

34. The method of Claim 20, wherein said gene encodes a polypeptide involved in
apoptosis.

35. The method of Claim 20, wherein said gene encodes a polypeptide involved in 
angiogenesis. 
36. The method of Claim 20, wherein said gene encodes a polypeptide involved in the
proliferation of cancer cells.

37. The method of Claim 20, wherein said gene encodes a polypeptide involved in 
inflammatory disease. 

38. A method of modulating the expression of a gene responsive to a THAP/chemokine 
complex, said method comprising modulating the interaction of a THAP/chemokine complex with a 
nucleic acid, thereby enhancing or repressing expression of said gene. 

39. The method of Claim 38, wherein said THAP-family polypeptide is THAP1.

40. The method of Claim 38, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

41. The method of Claim 38, wherein said chemokine is SLC.

42. The method of Claim 38, wherein said chemokine is CXCL9. 

43. The method of Claim 38, wherein said gene encodes a polypeptide involved in the 
G2 or M phase of the cell cycle. 

44. The method of Claim 38, wherein said gene encodes a polypeptide involved in the 
S phase of the cell cycle. 

45. The method of Claim 44, wherein said gene encodes a polypeptide involved in 
DNA replication. 

46. The method of Claim 44, wherein said gene encodes a polypeptide involved in 
DNA repair. 

47. The method of Claim 38, wherein said gene encodes a polypeptide involved in 
RNA splicing. 

48. The method of Claim 38, wherein said gene encodes a polypeptide involved in 
apoptosis. 

49. The method of Claim 38, wherein said gene encodes a polypeptide involved in 
angiogenesis. 

50. The method of Claim 38, wherein said gene encodes a polypeptide involved in the 
proliferation of cancer cells. 

51. The method of Claim 38, wherein said gene encodes a polypeptide involved in 
inflammatory disease. 

52. The method of Claim 38, wherein said nucleic acid is a THAP responsive 
promoter. 

53. The method of Claim 52, wherein said THAP responsive promoter comprises a 
THAP responsive element. 

54. The method of Claim 53, wherein said THAP responsive element is a DR-5 
element. 
55. The method of Claim 53, wherein said THAP responsive element is an ER-11
element.

56. The method of Claim 53, wherein said THAP responsive element is THRE. 

57. The method of Claim 52, wherein said THAP responsive promoter does not
comprise a THAP responsive element.

58. The method of Claim 57, wherein said THAP responsive promoter is modulated by 
a product of a gene that is under the control of a promoter which comprises a THAP responsive 
element. 

59. A pharmaceutical composition comprising a THAP responsive element in a
pharmaceutically acceptable carrier.

60. The pharmaceutical composition of Claim 59, wherein said THAP responsive 
element is a DR-5 element. 

61. The pharmaceutical composition of Claim 59, wherein said THAP responsive
element is an ER-11 element.

62. The pharmaceutical composition of Claim 59, wherein said THAP responsive
element is a THRE.

63. A transcription factor decoy consisting essentially of a THAP responsive element. 

64. The transcription factor decoy of Claim 63, wherein said THAP responsive element 
is a DR-5 element. 

65. The transcription factor decoy of Claim 63, wherein said THAP responsive element
is an ER-11 element.

66. The transcription factor decoy of Claim 63, wherein said THAP responsive element
is a THRE element.

67. A cell comprising a transcription factor decoy of Claim 63.

68. A method of modulating the interaction between a nucleic acid and a THAP-family
polypeptide or a biologically active fragment thereof, said method comprising providing a
transcription factor decoy which comprises a THAP responsive element, thereby modulating the
interaction between said nucleic acid and said THAP-family polypeptide or a biologically active
fragment thereof.

69. The method of Claim 68, wherein said THAP-family polypeptide is THAP1.

70. The method of Claim 68, wherein said THAP responsive element is a DR-5
element.

71. The method of Claim 68, wherein said THAP responsive element is an ER-11
element.

72. The method of Claim 68, wherein said THAP responsive element is THRE.

73. A method of modulating the interaction between a nucleic acid and a
THAP/chemokine complex, said method comprising providing a transcription factor decoy which
comprises a THAP responsive element, thereby modulating the interaction between said nucleic
acid and said THAP/chemokine complex.

74. The method of Claim 73, wherein said THAP-family polypeptide is THAP1. 

75. The method of Claim 73, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

76. The method of Claim 73, wherein said chemokine is SLC. 

77. The method of Claim 73, wherein said chemokine is CXCL9. 

78. The method of Claim 73, wherein said THAP responsive element is a DR-5 
element. 

79. The method of Claim 73, wherein said THAP responsive element is an ER-11
element.

80. The method of Claim 73, wherein said THAP responsive element is THRE. 

81. A vector packaging cell line comprising a cell comprising a viral vector which
comprises a promoter operably linked to a nucleic acid encoding a THAP-family polypeptide or a
biologically active fragment thereof.

82. The cell line of Claim 81, wherein said cell further comprises an introduced nucleic 
acid construct comprising a nucleic acid encoding a chemokine operably linked to a promoter. 

83. The cell line of Claim 82, wherein said chemokine-encoding construct is included
on the same vector as said nucleic acid encoding said THAP-family polypeptide or biologically
active fragment thereof.

84. The cell line of Claim 82, wherein said nucleic acid encoding said chemokine
encodes a chemokine selected from the group consisting of SLC, CCL19, CCL5, CXCL11,
CXCL10 and CXCL9.

85. The cell line of Claim 82, wherein said nucleic acid encoding said chemokine
encodes SLC.

86. The cell line of Claim 82, wherein said nucleic acid encoding said chemokine 
encodes CXCL9. 

87. The cell line of Claim 81, wherein said THAP-family polypeptide is THAP1.

88. The cell line of Claim 81, wherein said cell is a mammalian cell.

89. The cell line of Claim 88, wherein said cell is a human cell.

90. The cell line of Claim 81, wherein said viral vector is an adenoviral vector. 

91. The cell line of Claim 81, wherein said viral vector is a retroviral vector. 

92. A cell which is genetically engineered to express a THAP-family polypeptide or a 
biologically active fragment thereof. 

93. The cell line of Claim 92, wherein said THAP-family polypeptide is THAP1.

94. The cell line of Claim 92, wherein said cell is a mammalian cell. 

95. The cell line of Claim 92, wherein said cell is a human cell.

96. The cell line of Claim 92, wherein said THAP family polypeptide is encoded by a
gene that is introduced into the cell on an adenoviral vector.

97. The cell line of Claim 92, wherein said THAP family polypeptide is encoded by a 
gene that is introduced into the cell on a retroviral vector. 

98. A method of constructing a cell which expresses a recombinant THAP-family 
polypeptide, said method comprising introducing into a cell a vector comprising a nucleic acid 
encoding a THAP-family polypeptide or a biologically active fragment thereof operably linked to a 
promoter. 

99. The method of Claim 98, further comprising introducing into a cell a nucleic acid
construct comprising a nucleic acid encoding a chemokine operably linked to a promoter.

100. The method of Claim 99, wherein said chemokine-encoding construct is included 
on the same vector as said nucleic acid encoding said THAP-family polypeptide or biologically 
active fragment thereof. 

101. The method of Claim 99, wherein said nucleic acid encoding said chemokine
encodes a chemokine selected from the group consisting of SLC, CCL19, CCL5, CXCL11,
CXCL10 and CXCL9.

102. The method of Claim 99, wherein said nucleic acid encoding said chemokine 
encodes SLC. 

103. The method of Claim 99, wherein said nucleic acid encoding said chemokine 
encodes CXCL9. 

104. The method of Claim 98, wherein said THAP-family polypeptide is THAP1.

105. The method of Claim 98, wherein said cell is a mammalian cell. 

106. The method of Claim 105, wherein said cell is a human cell. 

107. The method of Claim 98, wherein said vector is a viral vector.

108. The method of Claim 107, wherein said vector is an adenoviral vector. 

109. The method of Claim 107, wherein said vector is a retroviral vector. 

110. The method of Claim 98, wherein said vector is introduced into said cell by 
transfection. 

111. A method of ameliorating symptoms associated with a condition mediated by a 
THAP/chemokine complex, said method comprising: 

introducing into a cell a nucleic acid construct comprising a nucleic acid encoding a 
chemokine operably linked to a promoter and a nucleic acid construct comprising a nucleic 
acid encoding a THAP-family polypeptide or a biologically active fragment thereof 
operably linked to a promoter; and 

expressing said nucleic acid encoding said chemokine and said nucleic acid 
encoding said THAP-family polypeptide or biologically active fragment thereof.

112. The method of Claim 111, wherein said nucleic acid constructs are present on a
single vector.

113. The method of Claim 111, wherein said nucleic acid constructs are present on 
different vectors. 

114. The method of Claim 111, wherein said cell is a mammalian cell.

115. The method of Claim 114, wherein said cell is a human cell.

116. The method of Claim 111, wherein said nucleic acid encoding said chemokine
encodes a chemokine selected from the group consisting of SLC, CCL19, CCL5, CXCL11,
CXCL10 and CXCL9.

117. The method of Claim 111, wherein said nucleic acid encoding said chemokine 
encodes SLC. 

118. The method of Claim 111, wherein said nucleic acid encoding said chemokine 
encodes CXCL9. 

119. The method of Claim 111, wherein said THAP-family polypeptide is THAP1. 

120. A method of identifying a test compound that modulates transcription at a THAP 
responsive element, said method comprising: 

comparing the level of transcription from a THAP responsive promoter in the 
presence and absence of a test compound wherein a determination that the level of 
transcription is increased or decreased in the presence of said test compound relative to the 
level of transcription in the absence of said test compound indicates that said test compound 
is a candidate modulator of transcription. 

121. The method of Claim 120, wherein the level of transcription from said THAP 
responsive promoter in the presence and absence of the test compound is determined by performing 
an in vitro transcription reaction using a construct comprising said THAP responsive promoter and 
a THAP-family polypeptide or a biologically active fragment thereof, wherein said THAP-family 
polypeptide comprises an amino acid sequence having at least 30% amino acid identity to an amino 
acid sequence of SEQ ID NO: 1.

122. The method of Claim 120, wherein the level of transcription from said THAP 
responsive promoter in the presence and the absence of the test compound is determined by 
measuring the level of transcription from a THAP responsive promoter in a cell expressing a 
THAP-family polypeptide or a biologically active fragment thereof, wherein said THAP-family 
polypeptide comprises an amino acid sequence having at least 30% amino acid identity to an amino 
acid sequence of SEQ ID NO: 1.

123. The method of Claim 120, wherein said THAP-family polypeptide or biologically 
active fragment thereof is selected from the group consisting of SEQ ID NOs: 1-114 and 
biologically active fragments thereof.

124. The method of Claim 120, wherein said THAP responsive promoter comprises a
THAP responsive element having a nucleotide sequence selected from the group consisting of SEQ
ID NOs: 140-159, SEQ ID NO: 306, and homologs thereof having at least 60% nucleotide identity.

125. The method of Claim 121 or Claim 122, wherein the level of transcription in the 
presence or absence of said test compound is measured in the presence of a chemokine. 

126. The method of Claim 125, wherein said chemokine is selected from the group 
consisting of CCL family chemokines and CXCL family chemokines. 

127. The method of Claim 126, wherein said CCL family chemokine is selected from
the group consisting of SLC, CCL19 and CCL5.

128. The method of Claim 126, wherein said CXCL family chemokine is selected from
the group consisting of CXCL11, CXCL10 and CXCL9.

129. The method of Claim 125, wherein the level of transcription in the presence or 
absence of said test compound is measured in a cell which expresses a receptor for said chemokine. 

130. The method of Claim 129, wherein said chemokine receptor is selected from the
group consisting of CCR1, CCR3, CCR5, CCR7, CCR11 and CXCR3.

131. The method of Claim 130, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

132. The method of Claim 129, wherein said THAP-family polypeptide comprises 
THAP1 or a biologically active fragment thereof and said cell expresses the CCR7 receptor. 

133. The method of Claim 132, wherein said chemokine is SLC.

134. The method of Claim 129, wherein said THAP-family polypeptide comprises
THAP1 or a biologically active fragment thereof and said cell expresses the CXCR3 receptor.

135. The method of Claim 134, wherein said chemokine is CXCL9.

136. The method of Claim 122, wherein said THAP responsive promoter is in a gene 
endogenous to said cell. 

137. The method of Claim 122, wherein said THAP responsive promoter has been 
introduced into said cell. 

138. The method of Claim 122, wherein said THAP responsive promoter does not 
comprise a THAP responsive element. 

139. The method of Claim 138, wherein said THAP responsive promoter is modulated 
by a product of a gene that is under the control of a promoter which comprises a THAP responsive 
element. 

140. A method for reducing the symptoms associated with a condition selected from the 
group consisting of excessive or insufficient angiogenesis, inflammation, metastasis of a cancerous 
tissue, excessive or insufficient apoptosis, cardiovascular disease and neurodegenerative diseases 
comprising modulating the interaction between a THAP-family polypeptide and a chemokine in an 
individual suffering from said condition.

141. The method of Claim 140, wherein said THAP-family polypeptide is selected from
a group consisting of polypeptides having an amino acid sequence of SEQ ID NOs: 1-114.

142. The method of Claim 140, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

143. The method of Claim 140, wherein said chemokine is SLC and the condition is
inflammation.

144. The method of Claim 140, wherein said chemokine is SLC and the condition is 
excessive or insufficient angiogenesis. 

145. The method of Claim 140, wherein said chemokine is CXCL9 and the condition is
inflammation.

146. The method of Claim 140, wherein said chemokine is CXCL9 and the condition is 
excessive or insufficient angiogenesis. 

147. A method for reducing the symptoms associated with a condition resulting from the
activity of a chemokine in an individual comprising modulating the interaction between said
chemokine and a THAP-family polypeptide in said individual.

148. The method of Claim 147, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

149. The method of Claim 147, wherein said chemokine is SLC. 

150. The method of Claim 147, wherein said chemokine is CXCL9. 

151. The method of Claim 147, wherein said THAP-family polypeptide is THAP-1.

152. The method of Claim 147, wherein the condition is inflammation. 

153. The method of Claim 147, wherein the condition is excessive or insufficient 
angiogenesis. 

154. The method of Claim 147, wherein the interaction between said chemokine and
said THAP-family polypeptide is modulated by administering to an individual a therapeutically
effective amount of a THAP-type chemokine-binding agent.

155. The method of Claim 154, wherein said THAP-type chemokine-binding agent
comprises a therapeutically effective amount of a polypeptide selected from the group consisting of
a THAP1 polypeptide, a chemokine-binding domain of a THAP1 polypeptide, a THAP1
polypeptide oligomer, an oligomer comprising a THAP1 chemokine-binding domain, a THAP1
polypeptide-immunoglobulin fusion, a THAP1 chemokine-binding domain-immunoglobulin fusion
and polypeptide homologs having at least 30% amino acid identity to any one of the
aforementioned polypeptides.

156. The method of Claim 155, wherein said chemokine-binding domain is an SLC-
binding domain.

157. The method of Claim 155, wherein said chemokine-binding domain is a CXCL9-
binding domain.

158. A method of reducing the symptoms associated with a condition resulting from the
activity of a THAP-family polypeptide in an individual comprising modulating the extent of
transcriptional repression or activation of at least one THAP-family responsive promoter in said
individual.

159. The method of Claim 158, wherein said THAP-family polypeptide comprises an 
amino acid sequence selected from the group consisting of SEQ ID NOs: 1-114. 

160. The method of Claim 158, wherein said THAP-family polypeptide comprises an 
amino acid sequence of SEQ ID NO: 3. 

161. The method of Claim 158, wherein said THAP responsive promoter comprises a 
THAP responsive element. 

162. The method of Claim 158, wherein said THAP responsive promoter does not
comprise a THAP responsive element.

163. A method of reducing the symptoms associated with a condition resulting from the 
activity of a THAP-family polypeptide in an individual, said method comprising: 

diagnosing said individual with a condition resulting from the activity of a THAP- 
family polypeptide; and 

administering a compound which modulates the interaction between said THAP- 
family polypeptide and a chemokine to said individual. 

164. The method of Claim 163, wherein said THAP-family polypeptide is selected from
a group consisting of polypeptides having an amino acid sequence of SEQ ID NOs: 1-114.

165. The method of Claim 163, wherein said THAP-family polypeptide is THAP1.

166. The method of Claim 163, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

167. The method of Claim 163, wherein said chemokine is SLC.

168. The method of Claim 163, wherein said chemokine is CXCL9. 

169. A method of reducing the symptoms associated with a condition resulting from the 
activity of a THAP-family polypeptide in an individual comprising: 

diagnosing said individual with a condition resulting from the activity of a THAP-
family polypeptide; and

administering a chemokine or an analog thereof to said individual. 

170. The method of Claim 169, wherein said THAP-family polypeptide is selected from
a group consisting of polypeptides having an amino acid sequence of SEQ ID NOs: 1-114.

171. The method of Claim 169, wherein said THAP-family polypeptide is THAP1.

172. The method of Claim 169, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

173. The method of Claim 169, wherein said chemokine is SLC.

174. The method of Claim 169, wherein said chemokine is CXCL9.

175. A method of reducing the symptoms associated with transcriptional repression or
activation mediated by a THAP-family polypeptide in an individual comprising administering a
chemokine or an analog thereof to said individual.

176. The method of Claim 175, wherein said THAP-family polypeptide is selected from
a group consisting of polypeptides having an amino acid sequence of SEQ ID NOs: 1-114.

177. The method of Claim 175, wherein said THAP-family polypeptide is THAP1.

178. The method of Claim 175, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

179. The method of Claim 175, wherein said chemokine is SLC. 

180. The method of Claim 175, wherein said chemokine is CXCL9.

181. A method of reducing the symptoms associated with the activity of a chemokine in
an individual comprising modulating the extent to which said chemokine is transported to the 
nucleus of a cell in said individual. 

182. The method of Claim 181, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

183. The method of Claim 181, wherein said cell expresses a chemokine receptor
selected from the group consisting of CCR1, CCR3, CCR5, CCR7, CCR11 and CXCR3.

184. The method of Claim 183, wherein said chemokine is SLC and said chemokine 
receptor is CCR7. 

185. The method of Claim 183, wherein said chemokine is CXCL9 and said chemokine
receptor is CXCR3.

186. The method of Claim 181, wherein the extent of transport of said chemokine into a 
nucleus of a cell is modulated by contacting said chemokine with a THAP-type chemokine-binding 
agent. 

187. The method of Claim 186, wherein said THAP-type chemokine-binding agent is
selected from the group consisting of a THAP1 polypeptide, a chemokine-binding domain of a
THAP1 polypeptide, a THAP1 polypeptide oligomer, an oligomer comprising a THAP1
chemokine-binding domain, a THAP1 polypeptide-immunoglobulin fusion, a THAP1 chemokine-
binding domain-immunoglobulin fusion and polypeptide homologs having at least 30% amino acid
identity to any one of the aforementioned polypeptides.

188. The method of Claim 187, wherein said chemokine-binding domain is an SLC- 
binding domain. 

189. The method of Claim 187, wherein said chemokine-binding domain is a CXCL9- 
binding domain. 

190. A method for identifying a compound which modulates the transport of a
chemokine into the nucleus comprising comparing the extent of said chemokine transport into the
nucleus of cells in the presence and absence of a test compound.

191. The method of Claim 190, wherein said chemokine is selected from the group
consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and CXCL9.

192. The method of Claim 190, wherein said cell expresses a chemokine receptor
selected from the group consisting of CCR1, CCR3, CCR5, CCR7, CCR11 and CXCR3.

193. The method of Claim 192, wherein said chemokine is SLC and said chemokine 
receptor is CCR7. 

194. The method of Claim 192, wherein said chemokine is CXCL9 and said chemokine
receptor is CXCR3.

195. The method of Claim 190, wherein the extent of transport of said chemokine into a 
nucleus of a cell is modulated by contacting said chemokine with a THAP-type chemokine-binding 
agent. 

196. The method of Claim 195, wherein said THAP-type chemokine-binding agent is 
selected from the group consisting of a THAP1 polypeptide, a chemokine-binding domain of a 
THAP1 polypeptide, a THAP1 polypeptide oligomer, an oligomer comprising a THAP1 
chemokine-binding domain, a THAP1 polypeptide-immunoglobulin fusion, a THAP1 chemokine- 
binding domain-immunoglobulin fusion and polypeptide homologs having at least 30% amino acid 
identity to any one of the aforementioned polypeptides. 

197. The method of Claim 196, wherein said chemokine-binding domain is an SLC- 
binding domain. 

198. The method of Claim 196, wherein said chemokine-binding domain is a CXCL9-
binding domain.

199. The method of Claim 190, wherein transport of SLC into the nucleus is measured 
by immunostaining. 

200. A vector comprising a THAP responsive promoter operably linked to a nucleic acid 
encoding a detectable product. 

201. The vector of Claim 200, wherein said THAP responsive promoter comprises a 
THAP responsive element. 

202. The vector of Claim 200, wherein said THAP responsive promoter does not 
comprise a THAP responsive element. 

203. A genetically engineered cell comprising the vector of any one of Claims 200-202. 

204. An in vitro transcription reaction comprising a nucleic acid comprising a THAP 
responsive promoter, ribonucleotides and an RNA polymerase. 

205. The in vitro transcription reaction of Claim 204, wherein said THAP responsive 
promoter comprises a THAP responsive element. 

206. An isolated mutant THAP-family polypeptide that does not bind to a chemokine.

207. The isolated mutant THAP-family polypeptide of Claim 206, wherein said
chemokine is selected from the group consisting of SLC, CCL19, CCL5, CXCL11, CXCL10 and
CXCL9.

208. The isolated mutant THAP-family polypeptide of Claim 206, wherein said
chemokine is SLC.

209. The isolated mutant THAP-family polypeptide of Claim 206, wherein said 
chemokine is CXCL9. 

210. The isolated mutant THAP-family polypeptide of Claim 206, wherein said THAP-
family polypeptide is THAP1.

211. The isolated mutant THAP-family polypeptide of Claim 210, wherein said
polypeptide comprises an amino acid sequence of SEQ ID NO: 3.

212. The isolated mutant THAP-family polypeptide of Claim 211, wherein said amino
acid sequence comprises at least one point mutation.

Figure 12

Figure 13

Figure 15

Gly Pro Pro Glu Val Ser Arg Leu Arg Arg Cys Arg Lys Arg Cys Ser

115 120 125 

Glu Gly Arg Gly Pro Thr Thr Pro Phe Ser Pro Pro Pro Pro Ala Asp 

130 135 140 

Val Thr Cys Phe Pro Val Glu Glu Ala Ser Ala Pro Ala Thr Leu Pro 
145 150 155 160 

Ala Ser Pro Ala Gly Arg Leu Glu Pro Gly Leu Ser Ser Pro Phe Ser 

165 170 175 

Asp Leu Leu Gly Pro Leu Gly Ala Gln Ala Asp Glu Ala Gly Cys Ser

180 185 190

Ala Gln Pro Ser Pro Glu Arg Gln Pro Ser Pro Leu Glu Pro Arg Pro

195 200 205

Val Ser Pro Ser Ala Tyr Met Leu Arg Leu Pro Pro Pro Ala Gly Ala

210 215 220

Tyr Ile Gln Asn Glu His Ser Tyr Gln Val Gly Ser Ala Leu Leu Trp
225 230 235 240

Lys Arg Arg Ala Glu Ala Ala Leu Asp Ala Leu Asp Lys Ala Gln Arg

245 250 255

Gln Leu Gln Ala Cys Lys Arg Arg Glu Gln Arg Leu Arg Leu Arg Leu

260 265 270

Thr Lys Leu Gln Gln Glu Arg Ala Arg Glu Lys Arg Ala Gln Ala Asp

275 280 285

Ala Arg Gln Thr Leu Lys Glu His Val Gln Asp Phe Ala Met Gln Leu

290 295 300 

Ser Ser Ser Met Ala 
305 
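
Several claims above recite homologs having "at least 30% amino acid identity" to a reference polypeptide. This part of the document does not restate how identity is to be calculated, so the following Python sketch is only an illustration under stated assumptions: sequences are written in one-letter code, they are aligned globally with a simple Needleman-Wunsch scheme (match +1, mismatch -1, gap -1, all hypothetical parameters), and identity is taken as identical aligned positions divided by alignment length. The function names are invented for this example, and a dedicated alignment program would normally be used in practice.

```python
from typing import Tuple


def global_align(a: str, b: str, match: int = 1, mismatch: int = -1,
                 gap: int = -1) -> Tuple[str, str]:
    """Needleman-Wunsch global alignment with a linear gap penalty.

    The scoring values are illustrative assumptions, not taken from the source.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned residues as a percentage of the alignment length."""
    aligned_a, aligned_b = global_align(a, b)
    if not aligned_a:
        return 0.0
    same = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * same / len(aligned_a)


def meets_identity_threshold(a: str, b: str, threshold: float = 30.0) -> bool:
    """True when the two sequences share at least `threshold` percent identity."""
    return percent_identity(a, b) >= threshold
```

For example, `meets_identity_threshold(candidate, reference, 30.0)` would apply the 30% cut-off recited in the claims, where `candidate` and `reference` are one-letter sequences supplied by the reader. Other definitions of identity (for instance, dividing by the length of the shorter sequence) give different percentages, so the denominator should be fixed before results are compared.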



<210> 10 
<211> 274 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Pro Lys Tyr Cys Arg Ala Pro Asn Cys Ser Asn Thr Ala Gly Arg
1 5 10 15

Leu Gly Ala Asp Asn Arg Pro Val Ser Phe Tyr Lys Phe Pro Leu Lys
20 25 30

Asp Gly Pro Arg Leu Gln Ala Trp Leu Gln His Met Gly Cys Glu His
35 40 45

Trp Val Pro Ser Cys His Gln His Leu Cys Ser Glu His Phe Thr Pro
50 55 60

Ser Cys Phe Gln Trp Arg Trp Gly Val Arg Tyr Leu Arg Pro Asp Ala
65 70 75 80

Val Pro Ser Ile Phe Ser Arg Gly Pro Pro Ala Lys Ser Gln Arg Arg
85 90 95

Thr Arg Ser Thr Gln Lys Pro Val Ser Pro Pro Pro Pro Leu Gln Lys
100 105 110

Asn Thr Pro Leu Pro Gln Ser Pro Ala Ile Pro Val Ser Gly Pro Val
115 120 125

Arg Leu Val Val Leu Gly Pro Thr Ser Gly Ser Pro Lys Thr Val Ala
130 135 140

Thr Met Leu Leu Thr Pro Leu Ala Pro Ala Pro Thr Pro Glu Arg Ser
145 150 155 160

Gln Pro Glu Val Pro Ala Gln Gln Ala Gln Thr Gly Leu Gly Pro Val
165 170 175

Leu Gly Ala Leu Gln Arg Arg Val Arg Arg Leu Gln Arg Cys Gln Glu
180 185 190

Arg His Gln Ala Gln Leu Gln Ala Leu Glu Arg Leu Ala Gln Gln Leu
195 200 205

His Gly Glu Ser Leu Leu Ala Arg Ala Arg Arg Gly Leu Gln Arg Leu
210 215 220

Thr Thr Ala Gln Thr Leu Gly Pro Glu Glu Ser Gln Thr Phe Thr Ile
225 230 235 240

Ile Cys Gly Gly Pro Asp Ile Ala Met Val Leu Ala Gln Asp Pro Ala
245 250 255

Pro Ala Thr Val Asp Ala Lys Pro Glu Leu Leu Asp Thr Arg Ile Pro
260 265 270

Ser Ala



<210> 11 

<211> 903 

<212> PRT 

<213> Homo sapiens 



<400> 11 



Met Thr Arg Ser Cys Ser Ala Val Gly Cys Ser Thr Arg Asp Thr Val
1 5 10 15

Leu Ser Arg Glu Arg Gly Leu Ser Phe His Gln Phe Pro Thr Asp Thr
20 25 30

Ile Gln Arg Ser Lys Trp Ile Arg Ala Val Asn Arg Val Asp Pro Arg
35 40 45

Ser Lys Lys Ile Trp Ile Pro Gly Pro Gly Ala Ile Leu Cys Ser Lys
50 55 60

His Phe Gln Glu Ser Asp Phe Glu Ser Tyr Gly Ile Arg Arg Lys Leu
65 70 75 80

Lys Lys Gly Ala Val Pro Ser Val Ser Leu Tyr Lys Ile Pro Gln Gly
85 90 95

Val His Leu Lys Gly Lys Ala Arg Gln Lys Ile Leu Lys Gln Pro Leu
100 105 110

Pro Asp Asn Ser Gln Glu Val Ala Thr Glu Asp His Asn Tyr Ser Leu
115 120 125

Lys Thr Pro Leu Thr Ile Gly Ala Glu Lys Leu Ala Glu Val Gln Gln
130 135 140

Met Leu Gln Val Ser Lys Lys Arg Leu Ile Ser Val Lys Asn Tyr Arg
145 150 155 160

Met Ile Lys Lys Arg Lys Gly Leu Arg Leu Ile Asp Ala Leu Val Glu
165 170 175

Glu Lys Leu Leu Ser Glu Glu Thr Glu Cys Leu Leu Arg Ala Gln Phe
180 185 190

Ser Asp Phe Lys Trp Glu Leu Tyr Asn Trp Arg Glu Thr Asp Glu Tyr
195 200 205

Ser Ala Glu Met Lys Gln Phe Ala Cys Thr Leu Tyr Leu Cys Ser Ser
210 215 220

Lys Val Tyr Asp Tyr Val Arg Lys Ile Leu Lys Leu Pro His Ser Ser
225 230 235 240

Ile Leu Arg Thr Trp Leu Ser Lys Cys Gln Pro Ser Pro Gly Phe Asn
245 250 255

Ser Asn Ile Phe Ser Phe Leu Gln Arg Arg Val Glu Asn Gly Asp Gln
260 265 270

Leu Tyr Gln Tyr Cys Ser Leu Leu Ile Lys Ser Ile Pro Leu Lys Gln
275 280 285

Gln Leu Gln Trp Asp Pro Ser Ser His Ser Leu Gln Gly Phe Met Asp
290 295 300

Phe Gly Leu Gly Lys Leu Asp Ala Asp Glu Thr Pro Leu Ala Ser Glu
305 310 315 320

Thr Val Leu Leu Met Ala Val Gly Ile Phe Gly His Trp Arg Thr Pro
325 330 335

Leu Gly Tyr Phe Phe Val Asn Arg Ala Ser Gly Tyr Leu Gln Ala Gln
340 345 350

Leu Leu Arg Leu Thr Ile Gly Lys Leu Ser Asp Ile Gly Ile Thr Val
355 360 365

Leu Ala Val Thr Ser Asp Ala Thr Ala His Ser Val Gln Met Ala Lys
370 375 380

Ala Leu Gly Ile His Ile Asp Gly Asp Asp Met Lys Cys Thr Phe Gln
385 390 395 400

His Pro Ser Ser Ser Ser Gln Gln Ile Ala Tyr Phe Phe Asp Ser Cys
405 410 415

His Leu Leu Arg Leu Ile Arg Asn Ala Phe Gln Asn Phe Gln Ser Ile
420 425 430

Gln Phe Ile Asn Gly Ile Ala His Trp Gln His Leu Val Glu Leu Val
435 440 445

Ala Leu Glu Glu Gln Glu Leu Ser Asn Met Glu Arg Ile Pro Ser Thr
450 455 460

Leu Ala Asn Leu Lys Asn His Val Leu Lys Val Asn Ser Ala Thr Gln
465 470 475 480

Leu Phe Ser Glu Ser Val Ala Ser Ala Leu Glu Tyr Leu Leu Ser Leu
485 490 495

Asp Leu Pro Pro Phe Gln Asn Cys Ile Gly Thr Ile His Phe Leu Arg
500 505 510

Leu Ile Asn Asn Leu Phe Asp Ile Phe Asn Ser Arg Asn Cys Tyr Gly
515 520 525

Lys Gly Leu Lys Gly Pro Leu Leu Pro Glu Thr Tyr Ser Lys Ile Asn
530 535 540

His Val Leu Ile Glu Ala Lys Thr Ile Phe Val Thr Leu Ser Asp Thr
545 550 555 560

Ser Asn Asn Gln Ile Ile Lys Gly Lys Gln Lys Leu Gly Phe Leu Gly
565 570 575

Phe Leu Leu Asn Ala Glu Ser Leu Lys Trp Leu Tyr Gln Asn Tyr Val
580 585 590

Phe Pro Lys Val Met Pro Phe Pro Tyr Leu Leu Thr Tyr Lys Phe Ser
595 600 605

His Asp His Leu Glu Leu Phe Leu Lys Met Leu Arg Gln Val Leu Val
610 615 620

Thr Ser Ser Ser Pro Thr Cys Met Ala Phe Gln Lys Ala Tyr Tyr Asn
625 630 635 640

Leu Glu Thr Arg Tyr Lys Phe Gln Asp Glu Val Phe Leu Ser Lys Val
645 650 655

Ser Ile Phe Asp Ile Ser Ile Ala Arg Arg Lys Asp Leu Ala Leu Trp
660 665 670

Thr Val Gln Arg Gln Tyr Gly Val Ser Val Thr Lys Thr Val Phe His
675 680 685

Glu Glu Gly Ile Cys Gln Asp Trp Ser His Cys Ser Leu Ser Glu Ala
690 695 700

Leu Leu Asp Leu Ser Asp His Arg Arg Asn Leu Ile Cys Tyr Ala Gly
705 710 715 720

Tyr Val Ala Asn Lys Leu Ser Ala Leu Leu Thr Cys Glu Asp Cys Ile
725 730 735

Thr Ala Leu Tyr Ala Ser Asp Leu Lys Ala Ser Lys Ile Gly Ser Leu
740 745 750

Leu Phe Val Lys Lys Lys Asn Gly Leu His Phe Pro Ser Glu Ser Leu
755 760 765

Cys Arg Val Ile Asn Ile Cys Glu Arg Val Val Arg Thr His Ser Arg
770 775 780

Met Ala Ile Phe Glu Leu Val Ser Lys Gln Arg Glu Leu Tyr Leu Gln
785 790 795 800

Gln Lys Ile Leu Cys Glu Leu Ser Gly His Ile Asp Leu Phe Val Asp
805 810 815

Val Asn Lys His Leu Phe Asp Gly Glu Val Cys Ala Ile Asn His Phe
820 825 830

Val Lys Leu Leu Lys Asp Ile Ile Ile Cys Phe Leu Asn Ile Arg Ala
835 840 845

Lys Asn Val Ala Gln Asn Pro Leu Lys His His Ser Glu Arg Thr Asp
850 855 860

Met Lys Thr Leu Ser Arg Lys His Trp Ser Pro Val Gln Asp Tyr Lys
865 870 875 880

Cys Ser Ser Phe Ala Asn Thr Ser Ser Lys Phe Arg His Leu Leu Ser
885 890 895

Asn Asp Gly Tyr Pro Phe Lys
900



<210> 12 
<211> 257 
<212> PRT 

<213> Homo sapiens 



<400> 12 



Met Pro Ala Arg Cys Val Ala Ala His Cys Gly Asn Thr Thr Lys Ser
1 5 10 15

Gly Lys Ser Leu Phe Arg Phe Pro Lys Asp Arg Ala Val Arg Leu Leu
20 25 30

Trp Asp Arg Phe Val Arg Gly Cys Arg Ala Asp Trp Tyr Gly Gly Asn
35 40 45

Asp Arg Ser Val Ile Cys Ser Asp His Phe Ala Pro Ala Cys Phe Asp
50 55 60

Val Ser Ser Val Ile Gln Lys Asn Leu Arg Phe Ser Gln Arg Leu Arg
65 70 75 80

Leu Val Ala Gly Ala Val Pro Thr Leu His Arg Val Pro Ala Pro Ala
85 90 95

Pro Lys Arg Gly Glu Glu Gly Asp Gln Ala Gly Arg Leu Asp Thr Arg
100 105 110

Gly Glu Leu Gln Ala Ala Arg His Ser Glu Ala Ala Pro Gly Pro Val
115 120 125

Ser Cys Thr Arg Pro Arg Ala Gly Lys Gln Ala Ala Ala Ser Gln Ile
130 135 140

Thr Cys Glu Asn Glu Leu Val Gln Thr Gln Pro His Ala Asp Asn Pro
145 150 155 160

Ser Asn Thr Val Thr Ser Val Pro Thr His Cys Glu Glu Gly Pro Val
165 170 175

His Lys Ser Thr Gln Ile Ser Leu Lys Arg Pro Arg His Arg Ser Val
180 185 190

Gly Ile Gln Ala Lys Val Lys Ala Phe Gly Lys Arg Leu Cys Asn Ala
195 200 205

Thr Thr [four residues illegible] Leu Trp Ser Arg Thr Ser Ser Leu Phe Asp
210 215 220

Ile Tyr Ser Ser Asp Ser Glu Thr Asp Thr Asp Trp Asp Ile Lys Ser
225 230 235 240

Glu Gln Ser Asp Leu Ser Tyr Met Ala Val Gln Val Lys Glu Glu Thr
245 250 255

Cys 




























<210> 13
<211> 314
<212> PRT

<213> Homo sapiens

<400> 13

Met Pro Gly Phe Thr Cys Cys Val Pro Gly Cys Tyr Asn Asn Ser His
1 5 10 15

Arg Asp Lys Ala Leu His Phe Tyr Thr Phe Pro Lys Asp Ala Glu Leu
20 25 30

Arg Arg Leu Trp Leu Lys Asn Val Ser Arg Ala Gly Val Ser Gly Cys
35 40 45

Phe Ser Thr Phe Gln Pro Thr Thr Gly His Arg Leu Cys Ser Val His
50 55 60

Phe Gln Gly Gly Arg Lys Thr Tyr Thr Val Arg Val Pro Thr Ile Phe
65 70 75 80

Pro Leu Arg Gly Val Asn Glu Arg Lys Val Ala Arg Arg Pro Ala Gly
85 90 95

Ala Ala Ala Ala Arg Arg Arg Gln Gln Gln Gln Gln Gln Gln Gln Gln
100 105 110

Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln
115 120 125

Gln Gln Gln Gln Ser Ser Pro Ser Ala Ser Thr Ala Gln Thr Ala Gln
130 135 140

Leu Gln Pro Asn Leu Val Ser Ala Ser Ala Ala Val Leu Leu Thr Leu
145 150 155 160

Gln Ala Thr Val Asp Ser Ser Gln Ala Pro Gly Ser Val Gln Pro Ala
165 170 175

Pro Ile Thr Pro Thr Gly Glu Asp Val Lys Pro Ile Asp Leu Thr Val
180 185 190

Gln Val Glu Phe Ala Ala Ala Glu Gly Ala Ala Ala Ala Ala Ala Ala
195 200 205

Ser Glu Leu Gln Ala Ala Thr Ala Gly Leu Glu Ala Ala Glu Cys Pro
210 215 220

Met Gly Pro Gln Leu Val Val Val Gly Glu Glu Gly Phe Pro Asp Thr
225 230 235 240

Gly Ser Asp His Ser Tyr Ser Leu Ser Ser Gly Thr Thr Glu Glu Glu
245 250 255

Leu Leu Arg Lys Leu Asn Glu Gln Arg Asp Ile Leu Ala Leu Met Glu
260 265 270

Val Lys Met Lys Glu Met Lys Gly Ser Ile Arg His Leu Arg Leu Thr
275 280 285

Glu Ala Lys Leu Arg Glu Glu Leu Arg Glu Lys Asp Arg Leu Leu Ala
290 295 300

Met Ala Val Ile Arg Lys Lys His Gly Met
305 310


<210> 14
<211> 761
<212> PRT

<213> Homo sapiens

<400> 14

Met Pro Asn Phe Cys Ala Ala Pro Asn Cys Thr Arg Lys Ser Thr Gln
1 5 10 15

Ser Asp Leu Ala Phe Phe Arg Phe Pro Arg Asp Pro Ala Arg Cys Gln
20 25 30

Lys Trp Val Glu Asn Cys Arg Arg Ala Asp Leu Glu Asp Lys Thr Pro
35 40 45

Asp Gln Leu Asn Lys His Tyr Arg Leu Cys Ala Lys His Phe Glu Thr
50 55 60

Ser Met Ile Cys Arg Thr Ser Pro Tyr Arg Thr Val Leu Arg Asp Asn
65 70 75 80

Ala Ile Pro Thr Ile Phe Asp Leu Thr Ser His Leu Asn Asn Pro His
85 90 95

Ser Arg His Arg Lys Arg Ile Lys Glu Leu Ser Glu Asp Glu Ile Arg
100 105 110

Thr Leu Lys Gln Lys Lys Ile Asp Glu Thr Ser Glu Gln Glu Gln Lys
115 120 125

His Lys Glu Thr Asn Asn Ser Asn Ala Gln Asn Pro Ser Glu Glu Glu
130 135 140

Gly Glu Gly Gln Asp Glu Asp Ile Leu Pro Leu Thr Leu Glu Glu Lys
145 150 155 160

Glu Asn Lys Glu Tyr Leu Lys Ser Leu Phe Glu Ile Leu Ile Leu Met
165 170 175

Gly Lys Gln Asn Ile Pro Leu Asp Gly His Glu Ala Asp Glu Ile Pro
180 185 190

Glu Gly Leu Phe Thr Pro Asp Asn Phe Gln Ala Leu Leu Glu Cys Arg
195 200 205

Ile Asn Ser Gly Glu Glu Val Leu Arg Lys Arg Phe Glu Thr Thr Ala
210 215 220

Val Asn Thr Leu Phe Cys Ser Lys Thr Gln Gln Arg Gln Met Leu Glu
225 230 235 240

Ile Cys Glu Ser Cys Ile Arg Glu Glu Thr Leu Arg Glu Val Arg Asp
245 250 255

Ser His Phe Phe Ser Ile Ile Thr Asp Asp Val Val Asp Ile Ala Gly
260 265 270

Glu Glu His Leu Pro Val Leu Val Arg Phe Val Asp Glu Ser His Asn
275 280 285

Leu Arg Glu Glu Phe Ile Gly Phe Leu Pro Tyr Glu Ala Asp Ala Glu
290 295 300

Ile Leu Ala Val Lys Phe His Thr Met Ile Thr Glu Lys Trp Gly Leu
305 310 315 320

Asn Met Glu Tyr Cys Arg Gly Gln Ala Tyr Ile Val Ser Ser Gly Phe
325 330 335

Ser Ser Lys Met Lys Val Val Ala Ser Arg Leu Leu Glu Lys Tyr Pro
340 345 350

Gln Ala Ile Tyr Thr Leu Cys Ser Ser Cys Ala Leu Asn Met Trp Leu
355 360 365

Ala Lys Ser Val Pro Val Met Gly Val Ser Val Ala Leu Gly Thr Ile
370 375 380

Glu Glu Val Cys Ser Phe Phe His Arg Ser Pro Gln Leu Leu Leu Glu
385 390 395 400

Leu Asp Asn Val Ile Ser Val Leu Phe Gln Asn Ser Lys Glu Arg Gly
405 410 415

Lys Glu Leu Lys Glu Ile Cys His Ser Gln Trp Thr Gly Arg His Asp
420 425 430

Ala Phe Glu Ile Leu Val Glu Leu Leu Gln Ala Leu Val Leu Cys Leu
435 440 445

Asp Gly Ile Asn Ser Asp Thr Asn Ile Arg Trp Asn Asn Tyr Ile Ala
450 455 460

Gly Arg Ala Phe Val Leu Cys Ser Ala Val Ser Asp Phe Asp Phe Ile
465 470 475 480

Val Thr Ile Val Val Leu Lys Asn Val Leu Ser Phe Thr Arg Ala Phe
485 490 495

Gly Lys Asn Leu Gln Gly Gln Thr Ser Asp Val Phe Phe Ala Ala Gly
500 505 510

Ser Leu Thr Ala Val Leu His Ser Leu Asn Glu Val Met Glu Asn Ile
515 520 525

Glu Val Tyr His Glu Phe Trp Phe Glu Glu Ala Thr Asn Leu Ala Thr
530 535 540

Lys Leu Asp Ile Gln Met Lys Leu Pro Gly Lys Phe Arg Arg Ala His
545 550 555 560

Gln Gly Asn Leu Glu Ser Gln Leu Thr Ser Glu Ser Tyr Tyr Lys Glu
565 570 575

Thr Leu Ser Val Pro Thr Val Glu His Ile Ile Gln Glu Leu Lys Asp
580 585 590

Ile Phe Ser Glu Gln His Leu Lys Ala Leu Lys Cys Leu Ser Leu Val
595 600 605

Pro Ser Val Met Gly Gln Leu Lys Phe Asn Thr Ser Glu Glu His His
610 615 620

Ala Asp Met Tyr Arg Ser Asp Leu Pro Asn Pro Asp Thr Leu Ser Ala
625 630 635 640

Glu Leu His Cys Trp Arg Ile Lys Trp Lys His Arg Gly Lys Asp Ile
645 650 655

Glu Leu Pro Ser Thr Ile Tyr Glu Ala Leu His Leu Pro Asp Ile Lys
660 665 670

Phe Phe Pro Asn Val Tyr Ala Leu Leu Lys Val Leu Cys Ile Leu Pro
675 680 685

Val Met Lys Val Glu Asn Glu Arg Tyr Glu Asn Gly Arg Lys Arg Leu
690 695 700

Lys Ala Tyr Leu Arg Asn Thr Leu Thr Asp Gln Arg Ser Ser Asn Leu
705 710 715 720

Ala Leu Leu Asn Ile Asn Phe Asp Ile Lys His Asp Leu Asp Leu Met
725 730 735

Val Asp Thr Tyr Ile Lys Leu Tyr Thr Ser Lys Ser Glu Leu Pro Thr
740 745 750

Asp Asn Ser Glu Thr Val Glu Asn Thr
755 760


<210> 15 
<211> 38 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Consensus sequence for PAR4 binding domain of THAP

<221> UNSURE

<222> (1)...(38)

<223> Xaa = Any Amino Acid

<400> 15

Leu Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5 10 15

Gln Arg Xaa Arg Arg Gln Xaa Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
20 25 30

Xaa Xaa Xaa Gln Xaa Glu
35



<210> 16 

<211> 73 

<212> PRT 

<213> Sus scrofa 

<400> 16 

Met Val Gln Ser Cys Ser Ala Tyr Gly Cys Lys Asn Arg Tyr Asp Lys
1 5 10 15

Asp Lys Pro Val Ser Phe His Lys Phe Pro Leu Thr Arg Pro Ser Leu
20 25 30

Cys Lys Lys Trp Glu Ala Ala Val Arg Arg Lys Asn Phe Lys Pro Thr
35 40 45

Lys Tyr Ser Ser Ile Cys Ser Glu His Phe Thr Pro Asp Cys Phe Lys
50 55 60

Arg Glu Cys Asn Asn Lys Leu Leu Lys
65 70



<210> 17 

<211> 99 

<212> PRT 

<213> Sus scrofa 

<400> 17 

Met Val Lys Cys Cys Ser Ala Ile Gly Cys Ala Ser Arg Cys Leu Pro
1 5 10 15

Asn Ser Lys Leu Lys Gly Leu Thr Phe His Val Phe Pro Thr Asp Glu
20 25 30

Lys Val Lys Arg Lys Trp Val Leu Ala Met Lys Arg Leu Asp Val Asn
35 40 45

Ala Ala Gly Met Trp Glu Pro Lys Lys Gly Asp Val Leu Cys Ser Arg
50 55 60

His Phe Lys Lys Thr Asp Phe Asp Arg Thr Thr Pro Asn Ile Lys Leu
65 70 75 80

Lys Pro Gly Val Ile Pro Ser Ile Phe Asp Ser Pro Ser His Leu Thr
85 90 95

Gly Glu Glu



<210> 18 
<211> 103 
<212> PRT 
<213> Sus scrofa 

<400> 18 

Met Pro Arg His Cys Ser Ala Ala Gly Cys Cys Thr Arg Asp Thr Arg
1 5 10 15

Glu Thr Arg Asn Arg Gly Ile Ser Phe His Arg Leu Pro Lys Lys Asp
20 25 30

Asn Pro Arg Arg Gly Leu Trp Leu Ala Asn Cys Gln Arg Leu Asp Pro
35 40 45

Ser Gly Gln Gly Leu Trp Asp Pro Ala Ser Glu Tyr Ile Tyr Phe Cys
50 55 60

Ser Lys His Phe Glu Glu Asn Cys Phe Glu Leu Val Gly Ile Ser Gly
65 70 75 80

Tyr His Arg Leu Lys Glu Gly Ala Val Pro Thr Ile Phe Glu Ser Phe
85 90 95

Ser Lys Leu Arg Arg Thr Ala
100



<210> 19 

<211> 99 

<212> PRT 

<213> Sus scrofa 

<400> 19 

Met Thr Arg Ser Cys Ser Ala Val Gly Cys Ser Thr Arg Asp Thr Val
1 5 10 15

Leu Ser Arg Glu Arg Gly Leu Ser Phe His Gln Phe Pro Thr Asp Thr
20 25 30

Ile Gln Arg Ser Gln Trp Ile Arg Ala Val Asn Arg Met Asp Pro Arg
35 40 45

Ser Lys Lys Ile Trp Ile Pro Gly Pro Gly Ala Met Leu Cys Ser Lys
50 55 60

His Phe Gln Glu Ser Asp Phe Glu Ser Tyr Gly Ile Arg Arg Lys Leu
65 70 75 80

Lys Lys Gly Ala Val Pro Ser Val Ser Leu Tyr Lys Val Leu Gln Gly
85 90 95

Ala His Leu



<210> 20 

<211> 92 

<212> PRT 

<213> Bos taurus 

<400> 20 

Met Pro Lys Ser Cys Ala Ala Arg Gln Cys Cys Asn Arg Tyr Ser Asn
1 5 10 15

Arg Arg Lys Gln Leu Thr Phe His Arg Phe Pro Phe Ser Arg Pro Glu
20 25 30

Leu Leu Lys Glu Trp Val Leu Asn Ile Gly Arg Gly Asp Phe Glu Pro
35 40 45

Lys Gln His Thr Val Ile Cys Ser Glu His Phe Arg Pro Glu Cys Phe
50 55 60

Ser Ala Phe Gly Asn Arg Lys Asn Leu Lys His Asn Ala Val Pro Thr
65 70 75 80

Val Phe Ala Phe Gln Gly Pro Pro Gln Leu Val Arg
85 90



<210> 21 

<211> 75 

<212> PRT 

<213> Bos taurus 

<400> 21 

Arg Leu Pro Lys Lys Asp Asn Pro Arg Arg Gly Leu Trp Leu Ala Asn
1 5 10 15

Cys Gln Arg Leu Asp Pro Ser Gly Gln Gly Leu Trp Asp Pro Ala Ser
20 25 30

Glu Tyr Ile Tyr Phe Cys Ser Lys His Phe Glu Glu Asn Cys Phe Glu
35 40 45

Leu Val Gly Ile Ser Gly Tyr His Arg Leu Lys Glu Gly Ala Val Pro
50 55 60

Thr Ile Phe Glu Ser Phe Ser Lys Leu Arg Arg
65 70 75



<210> 22 
<211> 91 
<212> PRT 

<213> Mus musculus 
<400> 22 

Met Val Gln Ser Cys Ser Ala Tyr Gly Cys Lys Asn Arg Tyr Asp Lys
1 5 10 15

Asp Lys Pro Val Ser Phe His Lys Phe Pro Leu Thr Arg Pro Ser Leu
20 25 30

Cys Lys Gln Trp Glu Ala Ala Val Lys Arg Lys Asn Phe Lys Pro Thr
35 40 45

Lys Tyr Ser Ser Ile Cys Ser Glu His Phe Thr Pro Asp Cys Phe Lys
50 55 60

Arg Glu Cys Asn Asn Lys Leu Leu Lys Glu Asn Ala Val Pro Thr Ile
65 70 75 80

Phe Leu Tyr Ile Glu Pro His Glu Lys Lys Glu
85 90



<210> 23 
<211> 90 
<212> PRT 

<213> Mus musculus 
<400> 23 

Met Pro Thr Asn Cys Ala Ala Ala Gly Cys Ala Ala Thr Tyr Asn Lys
1 5 10 15

His Ile Asn Ile Ser Phe His Arg Phe Pro Leu Asp Pro Lys Arg Arg
20 25 30

Lys Glu Trp Val Arg Leu Val Arg Arg Lys Asn Phe Val Pro Gly Lys
35 40 45

His Thr Phe Leu Cys Ser Lys His Phe Glu Ala Ser Cys Phe Asp Leu
50 55 60

Thr Gly Gln Thr Arg Arg Leu Lys Met Asp Ala Val Pro Thr Ile Phe
65 70 75 80

Asp Phe Cys Thr His Ile Lys Ser Leu Lys
85 90



<210> 24 
<211> 92 
<212> PRT 

<213> Mus musculus 
<400> 24 

Met Pro Lys Ser Cys Ala Ala Arg Gln Cys Cys Asn Arg Tyr Ser Ser
1 5 10 15

Arg Arg Lys Gln Leu Thr Phe His Arg Phe Pro Phe Ser Arg Pro Glu
20 25 30

Leu Leu Arg Glu Trp Val Leu Asn Ile Gly Arg Ala Asp Phe Lys Pro
35 40 45

Lys Gln His Thr Val Ile Cys Ser Glu His Phe Arg Pro Glu Cys Phe
50 55 60

Ser Ala Phe Gly Asn Arg Lys Asn Leu Lys His Asn Ala Val Pro Thr
65 70 75 80

Val Phe Ala Phe Gln Asn Pro Thr Glu Val Cys Pro
85 90

<210> 25 
<211> 95 
<212> PRT 

<213> Mus musculus 
<400> 25 

Met Val Ile Cys Cys Ala Ala Val Asn Cys Ser Asn Arg Gln Gly Lys
1 5 10 15

Gly Glu Lys Arg Ala Val Ser Phe His Arg Phe Pro Leu Lys Asp Ser
20 25 30

Lys Arg Leu Ile Gln Trp Leu Lys Ala Val Gln Arg Asp Asn Trp Thr
35 40 45

Pro Thr Lys Tyr Ser Phe Leu Cys Ser Glu His Phe Thr Lys Asp Ser
50 55 60

Phe Ser Lys Arg Leu Glu Asp Gln His Arg Leu Leu Lys Pro Thr Ala
65 70 75 80

Val Pro Ser Ile Phe His Leu Ser Glu Lys Lys Arg Gly Ala Gly
85 90 95

<210> 26 
<211> 52 
<212> PRT 

<213> Mus musculus 
<400> 26 

Ile Leu Gln Ala Phe Gly Ser Leu Lys Lys Gly Asp Val Leu Cys Ser
1 5 10 15

Arg His Phe Lys Lys Thr Asp Phe Asp Arg Ser Thr Leu Asn Thr Lys
20 25 30

Leu Lys Ala Gly Ala Ile Pro Ser Ile Phe Glu Cys Pro Tyr His Leu
35 40 45

Gln Glu Lys Arg
50

<210> 27 
<211> 103 
<212> PRT 

<213> Mus musculus 
<400> 27 

Met Pro Arg His Cys Ser Ala Ala Gly Cys Cys Thr Arg Asp Thr Arg
1 5 10 15

Glu Thr Arg Asn Arg Gly Ile Ser Phe His Arg Leu Pro Lys Lys Asp
20 25 30

Asn Pro Arg Arg Gly Leu Trp Leu Ala Asn Cys Gln Arg Leu Asp Pro
35 40 45

Ser Gly Gln Gly Leu Trp Asp Pro Thr Ser Glu Tyr Ile Tyr Phe Cys
50 55 60

Ser Lys His Phe Glu Glu Asn Cys Phe Glu Leu Val Gly Ile Ser Gly
65 70 75 80

Tyr His Arg Leu Lys Glu Gly Ala Val Pro Thr Ile Phe Glu Ser Phe
85 90 95

Ser Lys Leu Arg Arg Thr Ala
100

<210> 28
<211> 90
<212> PRT

<213> Mus musculus
<400> 28

Met Pro Gly Phe Thr Cys Cys Val Pro Gly Cys Tyr Asn Asn Ser His
1 5 10 15

Arg Asp Lys Ala Leu His Phe Tyr Thr Phe Pro Lys Asp Ala Glu Leu
20 25 30

Arg Arg Leu Trp Leu Lys Asn Val Ser Arg Ala Gly Val Ser Gly Cys
35 40 45

Phe Ser Thr Phe Gln Pro Thr Thr Gly His Arg Leu Cys Ser Val His
50 55 60

Phe Gln Gly Gly Arg Lys Thr Tyr Thr Val Arg Val Pro Thr Ile Phe
65 70 75 80

Pro Leu Arg Gly Val Asn Glu Arg Lys Val
85 90


<210> 29 
<211> 96 
<212> PRT 

<213> Mus musculus 



<400> 29 



Met Pro Asn Phe Cys Ala Ala Pro Asn Cys Thr Arg Lys Ser Thr Gln
1 5 10 15

Ser Asp Leu Ala Phe Phe Arg Phe Pro Arg Asp Pro Ala Arg Cys Gln
20 25 30

Lys Trp Val Glu Asn Cys Arg Arg Ala Asp Leu Glu Asp Lys Thr Pro
35 40 45

Asp Gln Leu Asn Lys His Tyr Arg Leu Cys Ala Lys His Phe Glu Thr
50 55 60

Ser Met Ile Cys Arg Thr Ser Pro Tyr Arg Thr Val Leu Arg Asp Asn
65 70 75 80

Ala Ile Pro Thr Ile Phe Asp Leu Thr Ser His Leu Asn Asn Pro His
85 90 95



<210> 30 
<211> 24 
<212> PRT 

<213> Rattus norvegicus 
<400> 30 

Met Pro Thr Asn Cys Ala Ala Ala Gly Cys Ala Ala Thr Tyr Asn Lys 

1 5 10 15 

His Ile Asn Ile Ser Phe His Arg
20 



<210> 31 
<211> 85 
<212> PRT 

<213> Rattus norvegicus 
<400> 31 

Arg Gln Cys Cys Asn Arg Tyr Ser Ser Arg Arg Lys Gln Leu Thr Phe




WO 2004/055050 PCT/IB2003/006434 

1 5 10 15 

His Arg Phe Pro Phe Ser Arg Pro Glu Leu Leu Arg Glu Trp Val Leu 

20 25 30 

Asn Ile Gly Arg Ala Asp Phe Lys Pro Lys Gln His Thr Val Ile Cys
35 40 45

Ser Glu His Phe Arg Pro Glu Cys Phe Ser Ala Phe Gly Asn Arg Lys
50 55 60

Asn Leu Lys His Asn Ala Val Pro Thr Val Phe Ala Phe Gln Asn Pro
65 70 75 80

Ala Gln Val Cys Pro
85

<210> 32 
<211> 70 
<212> PRT 

<213> Rattus norvegicus 
<400> 32 

Arg Phe Pro Leu Lys Asp Ser Lys Arg Leu Ile Gln Trp Leu Lys Ala
1 5 10 15

Val Gln Arg Asp Asn Trp Thr Pro Thr Lys Tyr Ser Phe Leu Cys Ser
20 25 30

Glu His Phe Thr Lys Asp Ser Phe Ser Lys Arg Leu Glu Asp Gln His
35 40 45

Arg Leu Leu Lys Pro Thr Ala Val Pro Ser Ile Phe His Leu Ser Glu
50 55 60

Lys Lys Arg Gly Ala Gly
65 70

<210> 33 
<211> 55 
<212> PRT 

<213> Rattus norvegicus 
<400> 33 

Met Val Lys Cys Cys Ser Ala Ile Gly Cys Ala Ser Arg Cys Leu Pro
1 5 10 15

Asn Ser Lys Leu Lys Gly Leu Thr Phe His Val Phe Pro Thr Asp Glu
20 25 30

Asn Ile Lys Arg Lys Trp Val Leu Ala Met Lys Arg Leu Asp Val Asn
35 40 45

Thr Ala Gly Ile Trp Glu Pro
50 55

<210> 34 
<211> 103 
<212> PRT 

<213> Rattus norvegicus 
<400> 34 

Met Pro Arg His Cys Ser Ala Ala Gly Cys Cys Thr Arg Asp Thr Arg
1 5 10 15

Glu Thr Arg Asn Arg Gly Ile Ser Phe His Arg Leu Pro Lys Lys Asp
20 25 30

Asn Pro Arg Arg Gly Leu Trp Leu Ala Asn Cys Gln Arg Leu Asp Pro
35 40 45

Ser Gly Gln Gly Leu Trp Asp Pro Thr Ser Glu Tyr Ile Tyr Phe Cys
50 55 60

Ser Lys His Phe Glu Glu Asn Cys Phe Glu Leu Val Gly Ile Ser Gly
65 70 75 80

Tyr His Arg Leu Lys Glu Gly Ala Val Pro Thr Ile Phe Glu Ser Phe
85 90 95

Ser Lys Leu Arg Arg Thr Ala
100



<210> 35 
<211> 90 
<212> PRT 

<213> Rattus norvegicus 
<400> 35 



Met Pro Gly Phe Thr Cys Cys Val Pro Gly Cys Tyr Asn Asn Ser His
1 5 10 15

Arg Asp Lys Ala Leu His Phe Tyr Thr Phe Pro Lys Asp Ala Glu Leu
20 25 30

Arg Arg Leu Trp Leu Lys Asn Val Ser Arg Ala Gly Val Ser Gly Cys
35 40 45

Phe Ser Thr Phe Gln Pro Thr Thr Gly His Arg Leu Cys Ser Val His
50 55 60

Phe Gln Gly Gly Arg Lys Thr Tyr Thr Val Arg Val Pro Thr Ile Phe
65 70 75 80

Pro Leu Arg Gly Val Asn Glu Arg Lys Val
85 90


<210> 36 
<211> 96 
<212> PRT 

<213> Rattus norvegicus 
<400> 36 

Met Pro Asn Phe Cys Ala Ala Pro Asn Cys Thr Arg Lys Ser Thr Gln
1 5 10 15

Ser Asp Leu Ala Phe Phe Arg Phe Pro Arg Asp Pro Ala Arg Cys Gln
20 25 30

Lys Trp Val Glu Asn Cys Arg Arg Ala Asp Leu Glu Asp Lys Thr Pro
35 40 45

Asp Gln Leu Asn Lys His Tyr Arg Leu Cys Ala Lys His Phe Glu Thr
50 55 60

Ser Met Ile Cys Arg Thr Ser Pro Tyr Arg Thr Val Leu Arg Asp Asn
65 70 75 80

Ala Ile Pro Thr Ile Phe Asp Leu Thr Ser His Leu Asn Asn Pro His
85 90 95



<210> 37 
<211> 94 
<212> PRT 

<213> Gallus gallus 
<400> 37 

Met Val Ile Cys Cys Ala Ala Ala Asn Cys Ser Asn Arg Gln Gly Lys
1 5 10 15

Ala Leu Arg Gly Ala Val Ser Phe His Arg Phe Pro Leu Lys Asp Ser
20 25 30

Lys Arg Leu Ile Gln Trp Leu Lys Ala Val Gln Arg Asp Asn Trp Thr
35 40 45

Pro Thr Lys Tyr Ser Phe Leu Cys Ser Glu His Phe Thr Lys Asp Ser
50 55 60

Phe Ser Arg Arg Leu Glu Asp Gln His Arg Leu Leu Lys Pro Thr Ala
65 70 75 80

Val Pro Thr Ile Phe Gln Leu Ala Glu Lys Lys Arg Asp Asn
85 90



<210> 38 
<211> 94 
<212> PRT 

<213> Gallus gallus 
<400> 38 

Met Pro Arg Tyr Cys Ala Ala Ser Tyr Cys Lys Asn Arg Gly Gly Gln
1 5 10 15

Ser Ala Arg Asp Gln Arg Lys Leu Ser Phe Tyr Pro Phe Pro Leu His
20 25 30

Asp Lys Glu Arg Leu Glu Lys Trp Leu Arg Asn Met Lys Arg Asp Ala
35 40 45

Trp Thr Pro Ser Lys His Gln Leu Leu Cys Ser Asp His Phe Thr Pro
50 55 60

Asp Ser Leu Asp Val Arg Trp Gly Ile Arg Tyr Leu Lys His Thr Ala
65 70 75 80

Val Pro Thr Ile Phe Ser Ser Pro Asp Asp Glu Glu Lys Gly
85 90



<210> 39 
<211> 102 
<212> PRT 

<213> Gallus gallus 
<400> 39 

Met Pro Arg His Cys Ser Ala Ala Gly Cys Cys Thr Arg Asp Thr Arg
1 5 10 15

Glu Thr Arg Ser Arg Gly Ile Ser Phe His Arg Leu Pro Lys Lys Asp
20 25 30

Asn Pro Arg Arg Ala Leu Trp Leu Glu Asn Ser Arg Arg Arg Asp Ala
35 40 45

Ser Gly Glu Gly Arg Trp Asp Pro Ala Ser Lys Tyr Ile Tyr Phe Cys
50 55 60

Ser Gln His Phe Glu Lys Ser Cys Phe Glu Ile Val Gly Phe Ser Gly
65 70 75 80

Tyr His Arg Leu Lys Glu Gly Ala Val Pro Thr Val Phe Glu Ser Thr
85 90 95

Ser Pro Arg Pro Pro Arg
100



<210> 40 
<211> 27 
<212> PRT 

<213> Gallus gallus 
<400> 40 

Met Thr Arg Ser Cys Ser Ala Leu Gly Cys Ser Ala Arg Asp Asn Gly
1 5 10 15

Arg Ser Arg Glu Arg Gly Ile Ser Phe His Gln
20 25

<210> 41
<211> 90
<212> PRT

<213> Xenopus laevis
<400> 41

Met Val Gln Ser Cys Ser Ala Tyr Gly Cys Lys Asn Arg Tyr Asp Lys
1 5 10 15

Asp Arg Pro Ile Ser Phe His Lys Phe Pro Leu Lys Arg Pro Leu Leu
20 25 30

Cys Lys Lys Trp Glu Ala Ala Val Arg Arg Ala Asp Phe Lys Pro Thr
35 40 45

Lys Tyr Ser Ser Ile Cys Ser Asp His Phe Thr Ala Asp Cys Phe Lys
50 55 60

Arg Glu Cys Asn Asn Lys Leu Leu Lys Asp Asn Ala Val Pro Thr Val
65 70 75 80

Phe Ala Leu Ala Glu Ile Lys Lys Lys Met
85 90



<210> 42 

<211> 103 

<212> PRT 

<213> Xenopus laevis

<400> 42

Met Pro Arg His Cys Ser Ala Leu Gly Cys Thr Thr Arg Asp Ser Arg
1 5 10 15

Gln Thr Arg Asn Asn Asn Ile Ser Phe His Arg Leu Pro Arg Lys Asp
20 25 30

Asp Pro Arg Arg Asn Leu Trp Ile Ala Asn Cys Gln Arg Thr Asp Pro
35 40 45

Ser Gly Lys Gly Leu Trp Asp Pro Ser Ser Asp Tyr Val Tyr Phe Cys
50 55 60

Ser Lys His Phe Glu Lys Ser Cys Phe Glu Val Val Gly Thr Ser Gly
65 70 75 80

Tyr His Arg Leu Lys Glu Asp Ala Val Pro Thr Leu Phe Leu Ser Ser
85 90 95

Ala Lys Leu Arg Arg Ala Ala
100


<210> 43 
<211> 90 
<212> PRT 

<213> Xenopus laevis
<400> 43

Met Val Arg Ser Cys Ser Ala Ala Asn Cys Val Asn Arg Gln Thr Ala
1 5 10 15

Leu Asn Lys Arg Lys Gly Ile Thr Phe His Arg Phe Pro Lys Glu Gln
20 25 30

Ala Arg Arg Gln Leu Trp Ile Thr Ala Val Thr His Ser His Ala Ala
35 40 45

Val Gly Thr Asp Trp Thr Pro Ser Ile His Ser Ser Leu Cys Ser Gln
50 55 60

His Phe Asn Asn Thr Gln Phe Asp Arg Thr Gly Gln Thr Val Arg Leu
65 70 75 80

Arg Asp Ser Ala Val Pro Thr Val Phe Ser
85 90

<210> 44
<211> 99
<212> PRT

<213> Xenopus laevis
<400> 44

Met Pro Val Ser Cys Ala Ala Ser Gly Cys Lys Ser Arg Tyr Thr Met
1 5 10 15

Asp Ala Arg Glu Lys Gly Ile Thr Phe His Arg Phe Pro Arg Ser Asn
20 25 30

Pro Thr Leu Leu Glu Lys Trp Arg Leu Ala Met Arg Arg Ser Thr Arg
35 40 45

Asn Gly Glu Leu Trp Met Pro Ser Arg Tyr Gln Arg Leu Cys Ser Leu
50 55 60

His Phe Lys Gln Cys Cys Phe Asp Thr Thr Gly Gln Thr Lys Arg Leu
65 70 75 80

Arg Glu Asp Val Ile Pro Thr Ile Phe Asp Phe Pro Glu Glu Thr His
85 90 95

Val Ile Phe



<210> 45 
<211> 90 
<212> PRT 

<213> Xenopus laevis
<400> 45

Met Pro Ala Cys Ala Ala Ile Asn Cys Thr Ser Arg Gln Thr Arg Gly
1 5 10 15

Cys Gly Lys Ser Phe His Lys Phe Pro His Gly Arg Pro Glu Val Leu
20 25 30

Lys Lys Trp Val Met Asn Met Arg Arg Asp Lys Phe Lys Pro Ser Ser
35 40 45

Lys Ala Val Leu Cys Ser Asp His Phe Glu Glu Phe Cys Phe Asp Arg
50 55 60

Thr Gly Gln Thr Ile Arg Leu Arg Thr Asp Ala Val Pro Thr Val Phe
65 70 75 80

Thr Phe Pro Gly Lys Met Lys Lys Asp Arg
85 90



<210> 46 
<211> 105 
<212> PRT 

<213> Xenopus laevis
<400> 46

Met Pro His Cys Val Val Ser Asn Cys Val His Phe Asn Tyr Lys Lys
1 5 10 15

Ser Asn Leu His Gly Val Ala Leu His Pro Phe Pro Asn Asp Leu Ser
20 25 30

Arg Ile Lys Leu Trp Leu Gln Gln Ile Gly Leu Thr Thr Asp Glu Ile
35 40 45

Asp Tyr Leu Ala Gln Lys Val Val Glu Gly Lys Arg Lys Lys Thr Asp
50 55 60

Ser His Arg Met Cys Ser Ala His Phe Thr Pro Asn Cys Tyr Ile Val
65 70 75 80

Gln Asp Ala Lys Leu Val Leu Arg Ser Asp Ala Ile Pro Thr Met Phe
85 90 95

Pro Gly Leu Ser Ser Ser Thr Thr Asn
100 105



<210> 47
<211> 104
<212> PRT

<213> Xenopus laevis
<400> 47

Met Pro Lys Cys Ile Val Thr Lys Cys Pro His Lys Thr Gly Gln Lys
1 5 10 15

Glu Leu Tyr Pro Ser Val Ile Leu His Pro Phe Pro Gly Asn Ile Glu
20 25 30

Lys Ile Lys Gln Trp Leu Leu Gln Thr Gly Glu Asp Tyr Gly Asp Tyr
35 40 45

Glu Val Phe Ala Glu Lys Val Leu Glu Ala Lys Lys Thr Asp Ala Tyr
50 55 60

Arg Ile Cys Ser Arg His Phe Ala Glu Asp Gln Tyr Val Lys Arg Gly
65 70 75 80

Pro Arg Lys Leu Leu Ser Lys Asp Ala Val Pro Thr Ile Phe Ser Asn
85 90 95

Leu His Pro Leu Ile Gln Leu His
100


<210> 48 
<211> 102 
<212> PRT 

<213> Xenopus laevis
<400> 48

Met Pro Arg Cys Val Val Lys Asn Cys Pro His Trp Thr Gly Lys Lys
1 5 10 15

Gly Ser Gln Val Ile Leu His Gly Phe Pro Asn Asn Ser Arg Leu Ile
20 25 30

Lys Leu Trp Leu Ser Gln Thr Lys Gln Asp Phe Gly Asp Val Glu Asp
35 40 45

Phe Thr Gln Lys Ile Leu Glu Gly Lys Lys Asn Asp Leu Tyr Arg Leu
50 55 60

Cys Ser Lys His Phe Thr Asn Asp Ser Tyr Glu Ile Arg Gly Thr Lys
65 70 75 80

Arg Phe Leu Lys Tyr Gly Ala Val Pro Thr Val Phe Glu Asp Thr Pro
85 90 95

Pro Leu Lys Arg Arg Lys
100


<210> 49 
<211> 104 
<212> PRT 

<213> Xenopus laevis
<400> 49

Met Pro Asn Cys Ile Val Lys Asp Cys Arg His Lys Ser Gly Gln Lys
1 5 10 15

Ile Gln Asn Pro Asp Val Val Leu His Pro Phe Pro Asn Asn Ile Asn
20 25 30

Met Ile Lys Asn Trp Leu Leu Gln Thr Gly Gln Asp Phe Gly Asp Ile
35 40 45

Asp Val Leu Ala Asp Lys Ile Leu Lys Gly Lys Lys Thr Ala Asn Phe
50 55 60

Arg Met Cys Ser Cys His Phe Thr Arg Asp Ser Tyr Met Ala Arg Gly
65 70 75 80

Ser Lys Thr Thr Leu Lys Pro Asn Ala Ile Pro Thr Ile Phe Pro Val
85 90 95

Ile Leu Pro Thr Thr Val Pro Ser
100



<210> 50 
<211> 99 
<212> PRT 

<213> Xenopus laevis
<400> 50

Met Pro Lys Cys Phe Val Gln Ser Cys Pro His Tyr Thr Gly Arg Asn
1 5 10 15

Gly Lys Pro Asp Asn Val Ile Leu His Thr Phe Pro Arg Cys Lys Lys
20 25 30

Gln Val Gln Val Trp Leu Ser Arg Thr Gly Glu Arg Tyr Glu Asn Met
35 40 45

Ala Glu Phe Val Thr Tyr Ile Thr Gln Arg Cys Ser Asn Phe Arg Met
50 55 60

Cys Ser Glu His Phe Thr Asp Asp Cys Tyr Ile Thr Val Glu Gly Lys
65 70 75 80

Arg Arg Leu Met Glu Asn Ser Ala Pro Thr Ile Phe Lys Thr Thr Phe
85 90 95

Arg Gln Asn



<210> 51 

<211> 104 

<212> PRT 

<213> Xenopus laevis

<400> 51

Met Thr Lys Cys Ile Val Lys Gly Cys Arg His Thr Thr Gly Gln Lys
1 5 10 15

[...] Lys Phe Pro His Ile Val Met His Ala Phe Pro Ser Asn Leu Lys
20 25 30

[...] Ile Lys Val Trp Leu Lys Gln Thr Gly Gln Tyr Gly Asn Asn Leu
35 40 45

[...] Glu Met Ala Leu Lys Val Leu Gly Gly Lys Lys Ser Asp Ser Tyr
50 55 60

[...] Leu Cys Ser Ala His Phe Thr Val Asp Ser Tyr Ala Leu Arg Arg
65 70 75 80

[...] Lys Asn Met Leu Lys Lys Asp Ala Phe Pro Thr Leu Phe Gly Gln
85 90 95

[...] Gln Ile Asn Ala Ala Asn Val
100



<210> 52 
<211> 84 
<212> PRT 

<213> Xenopus laevis

<400> 52

Met Pro Lys Cys Ile Val Ile His Cys Pro His Ser Cys Ser Lys Lys
1 5 10 15

Val Thr Lys Asn Thr Gly Val Val Met His Thr Phe Pro Phe Asn Leu
20 25 30

Asp Arg Ile Lys Asn Trp Leu Leu Ser Ile Asp Gln Asn Phe Gly Asn
35 40 45

Ile Asp Thr Leu Ala Asn Arg Ile Leu Glu Glu Lys Lys Lys His Ser
50 55 60

Asp Leu Tyr Arg Leu Cys Ser Glu His Phe Thr Pro Gln Cys Tyr Ile
65 70 75 80

Ser Thr Gly Glu



<210> 53 
<211> 104 
<212> PRT 

<213> Xenopus laevis

<400> 53

Met Pro Ser Cys Ile Val Lys Gly Cys Pro His Arg Thr Gly Gln Lys
1 5 10 15

Asp Lys Phe Pro Asn Val Thr Leu His Asn Phe Pro Lys Thr Ile Pro
20 25 30

Lys Ile Lys Asn Trp Leu Trp Gln Thr Gly Gln Tyr Gly Glu Asp Ser
35 40 45

Asp Ala Ile Ala Glu Glu Ile Leu Gln Gly Leu Lys Thr Cys Arg His
50 55 60

Arg Met Cys Ser Met His Phe Ser Glu Asn Cys Phe Ile Thr Leu Gly
65 70 75 80

Ser Lys Arg Val Leu Thr Arg Asn Ala Val Pro Thr Ile Phe Lys Pro
85 90 95

Gln Thr Thr Pro Ala Ile Leu Ala
100



<210> 54 
<211> 104 
<212> PRT 

<213> Xenopus laevis
<400> 54

Met Pro Lys Cys Ile Leu Asn Gly Cys Pro Tyr Arg Thr Gly Gln Lys
1 5 10 15

Leu Lys Phe Pro Asp Ile Val Leu His Pro Phe Pro Lys Ser Met Glu
20 25 30

Met Ile Arg Asn Trp Leu Phe Gln Thr Gly Gln His Ala Glu Asp Val
35 40 45

Glu Ser Leu Ser Gln Arg Ile Tyr Gln Gly Leu Lys Thr Ser Asn Phe
50 55 60

Arg Met Cys Ser Lys His Phe Thr Gln Asp Cys Tyr Met Gln Val Gly
65 70 75 80

Ser Arg Lys Cys Leu Lys Pro Asn Ala Val Pro Thr Val Phe Glu Ser
85 90 95

Tyr Asn Val Pro Val Thr Thr Phe
100



















<210> 55 
<211> 105 
<212> PRT 

<213> Xenopus laevis
<400> 55

Asn Asn Ala Ser Cys Ile Val Arg Gly Cys His His Ser Thr Ala Arg
1 5 10 15

Lys Cys Leu Ser Pro Gly Ile Ala Leu His Gly Phe Pro Asn Asn Leu
20 25 30

Ser Arg Ile Lys Gln Trp Leu Val Asn Ile Gly Gln Asn Val Gly Asp
35 40 45

Ile Asp Asp Phe Ala Gln Lys Val Leu Asp Gly Lys Lys Gln Asn Ser
50 55 60

Tyr Arg Ile Cys Ser Ala His Phe Ser Ser Asp Cys Phe Val Gln Phe
65 70 75 80

Gly Tyr Ser Lys Gly Leu Lys Ala Asp Ala Val Pro Thr Ile Phe Ala
85 90 95

Trp Asn Thr Pro Glu Ser Arg Gly Arg
100 105

<210> 56 
<211> 107 
<212> PRT 

<213> Xenopus laevis
<400> 56

Met Pro Ser Cys Ile Val Lys Gly Cys Arg His Lys Ser Gly Gln Lys
1 5 10 15

Val Leu Tyr Pro Asp Val Val Leu His Ser Phe Pro Asn Asn Ile His
20 25 30

Met Ile Lys Asn Trp Leu Leu Gln Thr Gly Gln Val Phe Gly Asp Ile
35 40 45

Asp Ala Phe Ala Glu Lys Val Leu Lys Gly Asn Lys Thr Ser Ala Phe
50 55 60

Arg Met Cys Ser Arg His Phe Thr Arg Asp Ser Tyr Met Ala Lys Gly
65 70 75 80

Ser Lys Ile Thr Leu Lys Pro Asn Ala Val Pro Thr Ile Phe Asn Thr
85 90 95

Leu Pro Pro Ala Ala Ala Val Pro Ser Leu Met
100 105

<210> 57 

<211> 91 

<212> PRT 

<213> Danio rerio 

<400> 57 

Met Val Gln Ser Cys Ser Ala Tyr Gly Cys Asn Asn Arg Tyr Gln Lys
1 5 10 15

Asp Arg Ile Ile Ser Phe His Lys Phe Pro Leu Ala Arg Pro Glu Val
20 25 30

Cys Val Gln Trp Val Ser Ala Met Ser Arg Arg Asn Phe Lys Pro Thr
35 40 45

Lys Tyr Ser Asn Ile Cys Ser Gln His Phe Thr Ser Asp Cys Phe Lys
50 55 60

Gln Glu Cys Asn Asn Arg Val Leu Lys Asp Asn Ala Val Pro Ser Leu
65 70 75 80

Phe Thr Leu Gln Thr Gln Asp Pro Phe Ser Ala
85 90

<210> 58 
<211> 103 






WO 2004/055050 



PCT/IB2003/006434 



<212> PRT 

<213> Danio rerio 

<400> 58 



Met Pro Arg His Cys Ser Ala Val Gly Cys Lys Ser Arg Asp Thr Lys
1 5 10 15

Asp Val Arg Lys Ser Gly Ile Thr Phe His Arg Leu Pro Lys Lys Gly
20 25 30

Asn Pro Arg Arg Thr Thr Trp Ile Ile Asn Ser Arg Arg Lys Gly Pro
35 40 45

Glu Gly Lys Gly Gln Trp Asp Pro Gln Ser Gly Phe Ile Tyr Phe Cys
50 55 60

Ser Lys His Phe Thr Pro Asp Ser Phe Glu Leu Ser Gly Val Ser Gly
65 70 75 80

Tyr His Arg Leu Lys Asp Asp Ala Ile Pro Thr Val Phe Glu Ile Glu
85 90 95

Pro His Lys Lys Gly Thr Ala
100





















<210> 59 

<211> 90 

<212> PRT 

<213> Danio rerio 

<400> 59 



Met Pro Gly Phe Thr Cys Cys Val Pro Gly Cys Tyr Asn Asn Ser His
1 5 10 15

Arg Asp Arg Asp Leu Arg Phe Tyr Thr Phe Pro Lys Asp Pro Thr Gln
20 25 30

Arg Glu Ile Trp Leu Lys Asn Ile Ser Arg Ala Gly Val Ser Gly Cys
35 40 45








Phe 


Ser 


Thr Phe 


Gin 


Pro 


Thr 


Thr 


Gly 


His 


Arg 


Val 


Cys 


Ser 


Val 


His 




50 


Gly Gly 






55 










60 










Phe 


Pro 


Arg 


Lys 


Thr 


Tyr 


Thr 


He 


Arg 


Val 


Pro 


Thr 


Leu 


Phe 


65 








70 










75 










80 


Pro Leu Arg Gly Val Asn Glu Arg Arg Ser
85 90















<210> 60 

<211> 96 

<212> PRT 

<213> Danio rerio 

<400> 60 

Met Pro Asn Phe Cys Ala Ala Leu Asn Cys Ser Arg Asn Ser Thr His
1 5 10 15

Ser Val Leu Ala Phe Phe Arg Phe Pro Arg Asp Pro Glu Arg Cys Lys
20 25 30

Lys Trp Val Glu Asn Cys Ser Arg Ser Asp Leu Lys Asp Lys Thr Pro
35 40 45

Asp His Leu Asn Lys Tyr His Arg Leu Cys Ala Arg His Phe Glu Pro
50 55 60

Asn Leu Ile Thr Lys Thr Ser Pro Phe Arg Thr Val Leu Lys Asp Ser
65 70 75 80

Ala Val Pro Thr Ile Phe Asp Asn Pro Phe Lys Arg Ser Asn Asn Glu
85 90 95

<210> 61




<211> 99 

<212> PRT 

<213> Danio rerio 

<400> 61 

Met Pro Tyr Lys Cys Val Ala Tyr Gly Cys Gly Lys Ile Ser Gly Gln
1 5 10 15



Asn Val Ser Met Phe Arg Phe Pro Lys Asp Pro Glu Glu Phe Ser Lys 
Arc 

35 « 45 



25 



Trp Gin Arg Gin Val Gin Lys Thr Arg Arg Asn Trp Leu aL Asn Thr 



Tyr Ser His Leu Cys Asn Glu His Phe Thr Lys Asp Cys Phe Glu Pro
50 55 60

Lys Thr Tyr Val Thr Ala Lys Ala Ser Gly Phe Lys Arg Leu Lys Leu
65 70 75 80

Lys Asp Gly Ala Val Pro Thr Val Phe Ile Arg Arg Arg Cys Arg Lys
85 90 95

Cys Gly Gly



<210> 62 

<211> 90 

<212> PRT 

<213> Danio rerio 

<400> 62 

Met Gly Gly Cys Ser Ala Pro Asn Cys Ser Asn Ser Thr Thr Ile Gly
1 5 10 15

Lys Gln Leu Phe Arg Phe Pro Lys Asp Pro Val Arg Met Arg Lys Trp
20 25 30

Leu Val Asn Cys Arg Arg Asp Phe Val Pro Thr Pro Cys Ser Arg Leu
35 40 45

Cys Gln Asp His Phe Glu Glu Ser Gln Phe Glu Glu Ile Ala Arg Ser
50 55 60

Pro Ala Gly Gly Arg Lys Leu Lys Pro Asn Ala Ile Pro Thr Leu Phe
65 70 75 80

Asn Val Pro Asp Pro Pro Ser Pro Val Thr
85 90

<210> 63 

<211> 105 

<212> PRT 

<213> Danio rerio 

<400> 63

Met Val Leu Asn Cys Ala Tyr Pro Gly Cys Leu Asn Leu Phe Lys Lys
1 5 10 15

Glu Arg Leu Arg Ser Asn Ser Ser Ser His Gly Gly Lys Leu Thr Phe
20 25 30

His Arg Phe Pro Thr Leu Glu Pro Gly Arg Leu Leu Leu Trp Arg Ala
35 40 45

Ala Leu Gly Met Asp Pro Asp Thr Pro Met Arg Ser Leu Arg Val Trp
50 55 60

Arg Ile Cys Ser Glu His Phe Ser Pro Glu Asp Phe Arg Ala Val Asn
65 70 75 80

Gly Asn Lys Val Leu Leu Lys Ala Ser Ala Ile Pro Arg Val Tyr |° r
85 90 95

Thr Pro Ala Pro Gly Ser Arg Ala Asp
100 105

<210> 64

<211> 99 

<212> PRT 

<213> Danio rerio 



<400> 64 



Met Ala Ser Ser Arg Arg Cys Tyr Cys Ser Val Pro Gly Cys Ser Asn
1 5 10 15

Ser Lys Lys Arg His Pro Tyr Leu Ser Phe His Asp Phe Pro Lys Asp
20 25 30

Glu Gly Gln Arg Lys Ser Trp Val Lys Phe Ile Arg Arg Glu Glu Gly
35 40 45

Pro Phe Phe Gln Ile Lys Arg Gly Ser Thr Phe Val Cys Ser Met His
50 55 60

Phe Lys Ala Asp Asp Ile Tyr Thr Thr Ile Ser Gly Arg Arg Lys Ile
65 70 75 80

Asn Pro Gly Ala Ala Pro Arg Leu Phe Ser Trp Asn Asn Trp Ser Thr
85 90 95

Asp Lys Val





























<210> 65 

<211> 66 

<212> PRT 

<213> Danio rerio 

<400> 65 

Phe Pro Lys Glu Asn Val Leu Arg Lys Gln Trp Glu Ile Ala Leu Lys
1 5 10 15

Arg Lys Gly Phe Ser Ala Ser Glu Ser Ser Val Leu Cys Ser Glu His
20 25 30

Phe Arg Pro Gln Asp Leu Asp Arg Thr Gly Gln Thr Val Arg Val Arg
35 40 45

Asp Gly Ala Lys Pro Ser Val Phe Ser Phe Pro Ala His Met Gln Lys
50 55 60

His Val
65



<210> 66 

<211> 93 

<212> PRT 

<213> Danio rerio 

<400> 66 



Ser Ser Glu His Cys Cys Val Pro Leu Cys Gly Ala Ser Ser Arg Phe
1 5 10 15

Asn Ser Ala Val Ser Phe His Thr Phe Pro Val Ser Thr Glu Ile Arg
20 25 30

Glu Lys Trp Ile Lys Asn Ile Arg Arg Glu Lys Leu Asn Ile Thr Tyr
35 40 45

His Thr Arg Val Cys Cys Arg His Phe Thr Thr Asp Asp Leu Ile Gln
50 55 60

Pro Arg Asn Pro Ile Gly Arg Arg Leu Leu Arg Lys Gly Ala Val Pro
65 70 75 80

Thr Leu Phe Lys Trp Asn Gly Tyr Ser Asp Ala Glu Ala
85 90

<210> 67

<211> 93 

<212> PRT 

<213> Danio rerio 

<400> 67 

Met Pro Asp Phe Cys Ala Ala Tyr Gly Cys Ser Asn Glu Arg Thr Lys
1 5 10 15

Lys Leu Lys Asp Lys Gly Ile Thr Phe His Arg Phe Pro Arg ll p Val
20 25 30

Lys Arg Arg Gln Ala Trp Thr Leu Ala Leu Arg Arg Asp Lys Phe Glu
35 40 45

Pro Lys Pro Arg Ser Leu Leu Cys Ser Cys His Phe Arg Pro Glu Asp
50 55 60

Phe Asp Arg Thr Gly Gln Thr Val Arg Leu Arg Asp Gly Val Ile Pro
65 70 75 80

Ser Ile Phe Asn Phe Ser Asn Pro Leu Ser Lys Leu Ser
85 90



<210> 68 

<211> 97 

<212> PRT 

<213> Danio rerio 



<400> 68 

Met Pro Val Cys Ser Ala Tyr Lys Cys Lys Lys Arg Ser Asp Arg Glu
1 5 10 15

Tyr Lys Glu Ala Tyr Lys Arg Gly Glu Phe Ser Phe His Lys Phe Pro
20 25 30

Leu Glu Asp Gly Leu Arg Val Arg Glu Trp Leu Arg Arg Met Arg Trp
35 40 45

Gln Asn Trp Trp Pro Thr Gly Asn Ser Val Leu Cys Ser Asp His Phe
50 55 60

Glu Lys Asp Cys Phe Glu Gln Val Gly Ser His Lys Arg Leu Arg Lys
65 70 75 80

Ser Ala Val Pro Thr Ile Phe Asn Phe Pro Lys His Leu Gln Trp Lys
85 90 95

Val

<210> 69

<211> 90

<212> PRT

<213> Danio rerio

<400> 69

Met Val Leu Val Cys Ser Ala Tyr Asn Cys Lys Asn Thr Leu Arg Asn
1 5 10 15

Lys Ser Val Ser Phe His Leu Phe Pro Leu Lys Asp Pro Ser Leu Leu
20 25 30

Lys Lys Trp Leu Lys Asn Leu Arg Trp Lys Asp Trp Lys Pro Asn Pro
35 40 45

Asn Ser Lys Ile Cys Ser Ala His Phe Glu Glu Lys Jys Phe Ile Leu
50 55 60

Glu Gly Lys Lys Thr Arg Leu His Thr Trp Ala Val Pro Thr Ile Phe
65 70 75 80

Ser Phe Pro Asn Arg Phe Ser Glu Arg Asn
85 90

<210> 70

<211> 107 

<212> PRT 

<213> Danio rerio 

<400> 70 

Met Asn Ser Ile Ser Leu Lys Tyr Leu Arg Arg Glu Cys Ala Tyr Ser
1 5 10 15

Arg Tyr Cys Cys Val Pro Phe Cys Lys Ile Ser Ser Arg Phe Asn Ser
20 25 30

Val Ile Ser Phe His Lys Leu Pro Leu Asp Arg Ala Thr Arg Lys Met
35 40 45

Trp Leu His Asn Ile Arg Arg Lys Thr Phe Glu Val Ser Pro His Val
50 55 60

Arg Val Cys Ser Arg His Phe Thr Asn Asp Asp Phe Ile Glu Pro Ser
65 70 75 80

Tyr Pro Thr Ala Arg Arg Leu Leu Lys Lys Gly Ala Val Pro Thr Leu
85 90 95

Phe Arg Trp Asn Asn Asp Ser Thr Ser Gly Gln
100 105

<210> 71 

<211> 89 

<212> PRT 

<213> Danio rerio 

<400> 71 

Leu Arg Leu Arg Gln Ser Ala Ser Ser His Glu Glu Ser Leu Thr Phe
1 5 10 15

Tyr Ser Leu Pro Leu Gln Asp Phe Lys Arg Leu Asn Leu Trp Leu Asn
20 25 30

Ala Val Arg Arg Asp Thr Lys Ser Ser Ile Arg Asn Ile Arg Gly Leu
35 40 45

Arg Val Cys Ser Glu His Phe Ala Gln Asp Asp Phe Ser Leu Asn Arg
50 55 60

Gly Ser Lys Arg Arg Leu Lys Ser Thr Ala Val Pro Lys Cys Asn Glu
65 70 75 80

Ala Leu Pro Gln Ile Arg Arg Ala Gly
85



<210> 72 

<211> 105 

<212> PRT 

<213> Danio rerio 

<400> 72 

Met Val Ile Thr Cys Ala Cys Pro Gly Cys Asp Asn Arg Tyr Lys Thr
1 5 10 15

Leu Arg Leu Arg Ser Asp Ser Lys Phe His Pro Gly Lys Leu Thr Phe
20 25 30

His Lys Phe Pro Thr Ser Asp Pro Glu Arg Leu Lys Leu Trp Leu Leu
35 40 45

Ala Leu Gly Leu Asp Ile Asn Thr Pro Leu Ser Val Leu Glu Thr Arg
50 55 60

Arg Ile Cys Ser Asp His Phe Ser Pro Phe Asp Phe Lys Asp Thr Lys
65 70 75 80

Gly Ser Ile Val Gln Leu Lys Ser Trp Ala Val Pro Met Asn Leu Ser
85 90 95

Glu Gln Phe Val Asp Asp Pro Ser Lys
100 105

<210> 73 

<211> 96 

<212> PRT 

<213> Danio rerio 

<400> 73 

Met Pro Asp Cys Cys Ala Ala Ala Asn Cys Lys Gln Ser Thr Asp Gln
1 5 10 15

Ser Ser Val Ser Phe Phe Glu Phe Pro Leu Asp Pro Asp Arg Cys Arg
20 25 30

Gln Trp Val Gly Arg Cys Asn Arg Pro Asp Leu Gln Thr Lys Thr Pro
35 40 45

Glu Asp Leu His Lys Asn Tyr Lys Val Cys Ser Arg His Phe Glu Thr
50 55 60

Ser Met Ile Cys Gln Gln Ser Ala Val Lys Cys Ile Leu Lys Asp Asp
65 70 75 80

Ala Val Pro Thr Leu Phe Asn Phe Ser Thr Asn Gln Asp Asn Ala Gln
85 90 95

<210> 74 

<211> 91 

<212> PRT 

<213> Danio rerio 

<400> 74 

Met Val Lys Cys Thr Val Gln Gly Cys Ile Asn Phe Ser Asp Leu Arg
1 5 10 15

Pro Glu Glu Gln Pro Asn Arg Pro Arg Lys Arg Phe Phe Arg Phe Pro
20 25 30

Lys Asp Lys Val Leu Val Lys Val Trp Leu Ala Ala Leu Arg Asp Thr
35 40 45

Glu Arg Glu Ile Thr Asp Leu His Arg Ile Cys Glu Asp His Phe Leu
50 55 60

Ser His His Ile Thr Ala Asp Gly Ile Ser Pro Asp Ala Ile Pro Ile
65 70 75 80

Met Pro Pro Leu Asp Gly Pro Val Gly Asn Trp
85 90

<210> 75 

<211> 84 

<212> PRT 

<213> Danio rerio 

<400> 75 

Met Pro Ile Ser Cys Ser Ala Val Asp Cys Ser Asn Arg Phe Val Lys
1 5 10 15

Gly Ser Glu Ile Arg Phe Tyr Arg Phe Pro Ile Ser Lys Pro Gln Leu
20 25 30

Ala Glu Gln Trp Val Arg Ser Leu Gly Arg Lys Asn Phe Val Pro Thr
35 40 45

Gln Asn Ser Cys Leu Cys Ser Glu His Phe Gln Pro Asp Cys Phe Arg
50 55 60

Asp Tyr Asn Gly Lys Leu Phe Leu Arg Glu Asp Ala Val Pro Thr Ile
65 70 75 80

<210> 76
<211> 95 
<212> PRT 

<213> Oryzias latipes 
<400> 76 

Met Pro Asn Phe Cys Ala Ala Pro Asn Cys Thr Arg Lys Ser Thr Gln
1 5 10 15

Ser Asp Leu Ala Phe Phe Arg Phe Pro Arg Asp Pro Glu Arg Cys Arg
20 25 30

Ile Trp Val Glu Asn Cys Arg Arg Ala Asp Leu Glu Ala Lys Thr Ala
35 40 45

Asp Gln Leu Asn Lys His Tyr Arg Leu Cys Ala Lys His Phe Asp Pro
50 55 60

Ala Met Val Cys Lys Thr Ser Pro Tyr Arg Thr Val Leu Lys Asp Thr
65 70 75 80

Ala Ile Pro Thr Ile Phe Asp Leu Thr Ser His Leu Lys Asn Pro
85 90 95



<210> 77 
<211> 90 
<212> PRT 

<213> Oryzias latipes 
<400> 77 

Met Pro Thr Gly Cys Ala His Ala Asn Cys Arg Asn Val Val Gly Lys
1 5 10 15

Phe Arg Gly Val Thr Phe His Lys Phe Pro Arg Asp Pro Glu Lys Leu
20 25 30

Ser Arg Trp Thr Lys Phe Met Lys Arg His Glu Ser Trp Val Pro Lys
35 40 45

Tyr Tyr Asp Arg Val Cys Ser Val His Phe Ser Ser Glu His Phe Asp
50 55 60

Arg Thr Gly Gln Thr Val Arg Leu Arg Asp Asn Ala Glu Pro Ser Leu
65 70 75 80

Pro His Leu Pro Trp Arg Phe Pro Lys Ser
85 90



<210> 78 
<211> 94 
<212> PRT 

<213> Oryzias latipes 
<400> 78 

Met Gln Asn Arg Cys Ala Val Leu Thr Cys Pro Ser Gly Lys Thr Asp
1 5 10 15

Phe Gln Pro Met Phe Arg Phe Pro His Asp Gln Glu Arg Ser Arg Arg
20 25 30

Trp Val Glu Lys Cys Gln Gly Glu Asn Leu Ile Gly Lys Ser Pro Glu
35 40 45

Gln Leu Tyr Arg Tyr Tyr Arg Ile Cys Lys Arg His Phe Glu Thr Ser
50 55 60

Ala Phe Asp Cys Asp Ala Asp Gly Ala Val Leu Lys Lys Asp Ala Val
65 70 75 80

Pro Thr Ile Phe Asp Ala Ser Val Pro Pro Gln Ser Ser Gln
85 90
<210> 79 
<211> 92 
<212> PRT 

<213> Drosophila melanogaster 
<400> 79 

Met Pro Ala His Cys Ala Val Ile Asn Cys Ser His Lys Tyr Val His
1 5 10 15

Ala Gly Ser Ile Ser Phe His Arg Phe Pro Phe Lys Arg Lys Asp Leu
20 25 30

Leu Gln Lys Trp Lys Glu Phe Thr Gln Arg Ser Ala Gln Trp Met Pro
35 40 45

Ser Lys Trp Ser Ala Leu Cys Ser Arg His Phe Gly Asp Glu Asp Phe
50 55 60

Asn Cys Ser Asn Asn Arg Lys Thr Leu Lys Lys Asn Ala Val Pro Ser
65 70 75 80

Ile Arg Val Ser Glu Asp Asp Ser Met Ser Gly His
85 90

<210> 80 
<211> 90 
<212> PRT 

<213> Drosophila melanogaster 
<400> 80 

Met Pro Thr Ile Arg Arg Cys Cys Ile Ile Gly Cys Leu Ser Asn Ser
1 5 10 15

Arg Gln His Pro Ser Met Gln Phe Phe Ala Phe Pro Arg Pro Glu Asn
20 25 30

Pro Phe His Lys Leu Trp Lys Glu Ala Cys His Ala Ser Leu Arg Ara
35 40 45

Ile Val Pro Phe Lys Lys Pro Val Val Cys Ala Leu His Phe Asp Pro
50 55 60

Ser Val Leu Gly Gly Arg Arg Leu Gln Ser Asn Ala Leu Pro Thr Leu
65 70 75 80

Arg Leu Glu Val Pro Ser Asn Leu Glu Ala
85 90

<210> 81 
<211> 104 
<212> PRT 

<213> Drosophila melanogaster 
<400> 81 

Met Arg Cys Ala Val Pro Asn Cys Arg Asn Phe Ser Asp Cys Arg Ser
1 5 10 15

Lys Arg Asn Ala Ala Gln Gln Gln Arg Leu Gly Phe Phe Arg Phe Pro
20 25 30

Lys Cys Pro Asp Thr Phe Lys Ala Trp Leu Ala Phe Cys Gly Tyr Thr
35 40 45

Glu Glu Ser Leu Lys Leu Lys Asn Pro Cys Ile Cys Ile Glu His Phe
50 55 60

Lys Asp Glu Asp Ile Glu Gly Ser Leu Lys Phe Glu Met Gly Leu Ala
65 70 75 80

Lys Lys Arg Thr Leu Arg Pro Gly Ala Val Pro Cys Val Asn Lys Ser
85 90 95

Gln Glu Ser Gly Ser Asp Arg Ala
100



<210> 82 
<211> 96 
<212> PRT 

<213> Drosophila melanogaster 



<400> 82 



Met Gly Gly Thr Lys Cys Cys Phe Arg Asp Cys Pro Val Gly Ser Ser
1 5 10 15

Arg Asn Pro Asn Met His Phe Phe Lys Phe Pro Val Lys Asp Pro Lys
20 25 30

Arg Leu Lys Asp Trp Val Arg Asn Cys Ser Asn Pro Asp Val Ser Asn
35 40 45

Ala Pro Pro Ser Lys Leu Ala Ala Lys Thr Val Cys Ala Arg His Phe
50 55 60

Arg Ala Glu Cys Phe Met Asn Tyr Lys Met Asp Arg Leu Ile Pro Met
65 70 75 80

Gln Thr Pro Thr Leu Phe Arg Ile Asn Arg Asp Leu Ala Leu Asp Tyr
85 90 95



<210> 83 
<211> 96 
<212> PRT 

<213> Drosophila melanogaster 



<400> 83 



Met Ala Thr Arg Ser Cys Ala Tyr Lys Asp Cys Glu Tyr Tyr Tyr Val
1 5 10 15

Gly His Glu Asn Ala Leu Thr Lys Gly Arg Thr Leu Phe Ala Phe Pro
20 25 30

Lys Gln Pro Gln Arg Ala Arg Ile Trp His Glu Asn Gly Gln Val His
35 40 45

Pro Lys Ile Pro His Ser Gln Leu Phe Met Cys Ser Leu His Phe Asp
50 55 60

Arg Lys Phe Ile Ser Ser Ser Lys Asn Arg Thr Leu Leu Val Gly Glu
65 70 75 80

Ala Val Pro Phe Pro Tyr Glu Glu Ser Ser Ser Lys Pro Glu Glu Glu
85 90 95



<210> 84 
<211> 87 
<212> PRT 

<213> Drosophila melanogaster 
<400> 84 

Met Lys Tyr Cys Lys Phe Cys Cys Lys Ala Val Thr Gly Val Lys Leu
1 5 10 15

Ile His Val Pro Lys Cys Ala Ile Lys Arg Lys Leu Trp Glu Gln Ser
20 25 30

Leu Gly Cys Ser Leu Gly Glu Asn Ser Gln Ile Cys Asp Thr His Phe
35 40 45

Asn Asp Ser Gln Trp Lys Ala Ala Pro Ala Lys Gly Gln Thr Phe Lys
50 55 60

Arg Arg Arg Leu Asn Ala Asp Ala Val Pro Ser Lys Val Ile Glu Pro
65 70 75 80

Glu Pro Glu Lys Ile Lys Glu
85

<210> 85 
<211> 92 
<212> PRT 

<213> Anopheles gambiae 
<400> 85 

Met Pro Ala Ser Cys Val Ile Pro Asp Cys Asp Leu Lys Tyr Thr His
1 5 10 15

Gly Asp Asp Val Ser Phe His Lys Phe Pro Leu Lys Ser Pro Glu Leu
20 25 30

Leu Lys Gln Trp Ile Gln Phe Thr Gly Arg Asp Glu Gly Trp His Pro
35 40 45

Thr Lys Trp Ser Ala Leu Cys Ser Arg His Phe Val Ala Ser Asp Phe
50 55 60

Lys Gly Cys Ala Ala Arg Lys Ile Leu Leu Pro Thr Ala Val Pro Ser
65 70 75 80

Val Arg Asn Ala Val Ala Ala Lys Ala Gln Pro Asn
85 90

<210> 86 
<211> 108 
<212> PRT 

<213> Anopheles gambiae 
<400> 86 

Met Ser Ala Val Arg Ser Cys Ala Leu Cys Gln Asn Arg Ser Asn Ile
1 5 10 15

Thr Asp Gln Gln Thr Asp Asp Ala Leu Glu Arg Ile Thr Tyr His Lys
20 25 30

Phe Pro Thr Asn Pro Val Arg Arg Asp Arg Trp Ile Glu Phe Cys Asp
35 40 45

Leu Pro Lys Glu Ser Phe Pro Lys Ser Ala Tyr Lys Phe Leu Cys Ser
50 55 60

Ser His Phe Thr Pro Glu Cys Phe Glu Arg Asp Leu Arg Gly Glu Leu
65 70 75 80

Leu Tyr Gly Thr Lys Arg Met Thr Leu Gln Lys Asp Ala Met Pro Thr
85 90 95

Ile Arg Ser Val Ser Gln Gln Leu Lys Arg Thr Thr
100 105

<210> 87 
<211> 100 
<212> PRT 

<213> Anopheles gambiae 
<400> 87 

Met Trp Asp Cys Ala Val Ile Gly Cys Pro Asn Ser Arg Phe Asn Ala
1 5 10 15

Gln Lys Thr Arg Pro Arg Ile Ser Phe His Val Phe Pro His Pro Val
20 25 30

Arg Glu Ser Asn Arg Phe Arg Arg Trp Leu Ala Leu Ile Asn Asn Pro
35 40 45

Arg Leu Phe Arg Leu Asp Pro Leu Asn Val Phe Lys Ser Val Arg Val
50 55 60

Cys Arg Arg His Phe Gly Pro Asp Cys Phe Asn Gly Val Cys Arg Asn
65 70 75 80

Leu Leu Pro Thr Ala Ile Pro Thr Leu Asn Leu Pro Glu Val Arg Pro
85 90 95

Val Ala Leu Val
100



<210> 88 

<211> 95 

<212> PRT 

<213> Anopheles gambiae

<400> 88

Met Gly Ile Arg Lys Cys Ile Val Pro Glu Cys Pro Ser Ser Ser Ala
1 5 10 15

Arg Pro Glu Asp Arg Gly Val Thr Tyr His Lys Ile Pro Tyr Leu Asp
20 25 30

Glu Met Lys Arg Leu Trp Ile Val Ala Cys His Leu Pro Asp Asp Tyr
35 40 45

Phe Ala Thr Lys Ala Ser Asn Val Cys Ser Arg His Phe Arg Arg Ala
50 55 60

Asp Phe Gln Glu Phe Lys Gly Lys Lys Tyr Val Leu Lys Leu Gly Val
65 70 75 80

Val Pro Thr Val Phe Pro Trp Thr Val Thr Lys Pro Pro Gly Glu
85 90 95



<210> 89 
<211> 107 
<212> PRT 

<213> Anopheles gambiae 
<400> 89 



Met Gly Lys Ile Ser Gly Ser His Cys Leu Val Leu Gly Cys Arg Asn
1 5 10 15

Arg Gln Leu Leu Asn Gln Ala Asn Ile Arg Ser Tyr Phe Arg Phe Pro
20 25 30

Arg Asp Ala Asp Leu Cys Lys Lys Trp Val Asp Phe Cys Asn Arg Pro
35 40 45

Glu Leu Tyr Lys Lys Tyr Asp Glu Asn Gly Pro Glu Tyr Leu Tyr Lys
50 55 60

Ser Ser Arg Ile Cys Ser Asp His Phe Gln Pro Ala Asp Phe Asn Asn
65 70 75 80

Pro Asn Leu Phe Ser Gln Gly Leu Lys Lys Gly Ser Val Pro Ser Val
85 90 95

Asn Pro Ala Asn Leu Glu Ala Ala Lys Pro His
100 105













<210> 90 
<211> 104 
<212> PRT 

<213> Anopheles gambiae 



<400> 90 












Met Thr Asn Cys Ser Cys Ala Val Ala Asp Cys Asn Asn Asn Arg Arg
1 5 10 15

Asn Val Arg Lys Arg Met Leu Asp Ile Gly Phe His Thr Phe Pro Ser
20 25 30

Asp Pro Val Gln Arg Gln Arg Trp Val Lys Phe Cys Gln Arg Glu Pro
35 40 45

Ser Trp Gln Pro Lys Ser Cys Asp Ser Met Cys Ser Val His Phe Lys
50 55 60

Asp Thr Asp Tyr Gln Met Ser His Ser Pro Leu Ile Arg Leu Ala Thr
65 70 75 80

Asn Leu Arg Arg Leu Lys Pro Asp Val Ile Pro Thr Ile Arg Lys Gly
85 90 95

Arg Ala Ile Pro Val Ala Ala Arg
100



<210> 91 
<211> 95 
<212> PRT 

<213> Anopheles gambiae 
<400> 91 

Met Gly Gly Cys Arg Cys Thr Phe Arg Asp Cys Glu Asn Gly Thr Ala
1 5 10 15

Ser Arg Lys Glu Leu His Tyr Phe Arg Tyr Pro Val Arg Asp Gln Glu
20 25 30

Arg Leu Ile Glu Trp Ala Lys Asn Ala Asp Arg Leu Glu Phe Val Asp
35 40 45

Leu Pro Val Asp Lys Val Ser Asn Lys Val Val Cys Gln Glu His Phe
50 55 60

Glu Arg Lys Met Phe Met Asn Asp Leu Arg Asp Arg L<,_>hr Lys Met 
65 70 75 80

Ala Ile Pro Arg Leu Met Val Met Pro Asp Glu Thr Ile Val Asn
85 90 95



<210> 92 
<211> 97 
<212> PRT 

<213> Anopheles gambiae 
<400> 92 

Met Lys Cys Phe Val Ser Gly Cys Asp Thr Asp Asp Asn Val Val Ser
1 5 10 15

Tyr Thr Ser Val Phe Tyr Val Asn Cys Pro Thr Asp Pro Thr Ile Gln
20 25 30

Gln Gln Trp Phe Thr Leu Leu Glu Val Thr Asp Pro Asp Ala Met Arg
35 40 45

Ala Leu Val Asp Gly Arg Ser Lys Val Cys Ser Cys His Phe Thr Glu
50 55 60

Asp Cys Phe Gly His His Pro Val Tyr Gly Tyr Arg Tyr Leu Leu Ala
65 70 75 80

Thr Ala Leu Pro Thr Val Phe Pro Pro Arg Lys Glu Ile Glu Gln Pro
85 90 95

Lys



<210> 93 

<211> 92 

<212> PRT 

<213> Bombyx mori 

<400> 93 

Met Pro Arg Cys Ser Val Ile Val Cys Lys Asn Asn Ser Cys Ile Val
1 5 10 15

Asn Tyr Lys Lys Asp Ser Ile Ser Phe His Thr Tyr Pro Lys Asp Pro
20 25 30

Lys Ile Lys Glu Met Trp Ile Asn Ala Thr Gly Arg Gly Pro Ser Trp
35 40 45

Phe Pro Thr Lys Asn His Thr Ile Cys Ser Ser His Phe Glu Pro Lys
50 55 60

Cys Phe Gln Pro Leu Lys Lys Val Arg Arg Leu Phe Glu Trp Ser Val
65 70 75 80

Pro Thr Leu Lys Leu Arg Met Val Leu Met Asn Tyr
85 90



<210> 94 

<211> 96 

<212> PRT 

<213> Bombyx mori 

<400> 94 

Met Pro Asp Thr His Arg Thr Cys Glu Val Cys Gly Ile Lys Glu Arg
1 5 10 15

His Leu Thr Glu Lys Arg Phe Phe Ala Arg Phe Pro Leu Asp Val Asn
20 25 30

Arg Cys Lys Gln Trp Val Lys Met Val Gly Lys Glu Asp Leu Ala Tyr
35 40 45

Leu Gln Val His Met Leu His Asp Leu Lys His Val Cys Glu Ala His
50 55 60

Phe Ser Arg Arg Asp Phe Thr Lys Ser Lys Lys Arg Leu Lys Lys Arg
65 70 75 80

Ala Val Pro Lys Leu Asn Leu Thr Leu Pro Pro Leu Arg Asp Glu Ile
85 90 95



<210> 95 
<211> 89 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 95 

Met Pro Thr Thr Cys Gly Phe Pro Asn Cys Lys Phe Arg Ser Arg Tyr
1 5 10 15

Arg Gly Leu Glu Asp Asn Arg His Phe Tyr Arg Ile Pro Lys Arg Pro
20 25 30

Leu Ile Leu Arg Gln Arg Trp Leu Thr Ala Ile Gly Arg Thr Glu Glu
35 40 45

Thr Val Val Ser Gln Leu Arg Ile Cys Ser Ala His Phe Glu Gly Gly
50 55 60

Glu Lys Lys Glu Gly Asp Ile Pro Val Pro Asp Pro Thr Val Asp Lys
65 70 75 80

Gln Ile Lys Ile Glu Leu Pro Pro Lys
85



<210> 96 
<211> 100 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 96 

Met Tyr Gly Val Gln Ser Glu Cys Val Leu Cys Ala His Ala Asn Asp
1 5 10 15

Cys Thr Ala Met Ile Pro Phe Pro Gly Pro Asp Asp Glu Lys Leu Arg
20 25 30

Thr Lys Trp Ile Asn Ser Met Cys Arg Glu Pro Trp Ile Tyr Arg Tyr
35 40 45

Leu Ser Thr Arg Leu Glu Lys Pro Gly Arg His Tyr Leu Cys Ala Ser
50 55 60

His Phe Asn Arg Asn Ser Leu Arg Tyr His Ala Gly Leu Gly Leu Trn
65 70 75 80

Arg Arg Ala Ala Ala Cys Pro Val Leu Ala Cys Thr Thr Asp Glu Glu
85 90 95

Arg Gln Glu Val
100

<210> 97 
<211> 86 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 97 

Met Glu His Pro Leu Gln Cys Cys Tyr Cys Leu Glu Val Tyr Glu Lys
1 5 10 15

Arg Tyr Met Thr Gln Val Pro Lys Thr Glu Gln Arg Ile Ala Arg Trp
20 25 30

Val Ala Ile Leu Gly Glu Gln Phe Arg Ile Arg Leu Arg Met Lys Pro
35 40 45

Ala Asn Tyr Met Cys Arg Lys His Phe Pro Gln Ala Asp Phe Ser Ser
50 55 60

Arg Gly Arg Leu Leu Lys Thr Ala Val Pro Asn Val Val Ser Gln Glu
65 70 75 80

Lys Val Leu Ala Phe Lys
85

<210> 98 
<211> 97 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 98 

Asn Leu Thr His Lys Pro Cys Thr Val Cys Asn Arg Val Met Lys Ser
1 5 10 15

Gly Glu Met His Leu Asn Phe Pro Ala Asp Leu Asp Arg Arg Arg Ile
20 25 30

Trp Ala Asn Leu Leu Gly Phe Lys Tyr Lys Asp Ile Leu Arg Ser Lys
35 40 45

Met Gly Pro Val Ser Phe Ser Ile Ala Ala Gly Pro Ile Cys Thr Glu
50 55 60

His Phe Ala Glu Glu Cys Phe Arg Asn His Asn Phe Asn Lys Ser Ala
65 70 75 80

Ile Glu Ala Phe Gly Val Pro Val Ala Ile Ser Pro Asp Val Lys Thr
85 90 95

Thr



<210> 99 
<211> 210 
<212> PRT 

<213> Mus musculus 
<400> 99 

Met Val Gln Ser Cys Ser Ala Tyr Gly Cys Lys Asn Arg Tyr Asp Lys
1 5 10 15

Asp Lys Pro Val Ser Phe His Lys Phe Pro Leu Thr Arg Pro Ser Leu
20 25 30

Cys Lys Gln Trp Glu Ala Ala Val Lys Arg Lys Asn Phe Lys Pro Thr
35 40 45

Lys Tyr Ser Ser Ile Cys Ser Glu His Phe Thr Pro Asp Cys Phe Lys
50 55 60

Arg Glu Cys Asn Asn Lys Leu Leu Lys Glu Asn Ala Val Pro Thr Ile
65 70 75 80

Phe Leu Tyr Ile Glu Pro His Glu Lys Lys Glu Asp Leu Glu Ser Gln
85 90 95

Glu Gln Leu Pro Ser Pro Ser Pro Pro Ala Ser Gln Val Asp Ala Ala
100 105 110

Ile Gly Leu Leu Met Pro Pro Leu Gln Thr Pro Asp Asn Leu Ser Val
115 120 125

Phe Cys Asp His Asn Tyr Thr Val Glu Asp Thr Met His Gln Arg Lys
130 135 140

Arg Ile Leu Gln Leu Glu Gln Gln Val Glu Lys Leu Arg Lys Lys Leu
145 150 155 160

Lys Thr Ala Gln Gln Arg Cys Arg Arg Gln Glu Arg Gln Leu Glu Lys
165 170 175

Leu Lys Glu Val Val His Phe Gln Arg Glu Lys Asp Asp Ala Ser Glu
180 185 190

Arg Gly Tyr Val Ile Leu Pro Asn Asp Tyr Phe Glu Ile Val Glu Val
195 200 205

Pro Ala
210

<210> 100
<211> 217
<212> PRT
<213> Mus musculus
<400> 100

Met Pro Thr Asn Cys Ala Ala Ala Gly Cys Ala Ala Thr Tyr Asn Lys
1 5 10 15

His Ile Asn Ile Ser Phe His Arg Phe Pro Leu Asp Pro Lys Arg Arg
20 25 30

Lys Glu Trp Val Arg Leu Val Arg Arg Lys Asn Phe Val Pro Gly Lys
35 40 45

His Thr Phe Leu Cys Ser Lys His Phe Glu Ala Ser Cys Phe Asp Leu
50 55 60

Thr Gly Gln Thr Arg Arg Leu Lys Met Asp Ala Val Pro Thr Ile Phe
65 70 75 80

Asp Phe Cys Thr His Ile Lys Ser Leu Lys Leu Lys Ser Arg Asn Leu
85 90 95

Leu Lys Thr Asn Asn Ser Phe Pro Pro Thr Gly Pro Cys Asn Leu Lys
100 105 110

Leu Asn Gly Ser Gln Gln Val Leu Leu Glu His Ser Tyr Ala Phe Arg
115 120 125

Asn Pro Met Glu Ala Lys Lys Arg Ile Ile Lys Leu Glu Lys Glu Ile
130 135 140

Ala Ser Leu Arg Lys Lys Met Lys Thr Cys Leu Gln Arg Glu Arg Arg
145 150 155 160

Ala Thr Arg Arg Trp Ile Lys Ala Thr Cys Phe Val Lys Ser Leu Glu
165 170 175

Ala Ser Asn Met Leu Pro Lys Gly Ile Ser Glu Gln Ile Leu Pro Thr
180 185 190

Ala Leu Ser Asn Leu Pro Leu Glu Asp Leu Lys Ser Leu Glu Gln Asp
195 200 205

Gln Gln Asp Lys Thr Val Pro Ile Leu
210 215

<210> 101 
<211> 218 
<212> PRT 

<213> Mus musculus 
<400> 101 

Met Pro Lys Ser Cys Ala Ala Arg Gln Cys Cys Asn Arg Tyr Ser Ser
1 5 10 15

Arg Arg Lys Gln Leu Thr Phe His Arg Phe Pro Phe Ser Arg Pro Glu
20 25 30

Leu Leu Arg Glu Trp Val Leu Asn Ile Gly Arg Ala Asp Phe Lys Pro
35 40 45

Lys Gln His Thr Val Ile Cys Ser Glu His Phe Arg Pro Glu Cys Phe
50 55 60

Ser Ala Phe Gly Asn Arg Lys Asn Leu Lys His Asn Ala Val Pro Thr
65 70 75 80

Val Phe Ala Phe Gln Asn Pro Thr Glu Val Cys Pro Glu Val Gly Ala
85 90 95

Gly Gly Asp Ser Ser Gly Arg Asn Met Asp Thr Thr Leu Glu Glu Leu
100 105 110

Gln Pro Pro Thr Pro Glu Gly Pro Val Gln Gln Val Leu Pro Asp Arg
115 120 125

Glu Ala Met Glu Ala Thr Glu Ala Ala Gly Leu Pro Ala Ser Pro Leu
130 135 140

Gly Leu Lys Arg Pro Leu Pro Gly Gln Pro Ser Asp His Ser Tyr Ala
145 150 155 160

Leu Ser Asp Leu Asp Thr Leu Lys Lys Lys Leu Phe Leu Thr Leu Lys
165 170 175

Glu Asn Lys Arg Leu Arg Lys Arg Leu Lys Ala Gln Arg Leu Leu Leu
180 185 190

Arg Arg Thr Cys Gly Arg Leu Arg Ala Tyr Arg Glu Gly Gln Pro Gly
195 200 205

Pro Arg Ala Arg Arg Pro Ala Gln Gly Ser
210 215

<210> 102 
<211> 205 
<212> PRT 

<213> Mus musculus 
<400> 102 

Met Val Ile Cys Cys Ala Ala Val Asn Cys Ser Asn Arg Gln Gly Lys
1 5 10 15

Gly Glu Lys Arg Ala Val Ser Phe His Arg Phe Pro Leu Lys lip Ser
20 25 30

Lys Arg Leu Ile Gln Trp Leu Lys Ala Val Gln Arg Asp Asn Trp Thr
35 40 45

Pro Thr Lys Tyr Ser Phe Leu Cys Ser Glu His Phe Thr Lys Asp Ser
50 55 60

Phe Ser Lys Arg Leu Glu Asp Gln His Arg Leu Leu Lys Pro Thr Ala
65 70 75 80

Val Pro Ser Ile Phe His Leu Ser Glu Lys Lys Arg Gly Ala Gly Gly
85 90 95

His Gly His Ala Arg Arg Lys Thr Thr Ala Ala Met Arg Gly His Thr
100 105 110

Ser Ala Glu Thr Gly Lys Gly Thr Ile Gly Ser Ser Leu Ser Ser Ser
115 120 125

Asp Asn Leu Met Ala Lys Pro Glu Ser Arg Lys Leu Lys Arg Ala Ser
130 135 140

Leu Gln Asp Asp Ala Ala Pro Lys Val Thr Pro Gly Ala Val Ser Gln
145 150 155 160

Glu Gln Gly Gln Ser Leu Glu Lys Thr Pro Gly Asp Asp Pro Ala Ala
165 170 175

Pro Leu Ala Arg Gly Gln Glu Glu Ala Gln Ala Ser Ala Thr Glu Ala
180 185 190

Asp His Gln Lys Ala Ser Ser Ser Thr Asp Ala Glu Gly
195 200 205



<210> 103 
<211> 186 
<212> PRT 

<213> Mus musculus 
<400> 103 

Ile Leu Gln Ala Phe Gly Ser Leu Lys Lys Gly Asp Val Leu Cys Ser
1 5 10 15

Arg His Phe Lys Lys Thr Asp Phe Asp Arg Ser Thr Leu Asn Thr Lys
20 25 30

Leu Lys Ala Gly Ala Ile Pro Ser Ile Phe Glu Cys Pro Tyr His Leu
35 40 45

Gln Glu Lys Arg Glu Lys Leu His Cys Arg Lys Asn Phe Leu Leu Lys
50 55 60

Thr Leu Pro Ile Thr His His Gly Arg Gln Leu Val Gly Ala Ser Cys
65 70 75 80

Ile Glu Glu Phe Glu Pro Gln Phe Ile Phe Glu His Ser Tyr Ser Val
85 90 95

Met Asp Ser Pro Lys Lys Leu Lys His Lys Leu Asp Arg Val Ile Ile
100 105 110

Glu Leu Glu Asn Thr Lys Glu Ser Leu Arg Asn Val Leu Ala Arg Glu
115 120 125

Lys His Phe Gln Lys Ser Leu Arg Lys Thr Ile Met Glu Leu Lys Asp
130 135 140

Glu Ser Leu Ile Ser Gln Glu Thr Ala Asn Ser Leu Gly Ala Phe Cys
145 150 155 160

Trp Glu Cys Tyr His Glu Ser Thr Ala Gly Gly Cys Ser Cys Glu Val
165 170 175

Ile Ser Tyr Met Leu His Leu Gln Leu Thr
180 185



<210> 104 
<211> 194 
<212> PRT 

<213> Mus musculus 
<400> 104 

Met Pro Arg His Cys Ser Ala Ala Gly Cys Cys Thr Arg Asp Thr Arg
1 5 10 15

Glu Thr Arg Asn Arg Gly Ile Ser Phe His Arg Leu Pro Lys Lys Asp
20 25 30

Asn Pro Arg Arg Gly Leu Trp Leu Ala Asn Cys Gln Arg Leu Asp Pro
35 40 45

Ser Gly Gln Gly Leu Trp Asp Pro Thr Ser Glu Tyr Ile Tyr Phe Cys
50 55 60

Ser Lys His Phe Glu Glu Asn Cys Phe Glu Leu Val Gly Ile Ser Gly
65 70 75 80

Tyr His Arg Leu Lys Glu Gly Ala Val Pro Thr Ile Phe Glu Ser Phe
85 90 95

Ser Lys Leu Arg Arg Thr Ala Lys Thr Lys Gly His Gly Tyr Pro Pro
100 105 110

Gly Leu Pro Asp Val Ser Arg Leu Arg Arg Cys Arg Lys Arg Cys Ser
115 120 125

Glu Arg Gln Gly Pro Thr Thr Pro Phe Ser Pro Pro Pro Arg Ala Asp
130 135 140

Ile Ile Cys Phe Pro Val Glu Glu Ala Ser Ala Pro Ala Thr Leu Pro
145 150 155 160

Ala Ser Pro Ala Val Arg Leu Asp Pro Gly Leu Asn Ser Pro Phe Ser
165 170 175

Asp Leu Leu Gly Pro Leu Gly Ala Gln Ala Asp Glu Ala Gly Cys Ser
180 185 190

Thr Gln



<210> 105 
<211> 305 
<212> PRT 
<213> Mus musculus 

<400> 105 

Met Pro Gly Phe Thr Cys Cys Val Pro Gly Cys Tyr Asn Asn Ser His
1 5 10 15

Arg Asp Lys Ala Leu His Phe Tyr Thr Phe Pro Lys Asp Ala Glu Leu
20 25 30

Arg Arg Leu Trp Leu Lys Asn Val Ser Arg Ala Gly Val Ser Gly Cys
35 40 45

Phe Ser Thr Phe Gln Pro Thr Thr Gly His Arg Leu Cys Ser Val His
50 55 60

Phe Gln Gly Gly Arg Lys Thr Tyr Thr Val Arg Val Pro Thr Ile Phe
65 70 75 80

Pro Leu Arg Gly Val Asn Glu Arg Lys Val Ala Arg Arg Pro Ala Gly
85 90 95

Ala Ala Ala Ala Arg Arg Arg Gln Gln Gln Gln Gln Gln Gln Gln Gln
100 105 110

Gln Gln Gln Gln Gln Gln Leu Gln Gln Gln Gln Pro Ser Pro Ser Ser
115 120 125

Ser Thr Ala Gln Thr Thr Gln Leu Gln Pro Asn Leu Val Ser Ala Ser
130 135 140

Ala Ala Val Leu Leu Thr Leu Gln Ala Ala Val Asp Ser Asn Gln Ala
145 150 155 160

Pro Gly Ser Val Val Pro Val Ser Thr Thr Pro Ser Gly Asp Asp Val
165 170 175

Lys Pro Ile Asp Leu Thr Val Gln Val Glu Phe Ala Ala Ala Glu Gly
180 185 190

Ala Ala Ala Ala Ala Ala Ala Ser Glu Leu Glu Ala Ala Thr Ala Gly
195 200 205

Leu Glu Ala Ala Glu Cys Thr Leu Gly Pro Gln Leu Val Val Val Gly
210 215 220

Glu Glu Gly Phe Pro Asp Thr Gly Ser Asp His Ser Tyr Ser Leu Ser
225 230 235 240

Ser Gly Thr Thr Glu Glu Glu Leu Leu Arg Lys Leu Asn Glu Gln Arg
245 250 255

Asp Ile Leu Ala Leu Met Glu Val Lys Met Lys Glu Met Lys Gly Ser
260 265 270

Ile Arg His Leu Arg Leu Thr Glu Ala Lys Leu Arg Glu Glu Leu Arg
275 280 285

Glu Lys Asp Arg Leu Leu Ala Met Ala Val Ile Arg Lys Lys His Gly
290 295 300






Met 
305 



<210> 106 
<211> 305 
<212> PRT 

<213> Mus musculus 



<400> 106 
Met Pro Gly Phe Thr Cys Cys Val Pro Gly Cys Tyr Asn Asn Ser His
1 5 10 15

Arg Asp Lys Ala Leu His Phe Tyr Thr Phe Pro Lys Asp Ala Glu Leu
20 25 30

Arg Arg Leu Trp Leu Lys Asn Val Ser Arg Ala Gly Val Ser Gly Cys
35 40 45

Phe Ser Thr Phe Gln Pro Thr Thr Gly His Arg Leu Cys Ser Val His
50 55 60

Phe Gln Gly Gly Arg Lys Thr Tyr Thr Val Arg Val Pro Thr Ile Phe
65 70 75 80

Pro Leu Arg Gly Val Asn Glu Arg Lys Val Ala Arg Arg Pro Ala Gly
85 90 95

Ala Ala Ala Ala Arg Arg Arg Gln Gln Gln Gln Gln Gln Gln Gln Gln
100 105 110

Gln Gln Gln Gln Gln Gln Leu Gln Gln Gln Gln Pro Ser Pro Ser Ser
115 120 125

Ser Thr Ala Gln Thr Thr Gln Leu Gln Pro Asn Leu Val Ser Ala Ser
130 135 140

Ala Ala Val Leu Leu Thr Leu Gln Ala Ala Val Asp Ser Asn Gln Ala
145 150 155 160

Pro Gly Ser Val Val Pro Val Ser Thr Thr Pro Ser Gly Asp Asp Val
165 170 175

Lys Pro Ile Asp Leu Thr Val Gln Val Glu Phe Ala Ala Ala Glu Gly
180 185 190

Ala Ala Ala Ala Ala Ala Ala Ser Glu Leu Glu Ala Ala Thr Ala Gly
195 200 205

Leu Glu Ala Ala Glu Cys Thr Leu Gly Pro Gln Leu Val Val Val Gly
210 215 220

Glu Glu Gly Phe Pro Asp Thr Gly Ser Asp His Ser Tyr Ser Leu Ser
225 230 235 240

Ser Gly Thr Thr Glu Glu Glu Leu Leu Arg Lys Leu Asn Glu Gln Arg
245 250 255

Asp Ile Leu Ala Leu Met Glu Val Lys Met Lys Glu Met Lys Gly Ser
260 265 270

Ile Arg His Leu Arg Leu Thr Glu Ala Lys Leu Arg Glu Glu Leu Arg
275 280 285

Glu Lys Asp Arg Leu Leu Ala Met Ala Val Ile Arg Lys Lys His Gly
290 295 300

Met
305



<210> 107 
<211> 652 
<212> PRT 

<213> Mus musculus 
<400> 107 

Met Pro Asn Phe Cys Ala Ala Pro Asn Cys Thr Arg Lys Ser Thr Gln
1 5 10 15

Ser Asp Leu Ala Phe Phe Arg Phe Pro Arg Asp Pro Ala Arg Cys Gln
20 25 30

Lys Trp Val Glu Asn Cys Arg Arg Ala Asp Leu Glu Asp Lys Thr Pro
35 40 45

Asp Gln Leu Asn Lys His Tyr Arg Leu Cys Ala Lys His Phe Glu Thr
50 55 60

Ser Met Ile Cys Arg Thr Ser Pro Tyr Arg Thr Val Leu Arg Asp Asn
65 70 75 80

Ala Ile Pro Thr Ile Phe Asp Leu Thr Ser His Leu Asn Asn Pro His
85 90 95

Ser Arg His Arg Lys Arg Ile Lys Glu Leu Ser Glu Asp Glu Ile Arg
100 105 110

Thr Leu Lys Gln Lys Lys Ile Glu Glu Thr Ser Glu Gln Glu Gln Glu
115 120 125

Thr Asn Thr Asn Ala Gln Asn Pro Ser Ala Glu Ala Val Asn Gln Gln
130 135 140

Asp Ala Asn Val Leu Pro Leu Thr Leu Glu Glu Lys Glu Asn Lys Glu
145 150 155 160

Tyr Leu Lys Ser Leu Phe Glu Ile Leu Val Leu Met Gly Lys Gln Asn
165 170 175

Ile Pro Leu Asp Gly His Glu Ala Asp Glu Val Pro Glu Gly Leu Phe
180 185 190

Ala Pro Asp Asn Phe Gln Ala Leu Leu Glu Cys Arg Ile Asn Ser Gly
195 200 205

Glu Glu Val Leu Arg Lys Arg Phe Glu Ala Thr Ala Val Asn Thr Leu
210 215 220

Phe Cys Ser Lys Thr Gln Gln Arg His Met Leu Glu Ile Cys Glu Ser
225 230 235 240

Cys Ile Arg Glu Glu Thr Leu Arg Glu Val Arg Asp Ser His Phe Phe
245 250 255

Ser Ile Ile Thr Asp Asp Val Val Asp Ile Ala Gly Glu Glu His Leu
260 265 270

Pro Val Leu Val Arg Phe Val Asp Asp Ala His Asn Leu Arg Glu Glu
275 280 285

Phe Val Gly Phe Leu Pro Tyr Glu Ala Asp Ala Glu Ile Leu Ala Val
290 295 300

Lys Phe His Thr Thr Ile Thr Glu Lys Trp Gly Leu Asn Met Glu Tyr
305 310 315 320

Cys Arg Gly Gln Ala Tyr Ile Val Ser Ser Gly Phe Ser Ser Lys III
325 330 335

Lys Val Val Ala Ser Arg Leu Leu Glu Lys Tyr Pro Gln Ala Val Tyr
340 345 350

Thr Leu Cys Ser Ser Cys Ala Leu Asn Ala Trp Leu Ala Lys Ser Val
355 360 365

Pro Val Ile Gly Val Ser Val Ala Leu Gly Thr Ile Glu Glu Val Cys
370 375 380

Ser Phe Phe His Arg Ser Pro Gln Leu Leu Leu Glu Leu Asp Ser Val
385 390 395 400

Ile Ser Val Leu Phe Gln Asn Ser Glu Glu Arg Ala Lys Glu Leu Lys
405 410 415

Glu Ile Cys His Ser Gln Trp Thr Gly Arg His Asp Ala Phe Glu Ile
420 425 430

Leu Val Asp Leu Leu Gln Ala Leu Val Leu Cys Leu Asp Gly Ile Ile
435 440 445

Asn Ser Asp Thr Asn Val Arg Trp Asn Asn Tyr Ile Ala Gly Arg Ala
450 455 460

Phe Val Leu Cys Ser Ala Val Thr Asp Phe Asp Phe Ile Val Thr Ile
465 470 475 480

Val Val Leu Lys Asn Val Leu Ser Phe Thr Arg Ala Phe Gly Lys Asn
485 490 495

Leu Gln Gly Gln Thr Ser Asp Val Phe Phe Ala Ala Ser Ser Leu Thr
500 505 510
Ala Val Leu His Ser Leu Asn Glu Val Met Glu Asn Ile Glu Val Tyr
515 520 525

His Glu Phe Trp Phe Glu Glu Ala Thr Asn Leu Ala Thr Lys Leu Asp
530 535 540

Ile Gln Met Lys Leu Pro Gly Lys Phe Arg Arg Ala Gln Gln Gly Asn
545 550 555 560

Leu Glu Ser Gln Leu Thr Ser Glu Ser Tyr Tyr Lys Asp Thr Leu Ser
565 570 575

Val Pro Thr Val Glu His Ile Ile Gln Glu Leu Lys Asp Ile Phe Ser
580 585 590

Glu Gln His Leu Lys Ala Leu Lys Cys Leu Ser Leu Val Pro Ser Val
595 600 605

Met Gly Gln Leu Lys Phe Asn Thr Ser Glu Glu His His Ala Asp Met
610 615 620

Tyr Arg Ser Asp Leu Pro Asn Pro Asp Thr Leu Ser Ala Glu Leu His
625 630 635 640

Cys Trp Arg Ile Lys Trp Lys His Arg Gly Lys Asp
645 650











<210> 108 
<211> 180 
<212> PRT 

<213> Rattus norvegicus 
<220> 

<223> RAT THAP 

<221> UNSURE 
<222> 95 

<223> Xaa = any of the twenty amino acids 



<400> 108 



Arg Gln Cys Cys Asn Arg Tyr Ser Ser Arg Arg Lys Gln Leu Thr Phe
1 5 10 15

His Arg Phe Pro Phe Ser Arg Pro Glu Leu Leu Arg Glu Trp Val Leu
20 25 30

Asn Ile Gly Arg Ala Asp Phe Lys Pro Lys Gln His Thr Val Ile Cys
35 40 45

Ser Glu His Phe Arg Pro Glu Cys Phe Ser Ala Phe Gly Asn Arg Lys
50 55 60

Asn Leu Lys His Asn Ala Val Pro Thr Val Phe Ala Phe Gln Asn Pro
65 70 75 80

Ala Gln Val Cys Pro Glu Val Gly Ala Gly Gly Asp Ser Ser Xaa Arg
85 90 95

Asn Met Asp Ala Thr Leu Glu Glu Leu Gln Ser Pro Asn Thr Glu Gly
100 105 110

Pro Met Gln Gln Val Leu Pro Asp Arg Gln Ala Thr Glu Ala Met Glu
115 120 125

Ala Ala Gly Leu Pro Ala Gly Pro Leu Gly Leu Lys Arg Pro Leu Pro
130 135 140

Gly Gln Pro Ser Asp His Ser Tyr Ala Leu Leu Asp Leu Asp Thr Leu
145 150 155 160

Lys Lys Lys Leu Phe Leu Thr Leu Lys Glu Asn Lys Arg Leu Arg Lys
165 170 175

Arg Leu Lys Ala
180



<210> 109 
<211> 82 
<212> PRT

<213> Rattus norvegicus
<400> 109

Met Val Lys Cys Cys Ser Ala Ile Gly Cys Ala Ser Arg Cys Leu Pro
1 5 10 15

Asn Ser Lys Leu Lys Gly Leu Thr Phe His Val Phe Pro Thr Asp Glu
20 25 30

Asn Ile Lys Arg Lys Trp Val Leu Ala Met Lys Arg Leu Asp Val Asn
35 40 45

Thr Ala Gly Ile Trp Glu Pro Ser Leu Gln Pro Glu Ser Phe Tyr Phe
50 55 60

Ile Phe Met Glu Asn Leu Phe Phe Ile Leu Pro Pro Gln Leu Ser His
65 70 75 80

Ala Val



<210> 110 
<211> 309 
<212> PRT 

<213> Rattus norvegicus 
<400> 110 

Met Pro Arg His Cys Ser Ala Ala Gly Cys Cys Thr Arg Asp Thr Arg
1 5 10 15

Glu Thr Arg Asn Arg Gly Ile Ser Phe His Arg Leu Pro Lys Lys Asp
20 25 30

Asn Pro Arg Arg Gly Leu Trp Leu Ala Asn Cys Gln Arg Leu Asp Pro
35 40 45

Ser Gly Gln Gly Leu Trp Asp Pro Thr Ser Glu Tyr Ile Tyr Phe Cys
50 55 60

Ser Lys His Phe Glu Glu Asn Cys Phe Glu Leu Val Gly Ile Ser Gly
65 70 75 80

Tyr His Arg Leu Lys Glu Gly Ala Val Pro Thr Ile Phe Glu Ser Phe
85 90 95

Ser Lys Leu Arg Arg Thr Ala Lys Thr Lys Val His Gly Tyr Pro Pro
100 105 110

Gly Leu Pro Asp Val Ser Arg Leu Arg Arg Cys Arg Lys Arg Cys Ser
115 120 125

Glu Arg Gln Gly Pro Thr Ile Pro Phe Ser Pro Pro Pro Arg Ala Asp
130 135 140

Ile Ile Arg Phe Pro Val Glu Glu Ala Ser Ala Pro Ala Thr Leu Pro
145 150 155 160

Ala Ser Pro Ala Ala Arg Leu Asp Pro Gly Leu Asn Ser Pro Phe Ser
165 170 175

Asp Leu Leu Gly Pro Leu Gly Ala Gln Ala Asp Glu Ala Gly Cys Ser
180 185 190

Ala Gln Pro Ser Pro Glu Gln His Pro Ser Pro Leu Glu Pro Gln His
195 200 205

Val Ser Pro Ser Thr Tyr Met Leu Arg Leu Pro Pro Pro Ala Gly Ala
210 215 220

Tyr Ile Gln Asn Glu His Ser Tyr Gln Val Gly Ser Ala Leu Leu Trp
225 230 235 240

Lys Arg Arg Ala Glu Ala Ala Leu Asp Ala Leu Asp Lys Thr Gln Arg
245 250 255

Gln Leu Gln Ala Cys Lys Arg Arg Glu Gln Arg Leu Arg Leu Arg Leu
260 265 270

Thr Lys Leu Gln Gln Glu Arg Ala Arg Glu Lys Arg Ala Gln Ala Asp
275 280 285

Ala Arg Gln Thr Leu Lys Asp His Val Gln Asp Phe Ala Met Gln Leu
290 295 300

Ser Ser Ser Met Ala
305



<210> 111 
<211> 142 
<212> PRT 

<213> Rattus norvegicus 
<400> 111 



Met Pro Asn Phe Cys Ala Ala Pro Asn Cys Thr Arg Lys Ser Thr Gln
1 5 10 15

Ser Asp Leu Ala Phe Phe Arg Phe Pro Arg Asp Pro Ala Arg Cys Gln
20 25 30

Lys Trp Val Glu Asn Cys Arg Arg Ala Asp Leu Glu Asp Lys Thr Pro
35 40 45

Asp Gln Leu Asn Lys His Tyr Arg Leu Cys Ala Lys His Phe Glu Thr
50 55 60

Ser Met Ile Cys Arg Thr Ser Pro Tyr Arg Thr Val Leu Arg Asp Asn
65 70 75 80

Ala Ile Pro Thr Ile Phe Asp Leu Thr Ser His Leu Asn Asn Pro His
85 90 95

Ser Arg His Arg Lys Arg Ile Lys Glu Leu Ser Glu Asp Glu Ile Arg
100 105 110

Thr Leu Lys Gln Lys Lys Ile Glu Glu Thr Ser Glu Gln Glu Gln Gly
115 120 125

Thr Asn Ser Asn Ala Gln Tyr Pro Ser Ala Glu Val Gly Asn
130 135 140











<210> 112 
<211> 104 
<212> PRT 
<213> Sus scrofa 

<400> 112 



Met Val Lys Cys Cys Ser Ala Ile Gly Cys Ala Ser Arg Cys Leu Pro
1 5 10 15

Asn Ser Lys Leu Lys Gly Leu Thr Phe His Val Phe Pro Thr Asp Glu
20 25 30

Lys Val Lys Arg Lys Trp Val Leu Ala Met Lys Arg Leu Asp Val Asn
35 40 45

Ala Ala Gly Met Trp Glu Pro Lys Lys Gly Asp Val Leu Cys Ser Arg
50 55 60

His Phe Lys Lys Thr Asp Phe Asp Arg Thr Thr Pro Asn Ile Lys Leu
65 70 75 80

Lys Pro Gly Val Ile Pro Ser Ile Phe Asp Ser Pro Ser His Leu Thr
85 90 95

Gly Glu Glu Arg Lys Ala Pro Leu
100



















<210> 113 
<211> 235 
<212> PRT 
<213> Sus scrofa 

<220> 

<221> UNSURE 
<222> 57, 124, 192 
<223> Xaa = any of the twenty amino acids
<400> 113

Met Pro Arg His Cys Ser Ala Ala Gly Cys Cys Thr Arg Asp Thr Arg
1 5 10 15

Glu Thr Arg Asn Arg Gly Ile Ser Phe His Arg Leu Pro Lys Lys Asp
20 25 30

Asn Pro Arg Arg Gly Leu Trp Leu Ala Asn Cys Gln Arg Leu Asp Pro
35 40 45

Ser Gly Gln Gly Leu Trp Asp Pro Xaa Ser Glu Tyr Ile Tyr Phe Cys
50 55 60

Ser Lys His Phe Glu Glu Asn Cys Phe Glu Leu Val Gly Ile Ser Gly
65 70 75 80

Tyr His Arg Leu Lys Glu Gly Ala Val Pro Thr Ile Phe Glu Ser Phe
85 90 95

Ser Lys Leu Arg Arg Thr Ala Lys Thr Lys Gly His Ser Tyr Pro Pro
100 105 110

Gly Pro Pro Asp Val Ser Arg Leu Arg Arg Cys Xaa Lys Arg Cys Ser
115 120 125

Glu Gly Arg Gly Pro Thr Thr Pro Phe Ser Pro Pro Pro Pro Ala Asp
130 135 140

Val Thr Cys Phe Pro Val Glu Glu Ala Ser Ala Pro Ala Ala Leu Ser
145 150 155 160

Ala Ser Pro Thr Gly Arg Leu Glu Pro Gly Leu Ser Ser Pro Phe Ser
165 170 175

Asp Leu Leu Gly Pro Leu Gly Ala Gln Ala Asp Glu Ala Gly Jy5 Xaa
180 185 190

Thr Gln Pro Ser Pro Glu Arg Glu Pro Glu Arg Gln Pro Ser Pro Leu
195 200 205

Glu Pro Arg Pro Val Ser Pro Ser Ala Tyr Met Leu Arg Leu Pro Pro
210 215 220

Pro Ala Gly Ala Tyr Ile Gln Asn Glu His Ser
225 230 235

<210> 114 
<211> 149 
<212> PRT 
<213> Sus scrofa 

<400> 114 

Met Thr Arg Ser Cys Ser Ala Val Gly Cys Ser Thr Arg Asp Thr Val
1 5 10 15

Leu Ser Arg Glu Arg Gly Leu Ser Phe His Gln Phe Pro Thr Asp Thr
20 25 30

Ile Gln Arg Ser Gln Trp Ile Arg Ala Val Asn Arg Met Asp Pro Arg
35 40 45

Ser Lys Lys Ile Trp Ile Pro Gly Pro Gly Ala Met Leu Cys Ser Lys
50 55 60

His Phe Gln Glu Ser Asp Phe Glu Ser Tyr Gly Ile Arg Arg Lys Leu
65 70 75 80

Lys Lys Gly Ala Val Pro Ser Val Ser Leu Tyr Lys Val Leu Gln IL
85 90 95

Ala His Leu Lys Gly Lys Ala Arg Gln Lys Ile Leu Lys Gln Pro Leu
100 105 110

Pro Asp Asn Ser Gln Glu Val Ala Thr Glu Asp His Asn Tyr Ser Leu
115 120 125

Lys Gly Pro Leu Thr Ile Gly Ala Glu Lys Leu Ala Glu Val Gln Gln
130 135 140

Met Leu Gln Val Ser
145






<210> 115 
<211> 43 
<212> PRT 

<213> Mus musculus 
<400> 115 

Val Leu Glu Asp Val Ala Ala Ala Glu Gln Gly Leu Arg Glu Leu Gln
1 5 10 15

Arg Gly Arg Arg Gln Cys Arg Glu Arg Val Cys Ala Leu Arg Ala Ala
20 25 30

Ala Glu Gln Arg Glu Ala Arg Cys Arg Asp Gly
35 40



<210> 116 
<211> 45 
<212> PRT 

<213> Mus musculus 
<400> 116 

Gln Leu Glu Gln Gln Val Glu Lys Leu Arg Lys Lys Leu Lys Thr Ala
1 5 10 15

Gln Gln Arg Cys Arg Arg Gln Glu Arg Gln Leu Glu Lys Leu Lys Glu
20 25 30

Val Val His Phe Gln Arg Glu Lys Asp Asp Ala Ser Glu
35 40 45



<210> 117 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<400> 117 

Gln Leu Glu Gln Gln Val Glu Lys Leu Arg Lys Lys Leu Lys Thr Ala
1 5 10 15

Gln Gln Arg Cys Arg Arg Gln Glu Arg Gln Leu Glu Lys Leu Lys Glu
20 25 30

Val Val His Phe Gln Lys Glu Lys Asp Asp Val Ser Glu
35 40 45



<210> 118 
<211> 342 
<212> PRT 

<213> Homo sapiens 
<400> 118 

Met Ala Thr Gly Gly Tyr Arg Thr Ser Ser Gly Leu Gly Gly Ser Thr
1 5 10 15

Thr Asp Phe Leu Glu Glu Trp Lys Ala Lys Arg Glu Lys Met Arg Ala
20 25 30

Lys Gln Asn Pro Pro Gly Pro Ala Pro Pro Gly Gly Gly Ser Ser Asp
35 40 45

Ala Ala Gly Lys Pro Pro Ala Gly Ala Leu Gly Thr Pro Ala Ala Ala
50 55 60

Ala Ala Asn Glu Leu Asn Asn Asn Leu Pro Gly Gly Ala Pro Ala Ala
65 70 75 80

Pro Ala Val Pro Gly Pro Gly Gly Val Asn Cys Ala Val Gly Ser Ala
85 90 95

Met Leu Thr Arg Ala Pro Pro Ala Arg Gly Pro Arg Arg Ser Glu Asp
100 105 110

Glu Pro Pro Ala Ala Ser Ala Ser Ala Ala Pro Pro Pro Gln Arg Asp
115 120 125

Glu Glu Glu Pro Asp Gly Val Pro Glu Lys Gly Lys Ser Ser Gly Pro
130 135 140

Ser Ala Arg Lys Gly Lys Gly Gln Ile Glu Lys Arg Lys Leu Arg Glu
145 150 155 160

Lys Arg Arg Ser Thr Gly Val Val Asn Ile Pro Ala Ala Glu Cys Leu
165 170 175

Asp Glu Tyr Glu Asp Asp Glu Ala Gly Gln Lys Glu Arg Lys Arg Glu
180 185 190

Asp Ala Ile Thr Gln Gln Asn Thr Ile Gln Asn Glu Ala Val Asn Leu
195 200 205

Leu Asp Pro Gly Ser Ser Tyr Leu Leu Gln Glu Pro Pro Arg Thr Val
210 215 220

Ser Gly Arg Tyr Lys Ser Thr Thr Ser Val Ser Glu Glu Asp Val Ser
225 230 235 240

Ser Arg Tyr Ser Arg Thr Asp Arg Ser Gly Phe Pro Arg Tyr Asn Arg
245 250 255

Asp Ala Asn Val Ser Gly Thr Leu Val Ser Ser Ser Thr Leu Glu Lys
260 265 270

Lys Ile Glu Asp Leu Glu Lys Glu Val Val Thr Glu Arg Gln Glu Asn
275 280 285

Leu Arg Leu Val Arg Leu Met Gln Asp Lys Glu Glu Met Ile Gly Lys
290 295 300

Leu Lys Glu Glu Ile Asp Leu Leu Asn Arg Asp Leu Asp Asp Ile Glu
305 310 315 320

Asp Glu Asn Glu Gln Leu Lys Gln Glu Asn Lys Thr Leu Leu Lys Val
325 330 335

Val Gly Gln Leu Thr Arg
340



<210> 119 
<211> 134 
<212> PRT 

<213> Homo sapiens 
<400> 119 

Met Ala Gln Ser Leu Ala Leu Ser Leu Leu Ile Leu Val Leu Ala Phe
1 5 10 15

Gly Ile Pro Arg Thr Gln Gly Ser Asp Gly Gly Ala Gln Asp Cys Cys
20 25 30

Leu Lys Tyr Ser Gln Arg Lys Ile Pro Ala Lys Val Val Arg Ser Tyr
35 40 45

Arg Lys Gln Glu Pro Ser Leu Gly Cys Ser Ile Pro Ala Ile Leu Phe
50 55 60

Leu Pro Arg Lys Arg Ser Gln Ala Glu Leu Cys Ala Asp Pro Lys Glu
65 70 75 80

Leu Trp Val Gln Gln Leu Met Gln His Leu Asp Lys Thr Pro Ser Pro
85 90 95

Gln Lys Pro Ala Gln Gly Cys Arg Lys Asp Arg Gly Ala Ser Lys Thr
100 105 110

Gly Lys Lys Gly Lys Gly Ser Lys Gly Cys Lys Arg Thr Glu Arg Ser
115 120 125

Gln Thr Pro Lys Gly Pro
130
<210> 120
<211> 766
<212> PRT

<213> Drosophila melanogaster
<400> 120

Met Lys Tyr Cys Lys Phe Cys Cys Lys Ala Val Thr Gly Val Lys Leu
1 5 10 15

Ile His Val Pro Lys Cys Ala Ile Lys Arg Lys Leu Trp Glu Gln Ser
20 25 30

Leu Gly Cys Ser Leu Gly Glu Asn Ser Gln Ile Cys Asp Thr His Phe
35 40 45

Asn Asp Ser Gln Trp Lys Ala Ala Pro Ala Lys Gly Gln Thr Phe Lys
50 55 60

Arg Arg Arg Leu Asn Ala Asp Ala Val Pro Ser Lys Val Ile Glu Pro
65 70 75 80

Glu Pro Glu Lys Ile Lys Glu Gly Tyr Thr Ser Gly Ser Thr Gln Thr
85 90 95

Glu Ser Cys Ser Leu Phe Asn Glu Asn Lys Ser Leu Arg Glu Lys Ile
100 105 110

Arg Thr Leu Glu Tyr Glu Met Arg Arg Leu Glu Gln Gln Leu Arg Glu
115 120 125

Ser Gln Gln Leu Glu Glu Ser Leu Arg Lys Ile Phe Thr Asp Thr Gln
130 135 140

Ile Arg Ile Leu Lys Asn Gly Gly Gln Arg Ala Thr Phe Asn Ser Asp
145 150 155 160

Asp Ile Ser Thr Ala Ile Cys Leu His Thr Ala Gly Pro Arg Ala Tyr
165 170 175

Asn His Leu Tyr Lys Lys Gly Phe Pro Leu Pro Ser Arg Thr Thr Leu
180 185 190

Tyr Arg Trp Leu Ser Asp Val Asp Ile Lys Arg Gly Cys Leu Asp Val
195 200 205

Val Ile Asp Leu Met Asp Ser Asp Gly Val Asp Asp Ala Asp Lys Leu
210 215 220

Cys Val Leu Ala Phe Asp Glu Met Lys Val Ala Ala Ala Phe Glu Tyr
225 230 235 240

Asp Ser Ser Ala Asp Ile Val Tyr Glu Pro Ser Asp Tyr Val Gln Leu
245 250 255

Ala Ile Val Arg Gly Leu Lys Lys Ser Trp Lys Gln Pro Val Phe Phe
260 265 270

Asp Phe Asn Thr Arg Met Asp Pro Asp Thr Leu Asn Asn Ile Leu Arg
275 280 285

Lys Leu His Arg Lys Gly Tyr Leu Val Val Ala Ile Val Ser Asp Leu
290 295 300

Gly Thr Gly Asn Gln Lys Leu Trp Thr Glu Leu Gly Ile Ser Glu Ser
305 310 315 320

Lys Thr Trp Phe Ser His Pro Ala Asp Asp His Leu Lys Ile Phe Val
325 330 335

Phe Ser Asp Thr Pro His Leu Ile Lys Leu Val Arg Asn His Tyr Val
340 345 350

Asp Ser Gly Leu Thr Ile Asn Gly Lys Lys Leu Thr Lys Lys Thr Ile
355 360 365

Gln Glu Ala Leu His Leu Cys Asn Lys Ser Asp Leu Ser Ile Leu Phe
370 375 380

Lys Ile Asn Glu Asn His Ile Asn Val Arg Ser Leu Ala Lys Gln Lys
385 390 395 400

Val Lys Leu Ala Thr Gln Leu Phe Ser Asn Thr Thr Ala Ser Ser Ile
405 410 415

Arg Arg Cys Tyr Ser Leu Gly Tyr Asp Ile Glu Asn Ala Thr Glu Thr
420 425 430

Ala Asp Phe Phe Lys Leu Met Asn Asp Trp Phe Asp Ile Phe Asn Ser
435 440 445

Lys Leu Ser Thr Ser Asn Cys Ile Glu Cys Ser Gln Pro Tyr Gly Lys
450 455 460

Gln Leu Asp Ile Gln Asn Asp Ile Leu Asn Arg Met Ser Glu Ile Met
465 470 475 480

Arg Thr Gly Ile Leu Asp Lys Pro Lys Arg Leu Pro Phe Gln Lys Gly
485 490 495

Ile Ile Val Asn Asn Ala Ser Leu Asp Gly Leu Tyr Lys Tyr Leu Gln
500 505 510

Glu Asn Phe Ser Met Gln Tyr Ile Leu Thr Ser Arg Leu Asn Gln Asp
515 520 525

Ile Val Glu His Phe Phe Gly Ser Met Arg Ser Arg Gly Gly Gln Phe
530 535 540

Asp His Pro Thr Pro Leu Gln Phe Lys Tyr Arg Leu Arg Lys Tyr Ile
545 550 555 560

Ile Ala Arg Asn Thr Glu Met Leu Arg Asn Ser Gly Asn Ile Glu Glu
565 570 575

Gly Met Thr Asn Leu Lys Glu Cys Val Asn Lys Asn Val Ile Pro Asp
580 585 590

Asn Ser Glu Ser Trp Leu Asn Leu Asp Phe Ser Ser Lys Glu Asn Glu
595 600 605

Asn Lys Ser Lys Asp Asp Glu Pro Val Asp Asp Glu Pro Val Asp Glu
610 615 620

Met Leu Ser Asn Ile Asp Phe Thr Glu Met Asp Glu Leu Thr Glu Asp
625 630 635 640

Ala Met Glu Tyr Ile Ala Gly Tyr Val Ile Lys Lys Leu Arg Ile Ser
645 650 655

Asp Lys Val Lys Glu Asn Leu Thr Phe Thr Tyr Val Asp Glu Val Ser
660 665 670

His Gly Gly Leu Ile Lys Pro Ser Glu Lys Phe Gln Glu Lys Leu Lys
675 680 685

Glu Leu Glu Cys Ile Phe Leu His Tyr Thr Asn Asn Asn Asn Phe Glu
690 695 700

Ile Thr Asn Asn Val Lys Glu Lys Leu Ile Leu Ala Ala Arg Asn Val
705 710 715 720

Asp Val Asp Lys Gln Val Lys Ser Phe Tyr Phe Lys Ile Arg Ile Tyr
725 730 735

Phe Arg Ile Lys Tyr Phe Asn Lys Lys Ile Glu Ile Lys Asn Gln Lys
740 745 750

Gln Lys Leu Ile Gly Asn Ser Lys Leu Leu Lys Ile Lys Leu
755 760 765

<210> 121 
<211> 103 
<212> PRT 

<213> Homo sapiens 
<400> 121 

Asp Glu Leu Cys Val Val Cys Gly Asp Lys Ala Thr Gly Tyr His Tyr
1 5 10 15

Arg Cys Ile Thr Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Thr Ile
20 25 30

Gln Lys Asn Leu His Pro Ser Tyr Ser Cys Lys Tyr Glu Gly Lys Cys
35 40 45

Val Ile Asp Lys Val Thr Arg Asn Gln Cys Gln Glu Cys Arg Phe Lys
50 55 60

Lys Cys Ile Tyr Val Gly Met Ala Thr Asp Leu Val Leu Asp Asp Ser
65 70 75 80

Lys Arg Leu Ala Lys Arg Lys Leu Ile Glu Glu Asn Arg Glu Lys Arg
85 90 95

Arg Arg Glu Glu Leu Glu Lys
100



<210> 122 
<211> 81 
<212> PRT 

<213> Homo sapiens 



<400> 122 



Met Lys Pro Ala Arg Pro Cys Leu Val Cys Ser Asp Glu Ala Ser Gly
1 5 10 15

Cys His Tyr Gly Val Leu Thr Cys Gly Ser Cys Lys Val Phe Phe Lys
20 25 30

Arg Ala Val Glu Gly Gln His Asn Tyr Leu Cys Ala Gly Arg Asn Asp
35 40 45

Cys Ile Ile Asp Lys Ile Arg Arg Lys Asn Cys Pro Ala Cys Arg Tyr
50 55 60

Arg Lys Cys Leu Gln Ala Gly Met Asn Leu Glu Ala Arg Lys Thr Lys
65 70 75 80

Lys



<210> 123
<211> 89
<212> PRT
<213> Homo sapiens
<400> 123

Met Val Gln Ser Cys Ser Ala Tyr Gly Cys Lys Asn Arg Tyr Asp Lys
1 5 10 15

Asp Lys Pro Val Ser Phe His Lys Phe Pro Leu Thr Arg Pro Ser Leu
20 25 30

Cys Lys Glu Trp Glu Ala Ala Val Arg Arg Lys Asn Phe Lys Pro Thr
35 40 45

Lys Tyr Ser Ser Ile Cys Ser Glu His Phe Thr Pro Asp Cys Phe Lys
50 55 60

Arg Glu Cys Asn Asn Lys Leu Leu Lys Glu Asn Ala Val Pro Thr Ile
65 70 75 80

Phe Leu Cys Thr Glu Pro His Asp Lys
85



<210> 124 
<211> 85 
<212> PRT 

<213> Drosophila melanogaster 
<400> 124 



Met Lys Tyr Cys Lys Phe Cys Cys Lys Ala Val Thr Gly Val Lys Leu
1 5 10 15

Ile His Val Pro Lys Cys Ala Ile Lys Arg Lys Leu Trp Glu Gln Ser
20 25 30

Leu Gly Cys Ser Leu Gly Glu Asn Ser Gln Ile Cys Asp Thr His Phe
35 40 45

Asn Asp Ser Gln Trp Lys Ala Ala Pro Ala Lys Gly Gln Thr Phe Lys
50 55 60

Arg Arg Arg Leu Asn Ala Asp Ala Val Pro Ser Lys Val Ile Glu Pro
65 70 75 80

Glu Pro Glu Lys Ile
85



<210> 125 
<211> 58 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> THAP Domain consensus 



<221> UNSURE 

<222> 2-3, 7, 9, 13-17, 19, 21-23, 25-26, 28, 35, 38-39, 41, 45-50, 
52, 55-56 

<223> Xaa = any of the twenty amino acids
<400> 125

Met Val Xaa Xaa Cys Ser Xaa Tyr Xaa Cys Lys Asn Xaa Xaa Xaa Xaa
1 5 10 15

Xaa Lys Xaa Val Xaa Xaa Xaa Lys Xaa Xaa Leu Xaa Arg Pro Ser Leu
20 25 30

Cys Lys Xaa Trp Glu Xaa Xaa Val Xaa Arg Lys Asn Xaa Xaa Xaa Xaa
35 40 45

Xaa Xaa Ser Xaa Ile Cys Xaa Xaa His Phe
50 55



<210> 126 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<400> 126 

Met Val Gln Ser Cys Ser Ala Tyr Gly Cys Lys Asn Arg Tyr Asp Lys
1 5 10 15

Asp Lys Pro Val Ser Phe His Lys Phe Pro Leu Thr Arg Pro Ser Leu
20 25 30

Cys Lys Glu Trp Glu Ala Ala Val Arg Arg Lys Asn Phe Lys Pro Thr
35 40 45

Lys Tyr Ser Ser Ile Cys Ser Glu His Phe Thr Pro Asp Cys Phe Lys
50 55 60

Arg Glu Cys Asn Asn Lys Leu Leu Lys Glu Asn Ala Val Pro Thr Ile
65 70 75 80

Phe Leu Cys Thr Glu Pro His Asp Lys
85



<210> 127 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<400> 127 

Met Pro Lys Ser Cys Ala Ala Arg Gln Cys Cys Asn Arg Tyr Ser Ser
1 5 10 15

Arg Arg Lys Gln Leu Thr Phe His Arg Phe Pro Phe Ser Arg Pro Glu
20 25 30

Leu Leu Lys Glu Trp Val Leu Asn Ile Gly Arg Gly Asn Phe Lys Pro
35 40 45

Lys Gln His Thr Val Ile Cys Ser Glu His Phe Arg Pro Glu Cys Phe
50 55 60

Ser Ala Phe Gly Asn Arg Lys Asn Leu Lys His Asn Ala Val Pro Thr
65 70 75 80

Val Phe Ala Phe Gln Asp Pro Thr Gln
85



<210> 128 
<211> 90 
<212> PRT 

<213> Homo sapiens 



<400> 128 

Met Pro Arg Tyr Cys Ala Ala Ile Cys Cys Lys Asn Arg Arg Gly Arg
1 5 10 15

Asn Asn Lys Asp Arg Lys Leu Ser Phe Tyr Pro Phe Pro Leu His Asp
20 25 30

Lys Glu Arg Leu Glu Lys Trp Leu Lys Asn Met Lys Arg Asp Ser Trp
35 40 45

Val Pro Ser Lys Tyr Gln Phe Leu Cys Ser Asp His Phe Thr Pro Asp
50 55 60

Ser Leu Asp Ile Arg Trp Gly Ile Arg Tyr Leu Lys Gln Thr Ala Val
65 70 75 80

Pro Thr Ile Phe Ser Leu Pro Glu Asp Asn
85 90



<210> 129 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<400> 129 












Met Pro Lys Tyr Cys Arg Ala Pro Asn Cys Ser Asn Thr Ala Gly Arg
1 5 10 15

Leu Gly Ala Asp Asn Arg Pro Val Ser Phe Tyr Lys Phe Pro Leu Lys
20 25 30

Asp Gly Pro Arg Leu Gln Ala Trp Leu Gln His Met Gly Cys Glu His
35 40 45

Trp Val Pro Ser Cys His Gln His Leu Cys Ser Glu His Phe Thr Pro
50 55 60

Ser Cys Phe Gln Trp Arg Trp Gly Val Arg Tyr Leu Arg Pro Asp Ala
65 70 75 80

Val Pro Ser Ile Phe Ser Arg Gly Pro Pro Ala Lys
85 90



<210> 130 
<211> 90 
<212> PRT 

<213> Homo sapiens 



<400> 130 




























Met Val Ile Cys Cys Ala Ala Val Asn Cys Ser Asn Arg Gln Gly Lys
1 5 10 15

Gly Glu Lys Arg Ala Val Ser Phe His Arg Phe Pro Leu Lys Asp Ser
20 25 30

Lys Arg Leu Ile Gln Trp Leu Lys Ala Val Gln Arg Asp Asn Trp Thr
35 40 45

Pro Thr Lys Tyr Ser Phe Leu Cys Ser Glu His Phe Thr Lys Asp Ser
50 55 60

Phe Ser Lys Arg Leu Glu Asp Gln His Arg Leu Leu Lys Pro Thr Ala
65 70 75 80

Val Pro Ser Ile Phe His Leu Thr Glu Lys
85 90



<210> 131 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<400> 131 

Met Pro Thr Asn Cys Ala Ala Ala Gly Cys Ala Thr Thr Tyr Asn Lys
1 5 10 15

His Ile Asn Ile Ser Phe His Arg Phe Pro Leu Asp Pro Lys Arg Arg
20 25 30

Lys Glu Trp Val Arg Leu Val Arg Arg Lys Asn Phe Val Pro Gly Lys
35 40 45

His Thr Phe Leu Cys Ser Lys His Phe Glu Ala Ser Cys Phe Asp Leu
50 55 60

Thr Gly Gln Thr Arg Arg Leu Lys Met Asp Ala Val Pro Thr Ile Phe
65 70 75 80

Asp Phe Cys Thr His Ile Lys Ser Met
85



<210> 132 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<400> 132 

Met Pro Asn Phe Cys Ala Ala Pro Asn Cys Thr Arg Lys Ser Thr Gln
1 5 10 15

Ser Asp Leu Ala Phe Phe Arg Phe Pro Arg Asp Pro Ala Arg Cys Gln
20 25 30

Lys Trp Val Glu Asn Cys Arg Arg Ala Asp Leu Glu Asp Lys Thr Pro
35 40 45

Asp Gln Leu Asn Lys His Tyr Arg Leu Cys Ala Lys His Phe Glu Thr
50 55 60

Ser Met Ile Cys Arg Thr Ser Pro Tyr Arg Thr Val Leu Arg Asp Asn
65 70 75 80

Ala Ile Pro Thr Ile Phe Asp Leu Thr Ser
85 90



<210> 133 
<211> 97 
<212> PRT 

<213> Homo sapiens 



<400> 133 
Tyr His Arg Leu Lys Glu Gly Ala Val Pro Thr Ile Phe Glu Ser Phe
85 90 95 

Ser 



<210> 134 
<211> 92 
<212> PRT 

<213> Homo sapiens 
<400> 134 

Met Thr Arg Ser Cys Ser Ala Val Gly Cys Ser Thr Arg Asp Thr Val
1 5 10 15

Leu Ser Arg Glu Arg Gly Leu Ser Phe His Gln Phe Pro Thr Asp Thr
20 25 30

Ile Gln Arg Ser Lys Trp Ile Arg Ala Val Asn Arg Val Asp Pro Arg
35 40 45

Ser Lys Lys Ile Trp Ile Pro Gly Pro Gly Ala Ile Leu Cys Ser Lys
50 55 60

His Phe Gln Glu Ser Asp Phe Glu Ser Tyr Gly Ile Arg Arg Lys Leu
65 70 75 80

Lys Lys Gly Ala Val Pro Ser Val Ser Leu Tyr Lys
85 90



<210> 135 
<211> 96 
<212> PRT 

<213> Homo sapiens
<400> 135

Met Val Lys Cys Cys Ser Ala Ile Gly Cys Ala Ser Arg Cys Leu Pro
1 5 10 15

Asn Ser Lys Leu Lys Gly Leu Thr Phe His Val Phe Pro Thr Asp Glu
20 25 30

Asn Ile Lys Arg Lys Trp Val Leu Ala Met Lys Arg Leu Asp Val Asn
35 40 45

Ala Ala Gly Ile Trp Glu Pro Lys Lys Gly Asp Val Leu Cys Ser Arg
50 55 60

His Phe Lys Lys Thr Asp Phe Asp Arg Ser Ala Pro Asn Ile Lys Leu
65 70 75 80

Lys Pro Gly Val Ile Pro Ser Ile Phe Asp Ser Pro Tyr His Leu Gln
85 90 95



<210> 136 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<400> 136 

Met Pro Gly Phe Thr Cys Cys Val Pro Gly Cys Tyr Asn Asn Ser His
1 5 10 15

Arg Asp Lys Ala Leu His Phe Tyr Thr Phe Pro Lys Asp Ala Glu Leu
20 25 30

Arg Arg Leu Trp Leu Lys Asn Val Ser Arg Ala Gly Val Ser Gly Cys
35 40 45

Phe Ser Thr Phe Gln Pro Thr Thr Gly His Arg Leu Cys Ser Val His
50 55 60

Phe Gln Gly Gly Arg Lys Thr Tyr Thr Val Arg Val Pro Thr Ile Phe
65 70 75 80

Pro Leu Arg Gly Val Asn Glu Arg Lys Val
85 90

<210> 137 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<400> 137 

Met Pro Ala Arg Cys Val Ala Ala His Cys Gly Asn Thr Thr Lys Ser
1 5 10 15

Gly Lys Ser Leu Phe Arg Phe Pro Lys Asp Arg Ala Val Arg Leu Leu
20 25 30

Trp Asp Arg Phe Val Arg Gly Cys Arg Ala Asp Trp Tyr Gly Gly Asn
35 40 45

Asp Arg Ser Val Ile Cys Ser Asp His Phe Ala Pro Ala Cys Phe Asp
50 55 60

Val Ser Ser Val Ile Gln Lys Asn Leu Arg Phe Ser Gln Arg Leu Arg
65 70 75 80

Leu Val Ala Gly Ala Val Pro Thr Leu His
85 90

<210> 138 
<211> 85 
<212> PRT 

<213> Drosophila melanogaster 
<400> 138 

Met Lys Tyr Cys Lys Phe Cys Cys Lys Ala Val Thr Gly Val Lys Leu
1 5 10 15

Ile His Val Pro Lys Cys Ala Ile Lys Arg Lys Leu Trp Glu Gln Ser
20 25 30

Leu Gly Cys Ser Leu Gly Glu Asn Ser Gln Ile Cys Asp Thr His Phe
35 40 45

Asn Asp Ser Gln Trp Lys Ala Ala Pro Ala Lys Gly Gln Thr Phe Lys
50 55 60

Arg Arg Arg Leu Asn Ala Asp Ala Val Pro Ser Lys Val Ile Glu Pro
65 70 75 80

Glu Pro Glu Lys Ile
85

<210> 139 
<211> 63 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> THAP Domain consensus 
<221> UNSURE
<222> 4-5, 7, 9-10, 12, 15-20, 22, 24, 32, 35, 38-39, 42-43, 46-47,
49-51, 53-61, 63

<223> Xaa = any of the twenty amino acids
<400> 139

Met Pro Lys Xaa Xaa Cys Xaa Ala Xaa Xaa Cys Xaa Asn Arg Xaa Xaa
1 5 10 15

Xaa Xaa Xaa Xaa Lys Xaa Lys Xaa Val Ser Phe His Lys Phe Pro Xaa
20 25 30

His Asp Xaa His Asp Xaa Xaa Arg Arg Xaa Xaa Trp Val Xaa Xaa Val
35 40 45

Xaa Xaa Xaa Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp Xaa
50 55 60



<210> 140 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-related sequence 
<400> 140 

gggcatacta ctggcaa 17 

<210> 141 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-related sequence 

<400> 141 

gggcaaactg tgggcat 17 

<210> 142 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-related sequence
<400> 142 

gggcatacta ctggcaa 17 

<210> 143 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-related sequence 
<400> 143 

gggcaaacta ctggcaa 17 

<210> 144 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-related sequence 
<400> 144 
gggccagttc gttgcaa 17

<210> 145 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-related sequence 

<400> 145 
gggcatgtac tggcaa 16

<210> 146 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-related sequence 

<400> 146 
gggcaactgt gggcaa 16

<210> 147 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-related sequence 
<400> 147 

gggcaacact actggcaa 18

<210> 148 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-related sequence 
<400> 148 

gggcaaagta ctggcaa 17

<210> 149 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5 consensus sequence 

<221> unsure 
<222> 7-11 

<223> n = any of the four nucleotides 
<400> 149 

gggcaannnn ntggcaa 17 
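
The consensus of SEQ ID NO: 149 uses `n` for any of the four nucleotides at positions 7-11. The following minimal Python sketch is an illustration added for readers, not part of the listing itself; the function names `consensus_to_regex` and `matches_consensus` are hypothetical. Under the assumption that a candidate fits the consensus only when every fixed position is identical, it shows one way to test a sequence against such a pattern.

```python
import re


def consensus_to_regex(consensus: str) -> re.Pattern:
    """Compile a consensus string into a regex; 'n' matches a, c, g or t."""
    parts = []
    for base in "".join(consensus.lower().split()):
        if base == "n":
            parts.append("[acgt]")
        elif base in "acgt":
            parts.append(base)
        else:
            raise ValueError(f"unexpected symbol in consensus: {base!r}")
    return re.compile("".join(parts))


def matches_consensus(sequence: str, consensus: str) -> bool:
    """Return True if the whole sequence fits the consensus exactly."""
    cleaned = "".join(sequence.lower().split())
    return consensus_to_regex(consensus).fullmatch(cleaned) is not None


DR5_CONSENSUS = "gggcaannnn ntggcaa"  # SEQ ID NO: 149

print(matches_consensus("gggcaaacta ctggcaa", DR5_CONSENSUS))  # SEQ ID NO: 143
print(matches_consensus("gggcatacta ctggcaa", DR5_CONSENSUS))  # SEQ ID NO: 140
```

With this strict rule, SEQ ID NO: 143 fits the consensus while SEQ ID NO: 140 does not, because it differs at position 6; a tolerance for mismatches would need a different comparison.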

<210> 150
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11-related sequence
<400> 150

ttgccagtac taagtgtggg caa 23

<210> 151 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11-related sequence
<400> 151

ctgccagtac atagtgtggg caa 23

<210> 152 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11-related sequence

<400> 152

ttgccagtac taagtgtggg caa 23

<210> 153 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11-related sequence
<400> 153

ctgccagtag atactgtggg caa 23

<210> 154 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11-related sequence
<400> 154

ttgccagtag ttaggtgtgg gcga 24

<210> 155 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11-related sequence
<400> 155

ttgccagtag ttagtgtggg caa 23

<210> 156 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11-related sequence
<400> 156

ttgccagtac ctactaaggg caa 23

<210> 157 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11-related sequence
<400> 157

ttgccagtag ttagtgtggg cag 23

<210> 158 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11-related sequence
<400> 158

ctgccagtag taagtgtggg cag 23

<210> 159 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ER-11 consensus sequence 

<221> unsure 
<222> 7-17 

<223> n = any of the four nucleotides 
<400> 159 

ttgccannnn nnnnnnnggg caa 23

<210> 160 
<211> 642 
<212> DNA 

<213> Homo sapiens 
<400> 160 






atggtgcagt cctgctccgc ctacggctgc aagaaccgct acgacaagga caagcccgtt 60 
tctttccaca agtttcctct tactcgaccc agtctttgta aagaatggga ggcagctgtc 120 
agaagaaaaa actttaaacc caccaagtat agcagtattt gttcagagca ctttactcca 180
gactgcttta agagagagtg caacaacaag ttactgaaag agaatgctgt gcccacaata 240
tttctttgta ctgagccaca tgacaagaaa gaagatcttc tggagccaca ggaacagctt 300
cccccacctc ctttaccgcc tcctgtttcc caggttgatg ctgctattgg attactaatg 360
ccgcctcttc agacccctgt taatctctca gttttctgtg accacaacta tactgtggag 420
gatacaatgc accagcggaa aaggattcat cagctagaac agcaagttga aaaactcaga 480
aagaagctca agaccgcaca gcagcgatgc agaaggcaag aacggcagct tgaaaaatta 540
aaggaggttg ttcacttcca gaaagagaaa gacgacgtat cagaaagagg ttatgtgatt 600
ctaccaaatg actactttga aatagttgaa gtaccagcat aa 642



<210> 161 
<211> 687 
<212> DNA 
<213> Homo sapiens 



<400> 161 

atgccgacca attgcgctgc ggcgggctgt gccactacct acaacaagca cattaacatc 60
agcttccaca ggtttccttt ggatcctaaa agaagaaaag aatgggttcg cctggttagg 120
cgcaaaaatt ttgtgccagg aaaacacact tttctttgtt caaagcactt tgaagcctcc 180
tgttttgacc taacaggaca aactcgacga cttaaaatgg atgctgttcc aaccattttt 240
gatttttgta cccatataaa gtctatgaaa ctcaagtcaa ggaatctttt gaagaaaaac 300
aacagttgtt ctccagctgg accatctaat ttaaaatcaa acattagtag tcagcaagta 360
ctacttgaac acagctatgc ctttaggaat cctatggagg caaaaaagag gatcattaaa 420
ctggaaaaag aaatagcaag cttaagaaga aaaatgaaaa cttgcctaca aaaggaacgc 480
agagcaactc gaagatggat caaagccacg tgtttggtaa agaatttaga agcaaatagt 540
gtattaccta aaggtacatc agaacacatg ttaccaactg ccttaagcag tcttcctttg 600
gaagatttta agatccttga acaagatcaa caagataaaa cactgctaag tctaaatcta 660
aaacagacca agagtacctt catttaa 687



<210> 162 
<211> 720 
<212> DNA 
<213> Homo sapiens 



<400> 162 

atgccgaagt cgtgcgcggc ccggcagtgc tgcaaccgct acagcagccg caggaagcag 60
ctcaccttcc accggtttcc gttcagccgc ccggagctgc tgaaggaatg ggtgctgaac 120
atcggccggg gcaacttcaa gcccaagcag cacacggtca tctgctccga gcacttccgg 180
ccagagtgct tcagcgcctt tggaaaccgc aagaacctaa agcacaatgc cgtgcccacg 240
gtgttcgcct ttcaggaccc cacacagcag gtgagggaga acacagaccc tgccagtgag 300
agaggaaatg ccagctcttc tcagaaagaa aaggtcctcc ctgaggcggg ggccggagag 360
gacagtcctg ggagaaacat ggacactgca cttgaagagc ttcagttgcc cccaaatgcc 420
gaaggccacg taaaacaggt ctcgccacgg aggccgcaag caacagaggc tgttggccgg 480
ccgactggcc ctgcaggcct gagaaggacc cccaacaagc agccatctga tcacagctat 540
gcccttttgg acttagattc cctgaagaaa aaactcttcc tcactctgaa ggaaaatgaa 600
aagctccgga agcgcttgca ggcccagagg ctggtgatgc gaaggatgtc cagccgcctc 660
cgtgcttgca aagggcacca gggactccag gccagacttg ggccagagca gcagagctga 720
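
As a consistency check between the nucleotide and amino acid entries of a listing such as this one, a coding sequence can be translated with the standard genetic code and compared with the corresponding protein entry. The sketch below is an illustration added for readers, not part of the listing; the helper name `translate` and the table construction are assumptions. It translates the first 30 nucleotides of SEQ ID NO: 162, which appear to correspond to the first ten residues of SEQ ID NO: 127 (Met Pro Lys Ser Cys Ala Ala Arg Gln Cys).

```python
# Standard genetic code, with codons ordered t, c, a, g at each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter amino acids.

    Trailing bases that do not fill a codon are ignored. Any symbol other
    than a, c, g, t (for example an OCR misread) raises KeyError.
    """
    seq = "".join(dna.lower().split())
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[pos:pos + 3]] for pos in range(0, usable, 3))


# First 30 nucleotides of SEQ ID NO: 162.
print(translate("atgccgaagt cgtgcgcggc ccggcagtgc"))  # MPKSCAARQC
```

Because the function raises on any character outside a, c, g and t, it also serves as a quick way to flag transcription errors in a nucleotide entry.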



<210> 163 

<211> 1734 

<212> DNA 

<213> Homo sapiens 

<400> 163 

atggtgatct gctgtgcggc cgtgaactgc tccaaccggc agggaaaggg cgagaagcgc 60
gccgtctcct tccacaggtt ccccctaaag gactcaaaac gtctaatcca atggttaaaa 120
gctgttcaga gggataactg gactcccact aagtattcat ttctctgtag tgagcatttc 180
accaaagaca gcttctccaa gaggctggag gaccagcatc gcctgctgaa gcccacggcc 240
gtgccatcca tcttccacct gaccgagaag aagagggggg ctggaggcca tggccgcacc 300
cggagaaaag atgccagcaa ggccacaggg ggtgtgaggg gacactcgag tgccgccacc 360
ggcagaggag ctgcaggttg gtcaccgtcc tcgagtggaa acccgatggc caagccagag 420
tcccgcaggt tgaagcaagc tgctctgcaa ggtgaagcca cacccagggc ggcccaggag 480
gccgccagcc aggagcaggc ccagcaagct ctggaacgga ctccaggaga tggactggcc 540
accatggtgg caggcagtca gggaaaagca gaagcgtctg ccacagatgc tggcgatgag 600 
agcgccactt cctccatcga agggggcgtg acagataaga gtggcatttc tatggatgac 660 
tttacgcccc caggatctgg ggcgtgcaaa tttatcggct cacttcattc gtacagtttc 720 
tcctctaagc acacccgaga aaggccatct gtcccccgag agcccattga ccgcaagagg 780 
ctgaagaaag atgtggaacc aagctgcagt gggagcagcc tgggacccga caagggcctg 840 
gcccagagcc ctcccagctc atcacttacc gcgacaccgc agaagccttc ccagagcccc 900 
tctgcccctc ctgccgacgt caccccaaag ccagccacgg aagccgtgca gagcgagcac 960 
agcgacgcca gccccatgtc catcaacgag gtcatcctgt cggcgtcagg ggcctgcaag 1020 
ctcatcgact cactgcactc ctactgcttc tcctcccggc agaacaagag ccaggtgtgc 1080 
tgcctgcggg agcaggtgga gaagaagaac ggcgagctga agagcctgcg gcagagggtc 1140 
agccgctccg acagccaggt gcggaagcta caggagaagc tggatgagct gaggagagtg 1200 
agcgtcccct atccaagtag cctgctgtcg cccagccgcg agccccccaa gatgaaccca 1260 
gtggtggagc cactgtcctg gatgctgggc acctggctgt cggacccacc tggagccggg 1320 
acctacccca cactgcagcc cttccagtac ctggaggagg ttcacatctc ccacgtgggc 1380 
cagcccatgc tgaacttctc gttcaactcc ttccacccgg acacgcgcaa gccgatgcac 1440 
agagagtgtg gcttcattcg cctcaagccc gacaccaaca aggtggcctt tgtcagcgcc 1500 
cagaacacag gcgtggtgga agtggaggag ggcgaggtga acgggcagga gctgtgcatc 1560 
gcatcccact ccatcgccag gatctccttc gccaaggagc cccacgtaga gcagatcacc 1620 
cggaagttca ggctgaattc tgaaggcaaa cttgagcaga cggtctccat ggcaaccacg 1680 
acacagccaa tgactcagca tcttcacgtc acctacaaga aggtgacccc gtaa 1734 

<210> 164 

<211> 1188 

<212> DNA 

<213> Homo sapiens 

<400> 164 

atgccccgct attgcgcagc gatttgttgt aagaaccgcc ggggacgaaa caataaagac 60 
cggaagctga gtttttatcc atttcctcta catgacaaag aaagactgga aaagtggtta 120 
aagaatatga agcgagattc atgggttccc agtaaatacc agtttctatg tagtgaccat 180 
tttactcctg actctcttga catcagatgg ggtattcgat atttaaaaca aactgcagtt 240 
ccaacaatat tttctttgcc tgaagacaat cagggaaaag acccttctaa aaaaaaatcc 300 
cagaagaaaa acttggaaga tgagaaagaa gtatgcccaa aagccaagtc agaagaatca 360 
tttgtattaa atgagacaaa gaaaaatata gttaacacag atgtgcccca tcaacatcca 420 
gaattacttc attcatcttc cttggtaaag ccaccagctc ccaaaacagg aagtatacaa 480 
aataacatgt taactcttaa tctagttaaa caacatactg ggaaaccaga atctaccttg 540 
gaaacatcag ttaaccaaga tacaggtaga ggtggttttc acacatgttt tgagaatcta 600 
aattctacaa ctattacttt gacaacttca aattcagaaa gtattcatca atctttggaa 660 
actcaagaag ttcttgaagt aactaccagt catcttgcta atccaaactt tacaagtaat 720 
tccatggaaa taaagtcagc acaggaaaat ccattcttat tcagcacaat taatcaaaca 780 
gttgaagaat taaacacaaa taaagaatct gttattgcca tttttgtacc tgctgaaaat 840 
tctaaaccct cagttaattc ttttatatct gcacaaaaag aaaccacgga aatggaagac 900 
acagacattg aagactcctt gtataaggat gtagactatg ggacagaagt tttacaaatc 960 
gaacattctt actgcagaca agatataaat aaggaacatc tttggcagaa agtctctaag 1020 
ctacattcaa agataactct tctagagtta aaagagcaac aaactctagg tagattgaag 1080 
tctttggaag ctcttataag gcagctaaag caggaaaact ggctatctga agaaaacgtc 1140 
aagattatag aaaaccattt tacaacatat gaagtcacta tgatatag 1188 

<210> 165 
<211> 669 
<212> DNA 
<213> Homo sapiens 

<400> 165 

atggtgaaat gctgctccgc cattggatgt gcttctcgct gcttgccaaa ttcgaagtta 60 

aaaggactga catttcacgt attccccaca gatgaaaaca tcaaaaggaa atgggtatta 120 

gcaatgaaaa gacttgatgt gaatgcagcc ggcatttggg agcctaaaaa aggagatgtg 180 

ttgtgttcga ggcactttaa gaagacagat tttgacagaa gtgctccaaa tattaaactg 240 
aaacctggag tcataccttc tatctttgat tctccatatc acctacaggg gaaaagagaa 300
aaacttcatt gtagaaaaaa cttcaccctc aaaaccgttc cagccactaa ctacaatcac 360
catcttgttg gtgcttcctc atgtattgaa gaattccaat cccagttcat ttttgaacat 420
agctacagtg taatggacag tccaaagaaa cttaagcata aattagatca tgtgatcggc 480
gagctagagg atacaaagga aagtctacgg aatgttttag accgagaaaa acgttttcag 540
aaatcattga ggaagacaat cagggaatta aaggatgaat gtctgatcag ccaagaaaca 600
gcaaatagac tggacacttt ctgttgggac tgttgtcagg agagcataga acaggactat 660
atttcatga 669



<210> 166 
<211> 930 
<212> DNA 

<213> Homo sapiens 
<400> 166 

atgccgcgtc actgctccgc cgccggctgc tgcacacggg acacgcgcga gacgcgcaac 60 

cgcggcatct ccttccacag acttcccaag aaggacaacc cgaggcgagg cttgtggctg 120 

gccaactgcc agcggctgga ccccagcggc cagggcctgt gggacccggc atccgagtac 180 

atctacttct gctccaaaca ctttgaggag gactgctttg agctggtggg aatcagtgga 240 

tatcacaggc taaaggaggg ggcagtcccc accatatttg agtctttctc caagttgcgc 300 

cggacaacca agaccaaagg acacagttac ccacctggcc cccctgaagt cagccggctc 360 

agacgatgca ggaagcgctg ctccgagggc cgagggccca caactccatt ttctccacct 420 

ccacctgctg atgtcacctg ctttcctgtg gaagaggcct cagcacctgc cactttgccg 480 

gcctccccag ctgggaggct ggagcctggc cttagcagcc ccttttcaga cctactgggc 540 

cccttgggtg cccaggcaga tgaagcaggc tgcagcgccc agccttcacc agagcggcag 600 

ccctcccctc tcgaaccacg gccagtctcc ccctcagcgt atatgctgcg cctgccccca 660 

cccgccggag cctacatcca gaatgaacac agctaccagg tgggcagcgc cttactctgg 720 

aagcggcgag ccgaggcagc ccttgatgcc cttgacaagg cccagcgcca gctgcaggcc 780 

tgcaagcggc gggagcagcg gctgcggttg agactgacca agctgcagca ggagcgggca 840

cgggagaagc gggcacaggc agatgcccgc cagactctga aggagcatgt gcaggacttt 900 
gccatgcagc tgagcagcag catggcctga 930 

<210> 167 
<211> 825 
<212> DNA 

<213> Homo sapiens 
<400> 167 

atgcccaagt actgcagggc gccgaactgc tccaacactg cgggccgcct gggtgcagac 60 

aaccgccctg tgagcttcta caagttccca ctgaaggatg gtccccggct gcaggcctgg 120 

ctgcagcaca tgggctgtga gcactgggtg cccagctgcc accagcactt gtgcagcgag 180 

cacttcacac cctcctgctt ccagtggcgc tggggtgtgc gctacctgcg gcctgatgca 240 

gtgccctcca tcttctcccg gggaccacct gccaagagtc agcggaggac ccgaagcacc 300 

cagaagccag tctcgccgcc gcctccccta cagaagaata cacccctgcc ccagagccct 360 

gccatcccag tctctggccc agtgcgccta gtggtgctgg gccccacatc ggggagcccc 420 

aagactgtgg ccaccatgct cctgaccccc ctggcccctg cgccaactcc tgagcggtca 480

caacctgaag tccctgccca acaggcccag accgggctgg gcccagtgct gggagcactg 540

caacgccggg tgcggaggct gcaacggtgc caggagcggc accaggcgca gctgcaggcc 600 

ctggaacggc tggcacagca gctacacggg gagagcctgc tggcacgggc acgccggggt 660 

ctgcagcgcc tgacaacagc ccagaccctt ggacctgagg aatcccaaac cttcaccatc 720 

atctgtggag ggcctgacat agccatggtc cttgcccagg accctgcacc tgccacagtg 780 

gatgccaagc cggagctcct ggacactcgg atccccagtg cataa 825 

<210> 168 

<211> 3171 

<212> DNA 

<213> Homo sapiens 

<400> 168 

atgacccgaa gttgctccgc agtgggctgc agcacccgtg acaccgtgct cagccgggag 60 

cgcggcctct ccttccacca atttccaact gataccatac agcgctcaaa atggatcagg 120 
gctgttaatc gtgtggaccc cagaagcaaa aagatttgga ttccaggacc aggtgctata 180
ctgtgttcca aacattttca agaaagtgac tttgagtcat atggcataag aagaaagctg 240 
aaaaaaggag ctgtgccttc tgtttctcta tacaagattc ctcaaggtgt acatcttaaa 300 
ggtaaagcaa gacaaaaaat cctaaaacaa cctcttccag acaattctca agaagttgct 360
actgaggacc ataactatag tttaaagaca cctttgacga taggtgcaga gaaactggct 420 
gaggtgcaac aaatgttaca agtgtccaaa aaaagactta tctccgtaaa gaactacagg 480 
atgatcaaga agagaaaggg tttacgatta attgatgcac ttgtagaaga gaaactactt 540 
tctgaagaaa cagagtgtct gctacgagct caattttcag attttaagtg ggagttatat 600 
aattggagag aaacagatga gtactccgca gaaatgaaac aatttgcatg tacactctac 660 
ttgtgcagta gcaaagtcta tgattatgta agaaagattc ttaagctgcc tcattcttcc 720 
! ^ a ?? a c 9tggttatc caaatgccaa cccagtccag gtttcaacag caacattttt 780 
tcttttcttc aacgaagagt agagaatgga gatcagctct atcaatactg ttcattgtta 840 
ataaaaagta tacctctcaa gcaacagctt cagtgggatc ctagcagtca cagtttgcag 900 
gggtttatgg actttggtct tggaaaactt gatgctgatg aaacgccact tgcttcagaa 960 
actgttttgt taatggcagt gggtattttt ggccattgga gaacacctct tggttatttt 1020 
tttgtaaaca gagcatctgg atatttgcag gctcagctgc ttcgtctgac tattggtaaa 1080 
ctgagtgaca taggaatcac agttctggct gttacatctg atgccacagc acatagtgtt 1140 
cagatggcaa aagcattggg gatacatatt gatggagacg acatgaaatg tacatttcag 1200 
catccttcat cttctagtca acagattgca tacttctttg actcttgcca cttgctaaga 1260 
ttaataagaa atgcatttca gaattttcaa agcattcagt ttattaatgg tatagcacat 1320
tggcagcacc tcgtggagtt agtagcactg gaggaacagg aattatcaaa tatggaaaga 1380 
tlt-ttttT^ CaC " gCaaa tttgaaaaat catgtactga aagtgaatag tgccacccaa 1440 
ctctttagtg agagtgtagc cagtgcatta gaatatttgt tatccttaga cctgccacct 1500 
tttcaaaact gtattggtac catccatttt ttacgtttaa ttaacaatct gtttgacatc 1560 
tttaatagta ggaactgtta tggaaaggga cttaaagggc ctctgttgcc tgaaacttac 1620 
agtaaaataa accacgtgtt aattgaagcc aagactattt ttgttacatt atctgacact 1680 
agcaataatc aaataattaa aggtaagcaa aaactaggat tcctgggatt tttgctcaat 1740 
gctgagagct taaaatggct ctaccaaaat tatgttttcc caaaggtcat gccttttcct 1800 
^ C "^ 9a f tacaaatt cagtcatgat catctggaat tatttctaaa gatgcttagg 1860 
caggtattag taacaagttc tagccctacc tgcatggcat tccagaaagc ttactataat 1920 
ttggagacca gatacaaatt tcaagatgaa gtttttctaa gcaaagtaag catctttgac 1980 
atttcaattg ctcgaaggaa agacttggcg ctttggacag ttcaacgtca gtatggtgtc 2040 
agcgttacaa agactgtctt tcacgaagag ggtatttgtc aagactggtc tcattgttca 2100 
ctaagtgagg cattactaga cctgtcagat cataggcgaa atctcatctg ttatgctggt 2160 
tatgttgcaa acaagttatc agctctttta acttgtgagg actgcatcac tgcactgtat 2220 
??tlZ??tt C t " 3agCCtC taaaattggg tcactattat ttgttaaaaa gaagaatggt 2280 
ttgcattttc cttcagaaag tctgtgtcgg gtcataaata tttgtgagcg agttgtaaga 2340 
,T Cat J S t ga f ggcaat ttttgaacta gtttctaaac aaagggaatt gtatcttcaa 2400 
cagaaaatat tatgtgagct ttctgggcat attgatcttt ttgtagatgt gaataagcat 2460 
££tt?t ? 9agaagtgtg tgccatcaat cactttgtca agttgctaaa ggatataata 2520 
atctgtttct taaatatcag agctaaaaat gttgcacaga atcctttaaa acatcattca 2580 
gagagaactg atatgaaaac tttatcaagg aaacactggt cacctgtaca ggattataaa 2640 
tgttcaagtt ttgctaatac cagtagtaaa ttcaggcatt tgctaagtaa cgatggatat 2700 
tt£ia!!£ gagaga = cta aaatatatta acattttaat taagaatact tgatcaacat 2760 
""^ aagt ^ aatttacc atattttata aattgcgcat tctgcacagt ggacaagttt 2820 
gcaattctga cttattaaaa tttcaaattc tgcatatcac aaaatctcct tatacttttg 2880 
gtatggcttg cagcatttat gagttttcca aaatatagaa agcagtaggt cagtaggagc 2940 
aaactagcca acaggtactg tctttgaatt tactactgta agactaagca gtgtta^tgg 3000 
™T ,1 aacttgttca atctgcttca aaaacaagaa aaacaacaac Ltgagtta? 3060 
C C ^ 9 actccattta tgactagact acatttctga aagatctttg gtttacgatt 3120 

cttaagaata ttgacaatac ctataaaact ttgaagataa cttttactta a 3171 



<210> 169 
<211> 774 
<212> DNA 

<213> Homo sapiens 



<400> 169 

atgccggccc gttgtgtggc cgcccactgc ggcaacacca ccaagtctgg gaagtcgctg 60 

ttccgctttc ccaaggaccg ggccgtgcgg ctgctctggg accgcttcgt gcggggttgc 120 

cgcgccgact ggtacggagg caatgaccgc tcggtcatct gctctgacca ctttgcccca 180 






gcctgttttg acgtctcttc ggttatccag aagaacctgc gcttctccca gcgcctgagg 240
ctggtggcag gcgccgtgcc caccctgcac cgggtgcccg ccccggcacc taagagggga 300
gaggagggag accaagcagg ccgcctggac acgcgaggag agctccaggc agccaggcat 360
tctgaggctg ccccaggtcc agtctcctgt acacgccccc gagctgggaa gcaggctgca 420
gcttcacaga ttacgtgtga aaatgaactt gtgcaaaccc aaccccatgc tgataatcca 480
tctaatactg tcacttcagt acctactcac tgtgaagaag gcccagtgca taaaagtaca 540
caaatttctt tgaaaaggcc ccgtcaccgt agtgtgggta ttcaagccaa agtgaaagcg 600
tttggaaaaa gactgtgtaa tgcaactact cagacagagg aattgtggtc tagaacttcc 660
tctctctttg acatttactc cagtgattca gaaacagata cagactggga tatcaagagt 720
gaacagagtg atttgtctta tatggctgta caggtgaaag aagaaacatg ttaa 774

<210> 170
<211> 945
<212> DNA

<213> Homo sapiens
<400> 170

atgcctggct ttacgtgctg cgtgccaggc tgctacaaca actcgcaccg ggacaaggcg 60
ctgcacttct acacgtttcc aaaggacgct gagttgcggc gcctctggct caagaacgtg 120
tcgcgtgccg gcgtcagtgg gtgcttctcc accttccagc ccaccacagg ccaccgtctc 180
tgcagcgttc acttccaggg cggccgcaag acctacacgg tacgcgtccc caccatcttc 240
ccgctgcgcg gcgtcaatga gcgcaaagta gcgcgcagac ccgctggggc cgcggccgcc 300
cgccgcaggc agcagcagca acagcagcag cagcagcaac agcagcaaca gcagcagcag 360
cagcaacagc agcagcagca gcagcagcag cagcagtcct caccctctgc ctccactgcc 420
cagactgccc agctgcagcc gaacctggta tctgcttccg cggccgtgct tctcaccctt 480
caggccactg tagacagcag tcaggctccg ggatccgtac agccggcgcc catcactccc 540
actggagaag acgtgaagcc catcgatctc acagtgcaag tggagtttgc agccgcagag 600
ggcgcagccg ctgcggccgc cgcgtcggag ttacaggctg ctaccgcagg gctggaggct 660
gccgagtgcc ctatgggccc ccagttggtg gtggtagggg aagagggctt ccctgatact 720
ggctccgacc attcgtactc cttgtcgtca ggcaccacgg aggaggagct cctgcgcaag 780
ctgaatgagc agcgggacat cctggctctg atggaagtga agatgaaaga gatgaaaggc 840
agcattcgcc acctgcgtct cactgaggcc aagctgcgcg aagaactgcg tgagaaggat 900
cggctgcttg ccatggctgt catccgcaag aagcacggaa tgtga 945



<210> 171 
<211> 2286 
<212> DNA 

<213> Homo sapiens 



<400> 171 

atgccgaact tctgcgctgc ccccaactgc acgcggaaga gcacgcagtc cgacttggcc 60
ttcttcaggt tcccgcggga ccctgccaga tgccagaagt gggtggagaa ctgtaggaga 120
gcagacttag aagataaaac acctgatcag ctaaataaac attatcgatt atgtgccaaa 180
cattttgaga cctctatgat ctgtagaact agtccttata ggacagttct tcgagataat 240
gcaataccaa caatatttga tcttaccagt catttgaaca acccacatag tagacacaga 300
aaacgaataa aagaactgag tgaagatgaa atcaggacac tgaaacagaa aaaaattgat 360
gaaacttctg agcaggaaca aaaacataaa gaaaccaaca atagcaatgc tcagaacccc 420
agcgaagaag agggtgaagg gcaagatgag gacattttac ctctaaccct tgaagagaag 480
gaaaacaaag aatacctaaa atctctattt gaaatcttga ttctgatggg aaagcaaaac 540
atacctctgg atggacatga ggctgatgaa atcccagaag gtctctttac tccagataac 600
tttcaggcac tgctggagtg tcggataaat tctggtgaag aggttctgag aaagcggttt 660
gagacaacag cagttaacac gttgttttgt tcaaaaacac agcagaggca gatgctagag 720
atctgtgaga gctgtattcg agaagaaact ctcagggaag tgagagactc acacttcttt 780
tccattatca ctgacgatgt agtggacata gcaggggaag agcacctacc tgtgttggtg 840
aggtttgttg atgaatctca taacctaaga gaggaattta taggcttcct gccttatgaa 900
gccgatgcag aaattttggc tgtgaaattt cacactatga taactgagaa gtggggatta 960
aatatggagt attgtcgtgg ccaggcttac attgtctcta gtggattttc ttccaaaatg 1020
aaagttgttg cttctagact tttagagaaa tatccccaag ctatctacac actctgctct 1080
tcctgtgcct taaatatgtg gttggcaaaa tcagtacctg ttatgggagt atctgttgca 1140
ttaggaacaa ttgaggaagt ttgttctttt ttccatcgat caccacaact gcttttagaa 1200
cttgacaacg taatttctgt tctttttcag aacagtaaag aaaggggtaa agaactgaag 1260
gaaatctgcc attctcagtg gacaggcagg catgatgctt ttgaaatttt agtggaactc 1320
ctgcaagcac ttgttttatg tttagatggt ataaatagtg acacaaatat tagatggaat 1380 
aactatatag ctggccgagc atttgtactc tgcagtgcag tgtcagattt tgatttcatt 1440 
gttactattg ttgttcttaa aaatgtccta tcttttacaa gagcctttgg gaaaaacctc 1500 
caggggcaaa cctctgatgt cttctttgcg gccggtagct tgactgcagt actgcattca 1560 
ctcaacgaag tgatggaaaa tattgaagtt tatcatgaat tttggtttga ggaagccaca 1620 
aatttggcaa ccaaacttga tattcaaatg aaactccctg ggaaattccg cagagctcac 1680 
cagggtaact tggaatctca gctaacctct gagagttact ataaagaaac cctaagtgtc 1740 
ccaacagtgg agcacattat tcaggaactt aaagatatat tctcagaaca gcacctcaaa 1800 
gctcttaaat gcttatctct ggtaccctca gtcatgggac aactcaaatt caatacgtcg 1860 
gaggaacacc atgctgacat gtatagaagt gacttaccca atcctgacac gctgtcagct 1920 
gagcttcatt gttggagaat caaatggaaa cacaggggga aagatataga gcttccgtcc 1980 
accatctatg aagccctcca cctgcctgac atcaagtttt ttcctaatgt gtatgcattg 2040 
ctgaaggtcc tgtgtattct tcctgtgatg aaggttgaga atgagcggta tgaaaatgga 2100 
cgaaagcgtc ttaaagcata tttgaggaac actttgacag accaaaggtc aagtaacttg 2160 
gctttgctta acataaattt tgatataaaa cacgacctgg atttaatggt ggacacatat 2220 
attaaactct atacaagtaa gtcagagctt cctacagata attccgaaac tgtggaaaat 2280 
acctaa 2286

<210> 172 
<211> 633 
<212> DNA 
<213> Mus musculus 

<400> 172 

atggtgcagt cctgctccgc ctacggctgc aagaaccgct acgacaagga caagcccgtc 60 
tccttccaca agtttcctct tactcgcccc agcctttgta agcagtggga ggcagctgtt 120 
aaaaggaaaa acttcaagcc caccaagtac agcagcatct gctcggagca cttcaccccg 180 
gactgcttta agagggagtg caacaacaag ctactgaagg agaacgctgt gcccacaata 240 
tttctctata tcgagccaca tgagaagaag gaagacctgg aatcccaaga acagctcccc 300 
tctccttcac cccccgcttc ccaggttgat gctgctattg ggctgctaat gccccctctg 360 
cagacccctg ataacctgtc ggttttctgt gaccacaatt acactgtgga ggatacgatg 420 
caccagagga agaggatcct gcagctggag cagcaggtgg agaaactcag gaagaagctc 480 
aagacggccc agcagcggtg ccggcggcag gagaggcagc tcgagaagct caaggaagtc 540 
gtccactttc agagagagaa ggacgacgcg tccgagaggg gctacgtgat cctaccaaat 600 
gactactttg aaattgttga agttccagca tga 633 

<210> 173 
<211> 654 
<212> DNA 
<213> Mus musculus 

<400> 173 

atgccgacca attgcgccgc ggcgggctgt gctgctacct acaacaagca cattaacatc 60 
agcttccaca ggtttccttt ggatcctaaa agaagaaaag aatgggttcg cctggttagg 120 
cgcaaaaatt ttgtgccagg aaaacacact tttctttgct caaagcactt tgaagcctcc 180 
tgttttgatc taacaggaca aacccgacga cttaaaatgg atgctgttcc aaccattttt 240 
gatttttgta cccatataaa gtctctgaaa ctcaagtcaa ggaatcttct gaagacaaac 300 
aacagttttc ctccaactgg accatgtaat ttaaagctga acggcagtca gcaagtactg 360 
cttgaacaca gttatgcctt taggaaccct atggaggcga aaaaaaggat aattaaacta 420 
gaaaaggaaa tagcaagctt gagaaaaaaa atgaaaactt gcctgcaaag agaacgcaga 480 
gcaactcgaa ggtggatcaa agccacgtgc tttgtgaaga gcttagaagc aagtaacatg 540 
ctacctaagg gcatctcaga acagatttta ccaactgcct taagcaatct tcctctggaa 600 
gatttaaaaa gtcttgaaca agatcaacaa gataaaacag tacccattct ctaa 654 

<210> 174 
<211> 657 
<212> DNA 
<213> Mus musculus 

<400> 174 
atgccgaagt cttgcgcggc ccggcaatgc tgcaaccgct acagcagccg caggaagcag 60
ctcaccttcc accggttccc cttcagccgc ccggagctgt tgagggagtg ggtgctcaac 120
atcggccggg ctgacttcaa gcctaagcag cacacagtca tctgctcgga acacttcaga 180
cccgagtgct tcagcgcctt tgggaaccgc aagaacctga aacacaatgc tgtgcccacg 240
gtgttcgctt ttcagaaccc cacagaggtc tgccctgagg tgggggctgg tggggacagc 300
tcagggagga acatggacac cacactggaa gaacttcagc ctccaacccc ggaaggcccc 360
gtgcagcagg tcttaccaga tcgagaagca atggaggcca cggaggccgc tggcctgcct 420
gccagccctc tggggttgaa gaggcccctt ccgggacagc cgtctgatca cagttatgcc 480
ctttcggact tggataccct caaaaaaaaa ctctttctca cactgaagga aaacaagagg 540
cttcggaagc ggctgaaagc ccagaggctg ctgttgcgga ggacatgtgg ccgcctgaga 600
gcctacagag agggacagcc gggacctcgg gccagacggc cggcacaggg aagctga 657

<210> 175
<211> 558
<212> DNA
<213> Mus musculus

<400> 175

atactgcaag catttggaag cctaaaaaaa ggagatgtgc tgtgttcaag acacttcaag 60
aagacagact ttgacagaag cactctaaac actaagctga aggcaggagc catcccttct 120
atctttgaat gtccatatca cttacaggag aaaagagaaa aacttcactg tagaaaaaac 180
ttccttctca aaacccttcc catcacccac catggccgcc agcttgttgg tgcctcctgc 240
attgaagaat tcgaacccca gttcattttt gaacatagct acagtgttat ggacagccca 300
aagaagctta agcataagct agaccgtgtg atcatcgagc tggagaatac caaggaaagc 360
ctacggaatg ttttagcccg agaaaaacac tttcaaaagt cactgaggaa gacaatcatg 420
gaactaaagg atgaaagtct gatcagccag gaaacagcca atagtctggg tgctttctgt 480
tgggagtgct atcatgaaag cacagcagga ggctgtagtt gtgaagtcat ttcttatatg 540
cttcatctgc agttgaca 558

<210> 176
<211> 1719
<212> DNA

<213> Homo sapiens

<400> 176

ctttccgcgc ggcggaagag cgcgcgccag cttcggcaca cttgggagcc ggatcccagc 60
cctacgcctc gtcccctaca agctcctcca agccccgccg gctgctgtgg gagcggcggc 120
cgtcctctcc tggaggtcgt ctcctggcat cctcggggcc gcaggaagga agaggaggca 180
gcggccggag ccctggtggg cggcctgagg tgagagcccg accggcccct ttgggaatat 240
ggcgaccggt ggctaccgga ccagcagcgg cctcggcggc agcaccacag acttcctgga 300
ggagtggaag gcgaaacgcg agaagatgcg cgccaagcag aaccccccgg gcccggcccc 360
cccgggaggg ggcagcagcg acgccgctgg gaagcccccc gcgggggctc tgggcacccc 420
ggcggccgcc gctgccaacg agctcaacaa caacctcccg ggcggcgcgc cggccgcacc 480
tgccgtcccc ggtcccgggg gcgtgaactg cgcggtcggc tccgccatgc tgacgcgggc 540
gcccccggcc cgcggcccgc ggcggtcgga ggacgagccc ccagccgcct ctgcctcggc 600
tgcaccgccg ccccagcgtg acgaggagga gccggacggc gtcccagaga agggcaagag 660
ctcgggcccc agtgccagga aaggcaaggg gcagatcgag aagaggaagc tgcgggagaa 720
gcggcgctcc accggcgtgg tcaacatccc tgccgcagag tgcttagatg agtacgaaga 780
tgatgaagca gggcagaaag agcggaaacg agaagatgca attacacaac agaacactat 840
tcagaatgaa gctgtaaact tactagatcc aggcagttcc tatctgctac aggagccacc 900
tagaacagtt tcaggcagat ataaaagcac aaccagtgtc tctgaagaag atgtctcaag 960
tagatattct cgaacagata gaagtgggtt ccctagatat aacagggatg caaatgtttc 1020
aggtactctg gtttcaagta gcacactgga aaagaaaatt gaagatcttg aaaaggaagt 1080
agtaacagaa agacaagaaa acctaagact tgtgagactg atgcaagata aagaggaaat 1140
gattggaaaa ctcaaagaag aaattgattt attaaataga gacctagatg acatagaaga 1200
tgaaaatgaa cagctaaagc aggaaaataa aactcttttg aaagttgtgg gtcagctgac 1260
caggtagagg attcaagact caatgtggaa aaaatatttt aaactactga ttgaatgtta 1320
atggtcaatg ctagcacaat attcctatgc tgcaatacat taaaataact aagcaagtat 1380
atttatttct agcaaacaga tgtttgtttt caaaatactt ctttttcatt attggtttta 1440
aaaaagcatt atccttttat ctcacaaata agtaatatct ttcagttatt aaatgataga 1500
taatgccttt ttggttttgt gtggtattca actaatacat ggtttaaagt cacagccgtt 1560
tgaatatatt ttatcttggt agtacatttt ctcccttagg aatatacata gtctttgttt 1620
acatgagttc caatactttt gggatgttac cctcacatgt ccctatactg atgtgtgcca 1680
ccttttatgt gttgatgact cactcataag gttttggtc 1719



<210> 177 
<211> 878 
<212> DNA 

<213> Homo sapiens 



<400> 177 

atcccagccc acgcacagac ccccaacttg cagctgccca cctcaccctc agctctggcc 60
tcttactcac cctctaccac agacatggct cagtcactgg ctctgagcct ccttatcctg 120
gttctggcct ttggcatccc caggacccaa ggcagtgatg gaggggctca ggactgttgc 180
ctcaagtaca gccaaaggaa gattcccgcc aaggttgtcc gcagctaccg gaagcaggaa 240
ccaagcttag gctgctccat cccagctatc ctgttcttgc cccgcaagcg ctctcaggca 300
gagctatgtg cagacccaaa ggagctctgg gtgcagcagc tgatgcagca tctggacaag 360
acaccatccc cacagaaacc agcccagggc tgcaggaagg acaggggggc ctccaagact 420
ggcaagaaag gaaagggctc caaaggctgc aagaggactg agcggtcaca gacccctaaa 480
gggccatagc ccagtgagca gcctggagcc ctggagaccc caccagcctc accagcgctt 540
gaagcctgaa cccaagatgc aagaaggagg ctatgctcag gggccctgga gcagccaccc 600
catgctggcc ttgccacact ctttctcctg ctttaaccac cccatctgca ttcccagctc 660
taccctgcat ggctgagctg cccacagcag gccaggtcca gagagaccga ggagggagag 720
tctcccaggg agcatgagag gaggcagcag gactgtcccc ttgaaggaga atcatcagga 780
ccctggacct gatacggctc cccagtacac cccacctctt ccttgtaaat atgatttata 840
cctaactgaa taaaaagctg ttctgtcttc ccacccaa 878



<210> 178 
<211> 34 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Interferon gamma homology motif of THAP1 
<400> 178 

Asn Tyr Thr Val Glu Asp Thr Met His Gln Arg Lys Arg Ile His Gln
1               5                   10                  15
Leu Glu Gln Gln Val Glu Lys Leu Arg Lys Lys Leu Lys Thr Ala Gln
            20                  25                  30
Gln Arg



<210> 179 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Nuclear localization sequence of THAP1 
<400> 179 

Arg Lys Arg Ile His Gln Leu Glu Gln Gln Val Glu Lys Leu Arg Lys
1               5                   10                  15
Lys Leu Lys Thr
            20



<210> 180 
<211> 38 
<212> PRT 
<213> Artificial Sequence
<220> 

<223> Consensus sequence for PAR 4 binding domain of THAP 
<221> UNSURE 

<222> 3-16, 19, 23, 25-35 

<223> Xaa = any of the twenty amino acids 

<221> VARIANT 
<222> 37 

<223> Xaa = Arg or Lys 
<400> 180 

Leu Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1               5                   10                  15
Gln Arg Xaa Arg Arg Gln Xaa Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
            20                  25                  30
Xaa Xaa Xaa Gln Arg Glu
        35



<210> 181 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 181 

gaattcggcc attatggcct gcaggatccg gccgcctcgg cccaggatcc 50 

<210> 182 

<211> 111 

<212> PRT 

<213> Homo sapiens 

<220> 

<400> 182 



Ser Asp Gly Gly Ala Gln Asp Cys Cys Leu Lys Tyr Ser Gln Arg Lys
1               5                   10                  15
Ile Pro Ala Lys Val Val Arg Ser Tyr Arg Lys Gln Glu Pro Ser Leu
            20                  25                  30
Gly Cys Ser Ile Pro Ala Ile Leu Phe Leu Pro Arg Lys Arg Ser Gln
        35                  40                  45
Ala Glu Leu Cys Ala Asp Pro Lys Glu Leu Trp Val Gln Gln Leu Met
    50                  55                  60
Gln His Leu Asp Lys Thr Pro Ser Pro Gln Lys Pro Ala Gln Gly Cys
65                  70                  75                  80
Arg Lys Asp Arg Gly Ala Ser Lys Thr Gly Lys Lys Gly Lys Gly Ser
                85                  90                  95
Lys Gly Cys Lys Arg Thr Glu Arg Ser Gln Thr Pro Lys Gly Pro
            100                 105                 110



<210> 183 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Primer
<400> 183

gcgggatccg tagtgatgga ggggctcagg actgttg 37

<210> 184 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 184 

gcgggatccc tatggccctt taggggtctg tgacc 35

<210> 185 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 185 

ccgaattcag gatggtgcag tcctgctccg cct 33

<210> 186 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 186 

cgcggatcct gctggtactt caactatttc aaagtagtc 39

<210> 187 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 187 

ccgaattcag gatggtgcag tcctgctccg cct 33

<210> 188 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 188 

cgcggatcct gctggtactt caactatttc aaagtagtc 39

<210> 189
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 189 

gcggaattca tggcgaccgg tggctaccgg acc 33 

<210> 190 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 190 

gcgggatccc tctacctggt cagctgaccc acaac 35 

<210> 191 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 191 

ccgaattcag gatggtgcag tcctgctccg cct 33 

<210> 192 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 192 

cgcggatcct gctggtactt caactatttc aaagtagtc 39 

<210> 193 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 193 

cgcgaattcg ccatcatggg gttccctaga tataacaggg atgcaa 46 

<210> 194 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Primer
<400> 194

gccggatccg ggttccctag atataacagg gatgcaa 37 

<210> 195 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 195

gcgctctaga gccatcatgg aggagcagaa gctgatc 37 

<210> 196 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 196 

cttgcggccg cctctacctg gtcagctgac ccacaac 37 

<210> 197 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 197 

gcggaattca aagaagatct tctggagcca caggaac 37 

<210> 198 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 198 

cgcggatcct gctggtactt caactatttc aaagtagtc 39 

<210> 199 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 199 

gcggaattca tgccgcctct tcagacccct gttaa 35 

<210> 200
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 200 

gcggaattca tgcaccagcg gaaaaggatt catcag 36 

<210> 201 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 201 

ccgaattcag gatggtgcag tcctgctccg cct 33 

<210> 202 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 202 

gcgggatccc ttgtcatgtg gctcagtaca aagaaatat 39 

<210> 203 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 203 

cgggatcctg tgcggtcttg agcttctttc tgag 34 

<210> 204 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 204 

gcgggatccg tcgtctttct ctttctggaa gtgaac 36 

<210> 205 
<211> 36 
<212> PRT 

<213> Artificial Sequence 
<220> 
<223> Consensus sequence for PAR 4 binding domain of THAP
<221> UNSURE 

<222> 3-14, 17, 21, 23-33, 35 

<223> Xaa = any of the twenty amino acids 

<400> 205 

Leu Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gln Arg
1               5                   10                  15
Xaa Arg Arg Gln Xaa Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
            20                  25                  30
Xaa Gln Xaa Glu
        35

<210> 206 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 206 

ccgcacagca gcgatgcgct gctcaagaac ggcagcttg 39 

<210> 207 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 207 

caagctgccg ttcttgagca gcgcatcgct gctgtgcgg 39 

<210> 208 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 208 

gctcaagacc gcacagcaag aacggcagct tg 32 

<210> 209 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 209 

caagctgccg ttcttgctgt gcggtcttga gc 32 

<210> 210 
<211> 36 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 210 

gcgggatccc taaattagaa aggggtgggg gtagcc 36 



<210> 211 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 211 

gcggaattca tggagcctgc acccgcccga tc 32

<210> 212
<211> 37
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer 
<400> 212 

gcggaattca aagaagatct tctggagcca caggaac 37 

<210> 213 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 213 

cgcggatcct gctggtactt caactatttc aaagtagtc 39 

<210> 214 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 214 

cgcggatccg tgcagtcctg ctccgcctac ggc 33 

<210> 215 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 215

ccgaattctt atgctggtac ttcaactatt tcaaagtag 39 

<210> 216 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 216 

gccgaattca tgccgaactt ctgcgctgcc ccc 33 

<210> 217 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 217 

cgcggatcct taggttattt tccacagttt cggaattatc 40

<210> 218 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 218

gcgctgcagc aagctaaatt taaatgaagg tactcttgg 39 

<210> 219 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 219 

gcgagatctg ggaaatgccg accaattgcg ctgcg 35 

<210> 220 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 220 

agaggatcct tagctctgct gctctggccc aagtc 35 

<210> 221 
<211> 32 
<212> DNA 
<213> Artificial Sequence
<220>

<223> Primer
<400> 221

agagaattca tgccgaagtc gtgcgcggcc cg 32

<210> 222 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 222 

gcggaattca tgccgcgtca ctgctccgcc gc 32

<210> 223 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 223 

gcgggatcct caggccatgc tgctgctcag ctgc 34

<210> 224 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 224 

gcgagatctc gatggtgaaa tgctgctccg ccattgga 38

<210> 225 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 225 

gcgggatcct catgaaatat agtcctgttc tatgctctc 39

<210> 226 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 226 

gcgagatctc gatgcccaag tactgcaggg cgccg 35

<210> 227 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 227 

gcggaattct tatgcactgg ggatccgagt gtccagg 37 

<210> 228 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 228 

gcggaattca tgccggcccg ttgtgtggcc gc 32 

<210> 229 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 229 

gcgggatcct taacatgttt cttctttcac ctgtacagc 39 

<210> 230 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 230 

gcgagatctc gatgcctggc tttacgtgct gcgtgc 36 

<210> 231 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 231 

gcggaattct cacattccgt gcttcttgcg gatgac 36 

<210> 232 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Primer
<400> 232

ccgaattcag gatggtgcag tcctgctccg cct 33

<210> 233
<211> 39
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer 
<400> 233 

cgcggatcct gctggtactt caactatttc aaagtagtc 39 

<210> 234 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 234 

gcgctctaga gccatcatgg aggagcagaa gctgatc 37 

<210> 235 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 235 

gcgctctaga ttatgctggt acttcaacta tttcaaagta g 41 

<210> 236 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 236 

cgcggatccg tgcagtcctg ctccgcctac ggc 33 

<210> 237 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 237

cgcggatcct gctggtactt caactatttc aaagtagtc 39

<210> 238
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 238 

gccggatccg ggttccctag atataacagg gatgcaa 37

<210> 239 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 239 

gcgggatccc tctacctggt cagctgaccc acaac 35

<210> 240 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 240 

gcgggatcca gtgatggagg ggctcaggac tgttg 35

<210> 241 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 241 

gcgggatccc tatggccctt taggggtctg tgacc 35



<210> 242 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 242 

gcgcatatgg tgcagtcctg ctccgcctac ggc 33

<210> 243
<211> 36
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer 
<400> 243 

gcgctcgagt ttcttgtcat gtggctcagt acaaag 36 

<210> 244 
<211> 62 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<221> unsure 
<222> 21-45 

<223> n = any of the four nucleotides
<400> 244 

tgggcactat ttatatcaac nnnnnnnnnn nnnnnnnnnn nnnnnaatgt cgttggtggc 60 
cc 62 

<210> 245 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 245

accgcaagct tgggcactat ttatatcaac 30

<210> 246
<211> 24
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer

<400> 246

ggtctagagg gccaccaacg catt 24

<210> 247
<211> 2173
<212> DNA

<213> Homo sapiens

<400> 247

gacgggcgat ggctgtggtc cttctgctaa tgcaaacaac aaaacgggca cactagtcac 60
ccccgaggga ggccaccatc actgtaactg ttggccaaag ctacaaaaga agcgagggaa 120
tccaaccgag cgcagcgaca ctgagaacag cttcccctgc cttctgcggc ggcagaagtg 180
aagtgcctga ggaccggaag gatggtgcag tcctgctccg cctacggctg caagaaccgc 240
tacgacaagg acaagcccgt ttctttccac aagtttcctc ttactcgacc cagtctttgt 300
aaagaatggg aggcagctgt cagaagaaaa aactttaaac ccaccaagta tagcagtatt 360
tgttcagagc actttactcc agactgcttt aagagagagt gcaacaacaa gttactgaaa 420
gagaatgctg tgcccacaat atttctttgt actgagccac atgacaagaa agaagatctt 480
ctggagccac aggaacagct tcccccacct cctttaccgc ctcctgtttc ccaggttgat 540
gctgctattg gattactaat gccgcctctt cagacccctg ttaatctctc agttttctgt 600
gaccacaact atactgtgga ggatacaatg caccagcgga aaaggattca tcagctagaa 660
cagcaagttg aaaaactcag aaagaagctc aagaccgcac agcagcgatg cagaaggcaa 720
gaacggcagc ttgaaaaatt aaaggaggtt gttcacttcc agaaagagaa agacgacgta 780 
tcagaaagag gttatgtgat tctaccaaat gactactttg aaatagttga agtaccagca 840 
taaaaaaatg aaatgtgtat tgatttctaa tggggcaata ccacatatcc tcctctagcc 900 
tgtaaaggag tttcatttaa aaaaataaca tttgattact tatataaaaa cagttcagaa 960 
tattttttta aaaaaaattc tatatatact gtaaaattat aaattttttt gtttgtaatt 1020 
tcaggttttt tacattttaa caaaatattt taaaagttat aaactaacct cagacctcta 1080 
atgtaagttg gtttcaagat tggggatttt ggggtttttt ttttagtatt tatagaaata 1140 
atgtaaaaat aaaaagtaaa gagaatgaga acagtgtggt aaaagggtga tttcagttta 1200 
aaacttaaaa ttagtactgt tttattgaga gaatttagtt atattttaaa tcagaagtat 1260 
gggtcagatc atgggacata acttcttaga atatatatat acatatgtac atattctcat 1320 
atgtaaagtc acaaggttca tttatctttc tgaatcagtt atcaaagata aattggdaag 1380 
tcagtactta agaaaaaaga tttgattatc atcacagcag aaaaaagtca ttgcatatct 1440 
gatcaataac ttcagattct aagagtggat tttttttttt tacatgggct cctatttttt 1500 
cccctactgt cttgcattat aaaattagaa gtgtattttc agtggaagaa acatttttca 1560 
ataaataaag taaggcattg tcatcaatga agtaattaaa actgggacct gatctatgat 1620 
acgctttttt ctttcattac accctagctg aaggacatcc cagttcccca gctgtagtta 1680 
tgtatctgcc ttcaagtctc tgacaaatgt gctgtgttag tagagtttga tttgtatcat 1740 
atgataatct tgcacttgac tgagttggga caaggcttca cataaaaaat tatttcttca 1800 
cttttaacac aagttagaaa ttatatccca tttagttaaa tgcgtgattt atattcagaa 1860 
caacctacta tgtagcgttt attttactga atgtggagat ttaaacactg aggtttctgt 1920 
tcaaactgtg agttctgttc tttgtgagaa attttacata tattggaagt gaaaatatgt 1980 




aaaaaaaaaa aaa 2173

<210> 248

<211> 1302

<212> DNA

<213> Homo sapiens



<400> 248 

aattgctctg aggaccgctg ccaaagaaac gcagtagatc cgctccctct tgggggcggg 60 
gagaaagaac gggttgtgtc cgccatgttg gtgaagtcaa gcgaaggcga ctagagctcc 120 
aggagggcca gttctgtggg ctctagtcgg ccatattaat aaagagaaag ggaaggctga 180 
ccgtccttcg cctccgcccc cacatacaca ccccttcttc ccactccgct ctcacgacta 240 
agctctcacg attaaggcac gcctgcctcg attgtccagc ctctgccaga agaaagctta 300 
gcagccagcg cctcagtaga gacctaaggg cgctgaatga gtgggaaagg gaaatgccga 360 
ccaattgcgc tgcggcgggc tgtgccacta cctacaacaa gcacattaac atcagcttcc 420 




gtacccatat aaagtctatg aaactcaagt caaggaatct tttgaagaaa aacaacagtt 660 
gttctccagc tggaccatct aatttaaaat caaacattag tagtcagcaa gtactacttg 720 
aacacagcta tgcctttagg aatcctatgg aggcaaaaaa gaggatcatt aaactggaaa 780 
aagaaatagc aagcttaaga agaaaaatga aaacttgcct acaaaaggaa cgcagagcaa 840 
ctcgaagatg gatcaaagcc acgtgtttgg taaagaattt agaagcaaat agtgtattac 900 
ctaaaggtac atcagaacac atgttaccaa ctgccttaag cagtcttcct ttggaagatt 960 
ttaagatcct tgaacaagat caacaagata aaacactgct aagtctaaat ctaaaacaga 1020 
ccaagagtac cttcatttaa atttagcttg cacagagctt gatgcctatc cttcattctt 1080 
ttcagaagta aagataatta tggcacttat gccaaaattc attatttaat aaagttttac 1140 
ttgaagtaac attactgaat ttgtgaagac ttgattacaa aagaataaaa aacttcatat 1200 
ggaaatttta tttgaaaatg agtggaagtg ccttacatta gaattacgga cttaaaaatt 1260 
ttgctaataa attgtgtatt tgaaaaaaaa aaaaaaaaaa aa 1302

<210> 249 

<211> 1995 

<212> DNA 

<213> Homo sapiens 
<400> 249

ccagtgacgt cagaggagtc cagacctatt cacaattcaa agccctaaaa acactgaggg 60
gttggccgtt ggtttccagt tgtccaagcc tgtgagtggc tatgcgtcct ggttgggtgc 120
tcaaagcaag gaggtgaaag gcgaccagca ttggcgaatg gggtaagact tgcacaggcc 180
caaggctagg agttggggtt tcgggcctga attggggccc ggagcacccc tttacgtggc 240
gccccgggtc ccgtccgacc ctggggagac gcgggtggct gggatggcag gatgagcgcg 300
ccctggaggc gagccaggcc cgtcaccacc tcccagcggc cccgcccctc cccgcaggtc 360
cctcccctct ccgcaggccc cgccgccgcc gccatctttg ttgggggcag ccaggcctgg 420
ctcgagatgc cgaagtcgtg cgcggcccgg cagtgctgca accgctacag cagccgcagg 480
aagcagctca ccttccaccg gtttccgttc agccgcccgg agctgctgaa ggaatgggtg 540
ctgaacatcg gccggggcaa cttcaagccc aagcagcaca cggtcatctg ctccgagcac 600
ttccggccag agtgcttcag cgcctttgga aaccgcaaga acctaaagca caatgccgtg 660
cccacggtgt tcgcctttca ggaccccaca cagcaggtga gggagaacac agaccctgcc 720
agtgagagag gaaatgccag ctcttctcag aaagaaaagg tcctccctga ggcgggggcc 780
ggagaggaca gtcctgggag aaacatggac actgcacttg aagagcttca gttgccccca 840
aatgccgaag gccacgtaaa acaggtctcg ccacggaggc cgcaagcaac agaggctgtt 900
ggccggccga ctggccctgc aggcctgaga aggaccccca acaagcagcc atctgatcac 960
agctatgccc ttttggactt agattccctg aagaaaaaac tcttcctcac tctgaaggaa 1020
aatgaaaagc tccggaagcg cttgcaggcc cagaggctgg tgatgcgaag gatgtccagc 1080
cgcctccgtg cttgcaaagg gcaccaggga ctccaggcca gacttgggcc agagcagcag 1140
agctgagccc cacaggctcc ggacgcagag gtggcagtgg caccagggcc ggcagagctt 1200
tggagctctg gctgtggaca tttttgtctg ctgtggacac tgagaaagtt ggccatgagg 1260
cctgcttggc cggggatcga gacagtagcc aagctccccg gcgagagccc caatgccgtc 1320
tgggggacgt ttagaggcgt ggcactagga gtgcacatct gtgagcatga caagcttatc 1380
ctcccatggt aacagaagtc caggctgagg ctgattctgg acgctgccct ttcagcacac 1440
gcagagcaaa gatcgttgga agccccagtg tgggagatgc tcctcaggga ggaagccatg 1500
tgagggggct ggctctgtgg cgggtgagtg gtcccctcct ccatcagcct ggacagccgc 1560
tcggggttct aaggagtgac tcctgtcccg gcctggtgtg agtgggcagt gtaataaagt 1620
gtctttctat acggtgtcgc tcccatcatc attttctcta gtgccgtgat tccttctaag 1680
aagactgact tccgtggccg ggcgcagtgg ctcatgcctg taatcccagc actttgagag 1740
gccgaggtgg ggagatcact tgaggtcagg agttcaagac cagcctggcc aacatggtga 1800
aatcccatgt ctactaaaaa agacacaaat tagccaggcg tggtggcaca cacctgtagt 1860
cccagctacc tgggaggctg agacaggagg atcagctgaa cccgggaggt ggaggttgca 1920
gtgagccgag atcacaccac tgccctctag tattgtcact gggtgacaga gcgagactca 1980
gtctgaaaaa aaaaa 1995

<210> 250

<211> 1999

<212> DNA

<213> Homo sapiens

<400> 250

gggctagggc cggggcctgg ctgcgcggct gggccaaggc ccgcgatggt gatctgctgt 60
gcggccgtga actgctccaa ccggcaggga aagggcgaga agcgcgccgt ctccttccac 120
aggttccccc taaaggactc aaaacgtcta atccaatggt taaaagctgt tcagagggat 180
aactggactc ccactaagta ttcatttctc tgtagtgagc atttcaccaa agacagcttc 240
tccaagaggc tggaggacca gcatcgcctg ctgaagccca cggccgtgcc atccatcttc 300
cacctgaccg agaagaagag gggggctgga ggccatggcc gcacccggag aaaagatgcc 360
agcaaggcca cagggggtgt gaggggacac tcgagtgccg ccaccggcag aggagctgca 420
ggttggtcac cgtcctcgag tggaaacccg atggccaagc cagagtcccg caggttgaag 480
caagctgctc tgcaaggtga agccacaccc agggcggccc aggaggccgc cagccaggag 540
caggcccagc aagctctgga acggactcca ggagatggac tggccaccat ggtggcaggc 600
agtcagggaa aagcagaagc gtctgccaca gatgctggcg atgagagcgc cacttcctcc 660
atcgaagggg gcgtgacaga taagagtggc atttctatgg atgactttac gcccccagga 720
tctggggcgt gcaaatttat cggctcactt cattcgtaca gtttctcctc taagcacacc 780
cgagaaaggc catctgtccc ccgagagccc attgaccgca agaggctgaa gaaagatgtg 840
gaaccaagct gcagtgggag cagcctggga cccgacaagg gcctggccca gagccctccc 900
agctcatcac ttaccgcgac accgcagaag ccttcccaga gcccctctgc ccctcctgcc 960
gacgtcaccc caaagccagc cacggaagcc gtgcagagcg agcacagcga cgccagcccc 1020
atgtccatca acgaggtcat cctgtcggcg tcaggggcct gcaagctcat cgactcactg 1080
cactcctact gcttctcctc ccggcagaac aagagccagg tgtgctgcct gcgggagcag 1140
gtggagaaga agaacggcga gctgaagagc ctgcggcaga gggtcagccg ctccgacagc 1200
caggtgcgga agctacagga gaagctggat gagctgagga gagtgagcgt cccctatcca 1260
agtagcctgc tgtcgcccag ccgcgagccc cccaagatga acccagtggt ggagccactg 1320
tcctggatgc tgggcacctg gctgtcggac ccacctggag ccgggaccta ccccacactg 1380
cagcccttcc agtacctgga ggaggttcac atctcccacg tgggccagcc catgctgaac 1440
ttctcgttca actccttcca cccggacacg cgcaagccga tgcacagaga gtgtggcttc 1500
attcgcctca agcccgacac caacaaggtg gcctttgtca gcgcccagaa cacaggcgtg 1560
gtggaagtgg aggagggcga ggtgaacggg caggagctgt gcatcgcatc ccactccatc 1620
gccaggatct ccttcgccaa ggagccccac gtagagcaga tcacccggaa gttcaggctg 1680
aattctgaag gcaaacttga gcagacggtc tccatggcaa ccacgacaca gccaatgact 1740
cagcatcttc acgtcaccta caagaaggtg accccgtaaa cctagagctt ctggagccct 1800
cgggagggcc tggctactgt gcctcaacgg ttcggctcct caacagacag tccctgcggc 1860
aaaagtgggt gtggccgtga gcctctgcag gctcaagagt gttgtccaga tgtttctgta 1920
ctggcataga aaaaccaaat aaaaggcctt tatttttatg gctgaggatt ttgaatatta 1980
aaaaaaaaaa aaaaaaaaa 1999



<210> 251 

<211> 1398 

<212> DNA 

<213> Homo sapiens 



<400> 251 
ggctgtgcgc cacttccggc ttcaaccccc gaaaaggcgg tgcttaaacc ggaggaggcg 60
gaagtgagtc gacagacgag gcggctttcc cggcagaatg ctagcgcagg cgcaggggct 120
cgagaggcct ggacctgtgg cgcatcctca gtgaggaggg ccgccctgca tccgtcgccg 180
gccccggtct ccaggggcct cacccgagtc atgccccgct attgcgcagc gatttgttgt 240
aagaaccgcc ggggacgaaa caataaagac cggaagctga gtttttatcc atttcctcta 300
catgacaaag aaagactgga aaagtggtta aagaatatga agcgagattc atgggttccc 360
agtaaatacc agtttctatg tagtgaccat tttactcctg actctcttga catcagatgg 420
ggtattcgat atttaaaaca aactgcagtt ccaacaatat tttctttgcc tgaagacaat 480
cagggaaaag acccttctaa aaaaaaatcc cagaagaaaa acttggaaga tgagaaagaa 540
gtatgcccaa aagccaagtc agaagaatca tttgtattaa atgagacaaa gaaaaatata 600
gttaacacag atgtgcccca tcaacatcca gaattacttc attcatcttc cttggtaaag 660
ccaccagctc ccaaaacagg aagtatacaa aataacatgt taactcttaa tctagttaaa 720
caacatactg ggaaaccaga atctaccttg gaaacatcag ttaaccaaga tacaggtaga 780
ggtggttttc acacatgttt tgagaatcta aattctacaa ctattacttt gacaacttca 840
aattcagaaa gtattcatca atctttggaa actcaagaag ttcttgaagt aactaccagt 900
catcttgcta atccaaactt tacaagtaat tccatggaaa taaagtcagc acaggaaaat 960
ccattcttat tcagcacaat taatcaaaca gttgaagaat taaacacaaa taaagaatct 1020
gttattgcca tttttgtacc tgctgaaaat tctaaaccct cagttaattc ttttatatct 1080
gcacaaaaag aaaccacgga aatggaagac acagacattg aagactcctt gtataaggat 1140
gtagactatg ggacagaagt tttacaaatc gaacattctt actgcagaca agatataaat 1200
aaggaacatc tttggcagaa agtctctaag ctacattcaa agataactct tctagagtta 1260
aaagagcaac aaactctagg tagattgaag tctttggaag ctcttataag gcagctaaag 1320
caggaaaact ggctatctga agaaaacgtc aagattatag aaaaccattt tacaacatat 1380
gaagtcacta tgatatag 1398



<210> 252 

<211> 2291 

<212> DNA 

<213> Homo sapiens 



<400> 252 

agcgaaggca gacgcagtct ccatcgttga cgttagtcgc agtcttcgct gctaacgttt 60
tgttatgagt tgctaaaatg gtgaaatgct gctccgccat tggatgtgct tctcgctgct 120
tgccaaattc gaagttaaaa ggactgacat ttcacgtatt ccccacagat gaaaacatca 180
aaaggaaatg ggtattagca atgaaaagac ttgatgtgaa tgcagccggc atttgggagc 240
ctaaaaaagg agatgtgttg tgttcgaggc actttaagaa gacagatttt gacagaagtg 300
ctccaaatat taaactgaaa cctggagtca taccttctat ctttgattct ccatatcacc 360
tacaggggaa aagagaaaaa cttcattgta gaaaaaactt caccctcaaa accgttccag 420
ccactaacta caatcaccat cttgttggtg cttcctcatg tattgaagaa ttccaatccc 480
agttcatttt tgaacatagc tacagtgtaa tggacagtcc aaagaaactt aagcataaat 540
tagatcatgt gatcggcgag ctagaggata caaaggaaag tctacggaat gttttagacc 600
gagaaaaacg ttttcagaaa tcattgagga agacaatcag ggaattaaag gatgaatgtc 660
tgatcagcca agaaacagca aatagactgg acactttctg ttgggactgt tgtcaggaga 720
gcatagaaca ggactatatt tcatgaaata atttcatgtt acgttccacc taaaattgtc 780
attggtacaa atttttataa aatctcattt accatcacta aataatatcc atcatttaaa 840
gtgctgcttt ggattctctg gagcattatg cattatagtt gttatccaaa gacttttttg 900
aaaatatgca gaaatttgtg gtaattatgt atttgtgtct tgtgacaatt atgttttata 960
gacctacact agtgccaggt cactattgta agatgttaaa atctcaagaa aatttcacag 1020
agctaaagaa atgatgtcaa attagtcaca ttaagctata gtagaaggaa ttggacactt 1080
ctccagatat ttggcttcaa aggagtacct ttacttacat gtgctttatg gtaagtacat 1140
tgaattttac tttaaatgca ttttactaca aagcacaatt catttgtaat gcatatccat 1200
cttggattca atccaaggtg ctttagctat cagtagtacc aaaggatctt tttacaaggc 1260
ttcctgtggt attgactctg agaataacac atagtgaaga tctgtgggct tttaaaattg 1320
ttcacagcca atttaagaag acccctcatg aagtctcagt tttcagtaca gtacatcatt 1380
cctcctcact aggagcactt tgatgtaaac cagaatagct ttaaaaagac aaaaaggatc 1440
gtagatctga tttttaaatg gttggttgct ctgacagatc tgaacacttt gcttcatgac 1500
tatttcgtca taaaggtata tgtttaaaat ctgaatggca gtactagctc tatactttta 1560
atactgcttt gtattttata tgtaaagtag tattgctgac attttaaaaa aatacaaaat 1620
acaaaagaaa ccattagaaa ttaataactg tggctcttcc agttgaaata ggaattggag 1680
agaaaggatt agaatatttt aattagggga gtagattatt gtccaaaggc ttttatttag 1740
agaaacgggt aattaaaaca gcagctttag aatagcttct tactgaatat gcaaaagaat 1800
aattccttgt tatttcctaa ttgatccaag tctcataaat ttagcttttg tcataattcc 1860
ttaccgaaaa caactgaaat tgagagtcat aaatactgtg ggttagaata aaaaccattt 1920
gccaaagcaa cactctactt agaagcacat gtacatacat ggacctcatt cagaagtcca 1980
tgttgtagca gttagaattt gagtatcagc catttcattg tagtaacaaa aattgaattg 2040
cattttgtgc tcagttgttt attgtaattt tatttttgtt acattaatat tagttaagat 2100
atggtcactt gaattttttg tatttaagaa ttttctgttt taatgcatgt tatactttta 2160
tgtaggattc ccaaccttcc ctctaaatgg gatttaaccc acatctgcga gatcagcgtt 2220
atgctaagag gaaatcactg aggccatatc tttttacaat ctgaaaaaaa agtagtaaaa 2280
aggtagttaa a 2291

<210> 253
<211> 1242
<212> DNA
<213> Homo sapiens
<400> 253

cgtgagtgcc gctgacagaa gtcaagagaa tcggctggga cggggttggg gcgacaacgg 60
gccggggggg acccgacagg ccagagcccc ttggggagga gcggcggctg gaggcgcgag 120
gctcctccgg atgcccggag agccgcttgc gacttaactc ccgcctcttt cccagatgcc 180
gcgtcactgc tccgccgccg gctgctgcac acgggacacg cgcgagacgc gcaaccgcgg 240
catctccttc cacagacttc ccaagaagga caacccgagg cgaggcttgt ggctggccaa 300
ctgccagcgg ctggacccca gcggccaggg cctgtgggac ccggcatccg agtacatcta 360
cttctgctcc aaacactttg aggaggactg ctttgagctg gtgggaatca gtggatatca 420
caggctaaag gagggggcag tccccaccat atttgagtct ttctccaagt tgcgccggac 480
aaccaagacc aaaggacaca gttacccacc tggcccccct gaagtcagcc ggctcagacg 540
atgcaggaag cgctgctccg agggccgagg gcccacaact ccattttctc cacctccacc 600
tgctgatgtc acctgctttc ctgtggaaga ggcctcagca cctgccactt tgccggcctc 660
cccagctggg aggctggagc ctggccttag cagccccttt tcagacctac tgggcccctt 720
gggtgcccag gcagatgaag caggctgcag cgcccagcct tcaccagagc ggcagccctc 780
ccctctcgaa ccacggccag tctccccctc agcgtatatg ctgcgcctgc ccccacccgc 840
cggagcctac atccagaatg aacacagcta ccaggtgggc agcgccttac tctggaagcg 900
gcgagccgag gcagcccttg atgcccttga caaggcccag cgccagctgc aggcctgcaa 960
gcggcgggag cagcggctgc ggttgagact gaccaagctg cagcaggagc gggcacggga 1020
gaagcgggca caggcagatg cccgccagac tctgaaggag catgtgcagg actttgccat 1080
gcagctgagc agcagcatgg cctgaggggc tgctggactg accgaggggc tgcccagcaa 1140
gactgcagcc tcttcctccc tcagatccca ccagacccac caggtgccat aataaagcgg 1200
attctagacg gagaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1242

<210> 254
<211> 1383
<212> DNA
<213> Homo sapiens
<400> 254

agcgggggtc ggggttaggc ggcgctccgc gagaaccaaa gtgcagccgc tgacccggca 60
aaactcagcg ggggctggat agccatgccc aagtactgca gggcgccgaa ctgctccaac 120
actgcgggcc gcctgggtgc agacaaccgc cctgtgagct tctacaagtt cccactgaag 180
gatggtcccc ggctgcaggc ctggctgcag cacatgggct gtgagcactg ggtgcccagc 240
tgccaccagc acttgtgcag cgagcacttc acaccctcct gcttccagtg gcgctggggt 300
gtgcgctacc tgcggcctga tgcagtgccc tccatcttct cccggggacc acctgccaag 360
agtcagcgga ggacccgaag cacccagaag ccagtctcgc cgccgcctcc cctacagaag 420
aatacacccc tgccccagag ccctgccatc ccagtctctg gcccagtgcg cctagtggtg 480
ctgggcccca catcggggag ccccaagact gtggccacca tgctcctgac ccccctggcc 540
cctgcgccaa ctcctgagcg gtcacaacct gaagtccctg cccaacaggc ccagaccggg 600
ctgggcccag tgctgggagc actgcaacgc cgggtgcgga ggctgcaacg gtgccaggag 660
cggcaccagg cgcagctgca ggccctggaa cggctggcac agcagctaca cggggagagc 720
ctgctggcac gggcacgccg gggtctgcag cgcctgacaa cagcccagac ccttggacct 780
gaggaatccc aaaccttcac catcatctgt ggagggcctg acatagccat ggtccttgcc 840
caggaccctg cacctgccac agtggatgcc aagccggagc tcctggacac tcggatcccc 900
agtgcataag gatcaagaca gacaatgtcg agggacaaaa gatagaagat ggaggaggaa 960
agacattata cgtgggcttg gcccagcccc accgcccacg cctgggtagt agcagtgcct 1020
ccctcaaggg cctgggttct accaccccac tcctagggat ctcttgaacc ttaggggtga 1080
cctgggccca agtctctcat cagcccccaa tcccctgggt accaggcttc tgccaccccc 1140
ggctcagatc tttgcaaatc agtacgacag cctcagagca gagcaagggt tgtttgggag 1200
aatcatacct ggttctaagg agtcccacgc tttttgccaa gcctggtact gagttcatga 1260
taccatggtg gacacagctg agaaaatccc tgccctcatg gtgctcattc tacttgagta 1320
gacgatgaac tagtaaacaa ataaacaaga acactgcaga catgaaaaaa aaaaaaaaaa 1380
aaa 1383



<210> 255 

<211> 3627 

<212> DNA 

<213> Homo sapiens 



<400> 255 

attcatgctg tcgcgggaac cccgaaggtg gggccccacg taacaagaag atgacccgaa 60
gttgctccgc agtgggctgc agcacccgtg acaccgtgct cagccgggag cgcggcctct 120
ccttccacca atttccaact gataccatac agcgctcaaa atggatcagg gctgttaatc 180
gtgtggaccc cagaagcaaa aagatttgga ttccaggacc aggtgctata ctgtgttcca 240
aacattttca agaaagtgac tttgagtcat atggcataag aagaaagctg aaaaaaggag 300
ctgtgccttc tgtttctcta tacaagattc ctcaaggtgt acatcttaaa ggtaaagcaa 360
gacaaaaaat cctaaaacaa cctcttccag acaattctca agaagttgct actgaggacc 420
ataactatag tttaaagaca cctttgacga taggtgcaga gaaactggct gaggtgcaac 480
aaatgttaca agtgtccaaa aaaagactta tctccgtaaa gaactacagg atgatcaaga 540
agagaaaggg tttacgatta attgatgcac ttgtagaaga gaaactactt tctgaagaaa 600
cagagtgtct gctacgagct caattttcag attttaagtg ggagttatat aattggagag 660
aaacagatga gtactccgca gaaatgaaac aatttgcatg tacactctac ttgtgcagta 720
gcaaagtcta tgattatgta agaaagattc ttaagctgcc tcattcttcc atcctcagaa 780
cgtggttatc caaatgccaa cccagtccag gtttcaacag caacattttt tcttttcttc 840
aacgaagagt agagaatgga gatcagctct atcaatactg ttcattgtta ataaaaagta 900
tacctctcaa gcaacagctt cagtgggatc ctagcagtca cagtttgcag gggtttatgg 960
actttggtct tggaaaactt gatgctgatg aaacgccact tgcttcagaa actgttttgt 1020
taatggcagt gggtattttt ggccattgga gaacacctct tggttatttt tttgtaaaca 1080
gagcatctgg atatttgcag gctcagctgc ttcgtctgac tattggtaaa ctgagtgaca 1140
taggaatcac agttctggct gttacatctg atgccacagc acatagtgtt cagatggcaa 1200
aagcattggg gatacatatt gatggagacg acatgaaatg tacatttcag catccttcat 1260
cttctagtca acagattgca tacttctttg actcttgcca cttgctaaga ttaataagaa 1320
atgcatttca gaattttcaa agcattcagt ttattaatgg tatagcacat tggcagcacc 1380
tcgtggagtt agtagcactg gaggaacagg aattatcaaa tatggaaaga ataccaagta 1440
cacttgcaaa tttgaaaaat catgtactga aagtgaatag tgccacccaa ctctttagtg 1500
agagtgtagc cagtgcatta gaatatttgt tatccttaga cctgccacct tttcaaaact 1560
gtattggtac catccatttt ttacgtttaa ttaacaatct gtttgacatc tttaatagta 1620
ggaactgtta tggaaaggga cttaaagggc ctctgttgcc tgaaacttac agtaaaataa 1680
accacgtgtt aattgaagcc aagactattt ttgttacatt atctgacact agcaataatc 1740
aaataattaa aggtaagcaa aaactaggat tcctgggatt tttgctcaat gctgagagct 1800
taaaatggct ctaccaaaat tatgttttcc caaaggtcat gccttttcct tatcttctga 1860
cttacaaatt cagtcatgat catctggaat tatttctaaa gatgcttagg caggtattag 1920
taacaagttc tagccctacc tgcatggcat tccagaaagc ttactataat ttggagacca 1980
gatacaaatt tcaagatgaa gtttttctaa gcaaagtaag catctttgac atttcaattg 2040
ctcgaaggaa agacttggcg ctttggacag ttcaacgtca gtatggtgtc agcgttacaa 2100
agactgtctt tcacgaagag ggtatttgtc aagactggtc tcattgttca ctaagtgagg 2160
cattactaga cctgtcagat cataggcgaa atctcatctg ttatgctggt tatgttgcaa 2220
acaagttatc agctctttta acttgtgagg actgcatcac tgcactgtat gcatcggatc 2280
tcaaagcctc taaaattggg tcactattat ttgttaaaaa gaagaatggt ttgcattttc 2340
cttcagaaag tctgtgtcgg gtcataaata tttgtgagcg agttgtaaga acccattcaa 2400
gaatggcaat ttttgaacta gtttctaaac aaagggaatt gtatcttcaa cagaaaatat 2460
tatgtgagct ttctgggcat attgatcttt ttgtagatgt gaataagcat ctctttgatg 2520
gagaagtgtg tgccatcaat cactttgtca agttgctaaa ggatataata atctgtttct 2580
taaatatcag agctaaaaat gttgcacaga atcctttaaa acatcattca gagagaactg 2640
atatgaaaac tttatcaagg aaacactggt cacctgtaca ggattataaa tgttcaagtt 2700
ttgctaatac cagtagtaaa ttcaggcatt tgctaagtaa cgatggatat ccattcaaat 2760
gagagaccta aaatatatta acattttaat taagaatact tgatcaacat tttttgaagt 2820
tcaatttacc atattttata aattgcgcat tctgcacagt ggacaagttt gcaattctga 2880
cttattaaaa tttcaaattc tgcatatcac aaaatctcct tatacttttg gtatggcttg 2940
cagcatttat gagttttcca aaatatagaa agcagtaggt cagtaggagc aaactagcca 3000
acaggtactg tctttgaatt tactactgta agactaagca gtgttactgg acacagtttt 3060
aacttgttca atctgcttca aaaacaagaa aaacaacaac tatgagttat caaaatattg 3120
actccattta tgactagact acatttctga aagatctttg gtttacgatt cttaagaata 3180
ttgacaatac ctataaaact ttgaagataa cttttactta aatatgaaaa ttatagtttg 3240
aaaattaggc tcaagcaaat atcaaatact gcaaaaatcc ccttgtccca ggatacccta 3300
aaatagaagt atattttgat gtttgtttat tctacctcaa acagaggctt aagttttgaa 3360
gtgtaaccag tttatacttc attttataca aaacttattg ctgagaagtc tgaatattgt 3420
gctttttgtt gtttgtaaat agaattgaat ttaaatacac caggataaat cttatttaaa 3480
ggaagcctgt ttgaaaatca ccaactttaa cttattgctt atataaatcc aagctctgta 3540
cctgatcttt atgtaaagca agattcatat gtgtagtatc taatgccctt tggtgttaca 3600
tttgactaaa atacaaatgt ctgtatt 3627

<210> 256
<211> 771
<212> DNA
<213> Homo sapiens
<400> 256

atgccggccc gttgtgtggc cgcccactgc ggcaacacca ccaagtctgg gaagtcgctg 60
ttccgctttc ccaaggaccg ggccgtgcgg ctgctctggg accgcttcgt gcggggttgc 120
cgcgccgact ggtacggagg caatgaccgc tcggtcatct gctctgacca ctttgcccca 180
gcctgttttg acgtctcttc ggttatccag aagaacctgc gcttctccca gcgcctgagg 240
ctggtggcag gcgccgtgcc caccctgcac cgggtgcccg ccccggcacc taagagggga 300
gaggagggag accaagcagg ccgcctggac acgcgaggag agctccaggc agccaggcat 360
tctgaggctg ccccaggtcc agtctcctgt acacgccccc gagctgggaa gcaggctgca 420
gcttcacaga ttacgtgtga aaatgaactt gtgcaaaccc aaccccatgc tgataatcca 480
tctaatactg tcacttcagt acctactcac tgtgaagaag gcccagtgca taaaagtaca 540
caaatttctt tgaaaaggcc ccgtcaccgt agtgtgggta ttcaagccaa agtgaaagcg 600
tttggaaaaa gactgtgtaa tgcaactact cagacagagg aattgtggtc tagaacttcc 660
tctctctttg acatttactc cagtgattca gaaacagata cagactggga tatcaagagt 720
gaacagagtg atttgtctta tatggctgta caggtgaaag aagaaacatg t 771

<210> 257
<211> 942
<212> DNA
<213> Homo sapiens
<400> 257

atgcctggct ttacgtgctg cgtgccaggc tgctacaaca actcgcaccg ggacaaggcg 60
ctgcacttct acacgtttcc aaaggacgct gagttgcggc gcctctggct caagaacgtg 120
tcgcgtgccg gcgtcagtgg gtgcttctcc accttccagc ccaccacagg ccaccgtctc 180
tgcagcgttc acttccaggg cggccgcaag acctacacgg tacgcgtccc caccatcttc 240
ccgctgcgcg gcgtcaatga gcgcaaagta gcgcgcagac ccgctggggc cgcggccgcc 300
cgccgcaggc agcagcagca acagcagcag cagcagcaac agcagcaaca gcagcagcag 360
cagcaacagc agcagcagca gcagcagcag cagcagtcct caccctctgc ctccactgcc 420
cagactgccc agctgcagcc gaacctggta tctgcttccg cggccgtgct tctcaccctt 480
caggccactg tagacagcag tcaggctccg ggatccgtac agccggcgcc catcactccc 540
actggagaag acgtgaagcc catcgatctc acagtgcaag tggagtttgc agccgcagag 600
ggcgcagccg ctgcggccgc cgcgtcggag ttacaggctg ctaccgcagg gctggaggct 660
gccgagtgcc ctatgggccc ccagttggtg gtggtagggg aagagggctt ccctgatact 720
ggctccgacc attcgtactc cttgtcgtca ggcaccacgg aggaggagct cctgcgcaag 780
ctgaatgagc agcgggacat cctggctctg atggaagtga agatgaaaga gatgaaaggc 840
agcattcgcc acctgcgtct cactgaggcc aagctgcgcg aagaactgcg tgagaaggat 900
cggctgcttg ccatggctgt catccgcaag aagcacggaa tg 942



<210> 258 

<211> 2283 

<212> DNA 

<213> Homo sapiens 



<400> 258 
atgccgaact tctgcgctgc ccccaactgc acgcggaaga gcacgcagtc cgacttggcc 60
ttcttcaggt tcccgcggga ccctgccaga tgccagaagt gggtggagaa ctgtaggaga 120
gcagacttag aagataaaac acctgatcag ctaaataaac attatcgatt atgtgccaaa 180
cattttgaga cctctatgat ctgtagaact agtccttata ggacagttct tcgagataat 240
gcaataccaa caatatttga tcttaccagt catttgaaca acccacatag tagacacaga 300
aaacgaataa aagaactgag tgaagatgaa atcaggacac tgaaacagaa aaaaattgat 360
gaaacttctg agcaggaaca aaaacataaa gaaaccaaca atagcaatgc tcagaacccc 420
agcgaagaag agggtgaagg gcaagatgag gacattttac ctctaaccct tgaagagaag 480
gaaaacaaag aatacctaaa atctctattt gaaatcttga ttctgatggg aaagcaaaac 540
atacctctgg atggacatga ggctgatgaa atcccagaag gtctctttac tccagataac 600
tttcaggcac tgctggagtg tcggataaat tctggtgaag aggttctgag aaagcggttt 660
gagacaacag cagttaacac gttgttttgt tcaaaaacac agcagaggca gatgctagag 720
atctgtgaga gctgtattcg agaagaaact ctcagggaag tgagagactc acacttcttt 780
tccattatca ctgacgatgt agtggacata gcaggggaag agcacctacc tgtgttggtg 840
aggtttgttg atgaatctca taacctaaga gaggaattta taggcttcct gccttatgaa 900
gccgatgcag aaattttggc tgtgaaattt cacactatga taactgagaa gtggggatta 960
aatatggagt attgtcgtgg ccaggcttac attgtctcta gtggattttc ttccaaaatg 1020
aaagttgttg cttctagact tttagagaaa tatccccaag ctatctacac actctgctct 1080
tcctgtgcct taaatatgtg gttggcaaaa tcagtacctg ttatgggagt atctgttgca 1140
ttaggaacaa ttgaggaagt ttgttctttt ttccatcgat caccacaact gcttttagaa 1200
cttgacaacg taatttctgt tctttttcag aacagtaaag aaaggggtaa agaactgaag 1260
gaaatctgcc attctcagtg gacaggcagg catgatgctt ttgaaatttt agtggaactc 1320
ctgcaagcac ttgttttatg tttagatggt ataaatagtg acacaaatat tagatggaat 1380
aactatatag ctggccgagc atttgtactc tgcagtgcag tgtcagattt tgatttcatt 1440
gttactattg ttgttcttaa aaatgtccta tcttttacaa gagcctttgg gaaaaacctc 1500
caggggcaaa cctctgatgt cttctttgcg gccggtagct tgactgcagt actgcattca 1560
ctcaacgaag tgatggaaaa tattgaagtt tatcatgaat tttggtttga ggaagccaca 1620
aatttggcaa ccaaacttga tattcaaatg aaactccctg ggaaattccg cagagctcac 1680
cagggtaact tggaatctca gctaacctct gagagttact ataaagaaac cctaagtgtc 1740
ccaacagtgg agcacattat tcaggaactt aaagatatat tctcagaaca gcacctcaaa 1800
gctcttaaat gcttatctct ggtaccctca gtcatgggac aactcaaatt caatacgtcg 1860
gaggaacacc atgctgacat gtatagaagt gacttaccca atcctgacac gctgtcagct 1920
gagcttcatt gttggagaat caaatggaaa cacaggggga aagatataga gcttccgtcc 1980
accatctatg aagccctcca cctgcctgac atcaagtttt ttcctaatgt gtatgcattg 2040
ctgaaggtcc tgtgtattct tcctgtgatg aaggttgaga atgagcggta tgaaaatgga 2100
cgaaagcgtc ttaaagcata tttgaggaac actttgacag accaaaggtc aagtaacttg 2160
gctttgctta acataaattt tgatataaaa cacgacctgg atttaatggt ggacacatat 2220
attaaactct atacaagtaa gtcagagctt cctacagata attccgaaac tgtggaaaat 2280
acc 2283



<210> 259 
<211> 986 
<212> DNA 
<213> Mus musculus 



<400> 259 

cttctgctaa agcaaacccc acaacggaca gggtagtcac tcgcccaccc caacccccac 60
cccacggcga ggtgatcgtc cccgtaactg ctgaccgacg ccaccgagag cggcgagcgt 120
tatcaaggcc gagcgcggga ccccgacggc ccccttcgcc tgcctcccgg gccgaaggag 180
agtgtggagg gccagaagga tggtgcagtc ctgctccgcc tacggctgca agaaccgcta 240
cgacaaggac aagcccgtct ccttccacaa gtttcctctt actcgcccca gcctttgtaa 300
gcagtgggag gcagctgtta aaaggaaaaa cttcaagccc accaagtaca gcagcatctg 360
ctcggagcac ttcaccccgg actgctttaa gagggagtgc aacaacaagc tactgaagga 420
gaacgctgtg cccacaatat ttctctatat cgagccacat gagaagaagg aagacctgga 480
atcccaagaa cagctcccct ctccttcacc ccccgcttcc caggttgatg ctgctattgg 540
gctgctaatg ccccctctgc agacccctga taacctgtcg gttttctgtg accacaatta 600
cactgtggag gatacgatgc accagaggaa gaggatcctg cagctggagc agcaggtgga 660
gaaactcagg aagaagctca agacggccca gcagcggtgc cggcggcagg agaggcagct 720
cgagaagctc aaggaagtcg tccactttca gagagagaag gacgacgcgt ccgagagggg 780
ctacgtgatc ctaccaaatg actactttga aattgttgaa gttccagcat gaaaaaatga 840
gatgtgttag tgggacaaga ctatacacct tcttttagcc tacatacagg agttcatttg 900
aaaaaataac acttaattac ttgtattaaa aaaacaatat ttttttaaaa taaattagat 960
atatactgta aaaaaaaaaa aaaaaa 986



<210> 260 

<211> 1515 

<212> DNA 

<213> Mus musculus 



<400> 260 

gctctgccct ccccgcgctc tgcaccgagc tggcggcgcg gggtcgcctg cctcgtttgt 60
ctagcgtttg acagaagctt gcttagcggg cagcgcctcc gaagtggcgt aaggtggcgc 120
cgaatgaggg gcccggggaa atgccgacca attgcgccgc ggcgggctgt gctgctacct 180
acaacaagca cattaacatc agcttccaca ggtttccttt ggatcctaaa agaagaaaag 240
aatgggttcg cctggttagg cgcaaaaatt ttgtgccagg aaaacacact tttctttgct 300
caaagcactt tgaagcctcc tgttttgatc taacaggaca aacccgacga cttaaaatgg 360
atgctgttcc aaccattttt gatttttgta cccatataaa gtctctgaaa ctcaagtcaa 420
ggaatcttct gaagacaaac aacagttttc ctccaactgg accatgtaat ttaaagctga 480
acggcagtca gcaagtactg cttgaacaca gttatgcctt taggaaccct atggaggcga 540
aaaaaaggat aattaaacta gaaaaggaaa tagcaagctt gagaaaaaaa atgaaaactt 600
gcctgcaaag agaacgcaga gcaactcgaa ggtggatcaa agccacgtgc tttgtgaaga 660
gcttagaagc aagtaacatg ctacctaagg gcatctcaga acagatttta ccaactgcct 720
taagcaatct tcctctggaa gatttaaaaa gtcttgaaca agatcaacaa gataaaacag 780
tacccattct ctaaatgtaa aatggaagag actctctgca ctcaagtttt cctcacacag 840
aacccagtgc ccagctcctg ccgtccccac ccaccgcact ctgacagtta cactacaatc 900
aagtcctgca gttttacttg aagtagtagt gtcagtgtca ctctctggag actgaggaag 960
tgggaaatcc aatgacaagc ttgacaccga gcagaagtgc cttacatgag ggtcacggac 1020
ttaggaaaca ctgccagcag ggtttctgct cttgtttttt taagctgctg tcaaatagga 1080
atgacaagtg atatgttcat aaaagtaaaa gcattccgca ccaaagctgg gatattacat 1140
tctaaagaac atgtgaagta ggagctaact gcattaaata tgatcttaaa actactaatg 1200
tattttgtat gaattaaatt attgggattg tggttgaaaa ttttatagaa taaaacctct 1260
ggggtacggg gcaaggtttg tttctttgtt ttgttttgtt ttttgtcttt tttagccttt 1320
tgtattttaa ctagtaaaag taaacttatc atggcctttt tttataagaa cattgaattt 1380
aaaagtaggt tgtaaaataa tctgaaatag tattttgaat gtgaaatacc tttgaaactc 1440
caaactaggt aaggccccaa gcacctcaga ctgggaaaac ccagtgagtt atagtcaacg 1500
tctaagaaaa tatac 1515

<210> 261
<211> 1120
<212> DNA

<213> Mus musculus
<400> 261

gaggggcagt gggcccatct ccgagatgcc gaagtcttgc gcggcccggc aatgctgcaa 60
ccgctacagc agccgcagga agcagctcac cttccaccgg ttccccttca gccgcccgga 120
gctgttgagg gagtgggtgc tcaacatcgg ccgggctgac ttcaagccta agcagcacac 180
agtcatctgc tcggaacact tcagacccga gtgcttcagc gcctttggga accgcaagaa 240
cctgaaacac aatgctgtgc ccacggtgtt cgcttttcag aaccccacag aggtctgccc 300
tgaggtgggg gctggtgggg acagctcagg gaggaacatg gacaccacac tggaagaact 360
tcagcctcca accccggaag gccccgtgca gcaggtctta ccagatcgag aagcaatgga 420
ggccacggag gccgctggcc tgcctgccag ccctctgggg ttgaagaggc cccttccggg 480
acagccgtct gatcacagtt atgccctttc ggacttggat accctcaaaa aaaaactctt 540
tctcacactg aaggaaaaca agaggcttcg gaagcggctg aaagcccaga ggctgctgtt 600
gcggaggaca tgtggccgcc tgagagccta cagagaggga cagccgggac ctcgggccag 660
acggccggca cagggaagct gagcctgagc aagctctggg atgtgggggt ggtggcaaca 720
ccttagcagg aagtggtgtt ctggcctgct atgggcgttt ctacccgctg ctgatgctgc 780
aggtgccttg agagtgggat gggatgctgc gacaggcagt tgtcgggtgg gggcccaagt 840
actgcggagg caccgtccca ggtttcttgg gctgaggctg tcagctgtgg ggaagcagca 900
gtgaccaaat gtgagccgtc acaaccccct caagagatgc tcccagaggg agagctggtc 960
attcttacag ccggtggggt ccttactgtc tccccatagg agccattctg atggcaggca 1020
gggcaagggt ccccgtcagc ctgtatttct gagtgactct tttttctgcc tggttcgtgt 1080
agatgtggaa taaatctttt gaagtctcca aaaaaaaaaa 1120

<210> 262
<211> 558
<212> DNA

<213> Mus musculus
<400> 262

atactgcaag catttggaag cctaaaaaaa ggagatgtgc tgtgttcaag acacttcaag 60
aagacagact ttgacagaag cactctaaac actaagctga aggcaggagc catcccttct 120
atctttgaat gtccatatca cttacaggag aaaagagaaa aacttcactg tagaaaaaac 180
ttccttctca aaacccttcc catcacccac catggccgcc agcttgttgg tgcctcctgc 240
attgaagaat tcgaacccca gttcattttt gaacatagct acagtgttat ggacagccca 300
aagaagctta agcataagct agaccgtgtg atcatcgagc tggagaatac caaggaaagc 360
ctacggaatg ttttagcccg agaaaaacac tttcaaaagt cactgaggaa gacaatcatg 420
gaactaaagg atgaaagtct gatcagccag gaaacagcca atagtctggg tgctttctgt 480
tgggagtgct atcatgaaag cacagcagga ggctgtagtt gtgaagtcat ttcttatatg 540
cttcatctgc agttgaca 558

<210> 263
<211> 37
<212> PRT

<213> Artificial Sequence
<220>

<223> Consensus sequence for PAR 4 binding domain of THAP
<221> UNSURE

<222> 3-15, 18, 22, 24-34, 36

<223> Xaa = any of the twenty amino acids

<400> 263

Leu Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gln

1 5 10 15

Arg Xaa Arg Arg Gln Xaa Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa

20 25 30

Xaa Xaa Gln Xaa Glu

35



<210> 264 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 264 

ccgctcgagg tgcagtcctg ct 22

<210> 265
<211> 29
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 265

cgggatccgc tggtacttca actatttca 29

<210> 266
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 266

ccgctcgagg atacaatgca cc 22

<210> 267
<211> 33
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 267

gcgggatccg ctggtacttc aactatttca aag 33

<210> 268
<211> 86
<212> DNA

<213> Artificial Sequence
<220>

<223> Synthetic Oligonucleotide
<400> 268

ccgctcgagc caccatggag acagacacac tcctgctatg ggtactgctg ctctgggttc 60
caggttccac tggtgacctc gagatt 86

<210> 269
<211> 26
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 269 

tagggtcgac gccaccatgg agacag 26 

<210> 270 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 270 

ccgctcgagg tcaccagtgg a 21 

<210> 271 
<211> 134 
<212> PRT 

<213> Homo sapiens 



<400> 271

Met Ala Gln Ser Leu Ala Leu Ser Leu Leu Ile Leu Val Leu Ala Phe
1 5 10 15

Gly Ile Pro Arg Thr Gln Gly Ser Asp Gly Gly Ala Gln Asp Cys Cys
20 25 30

Leu Lys Tyr Ser Gln Arg Lys Ile Pro Ala Lys Val Val Arg Ser Tyr
35 40 45

Arg Lys Gln Glu Pro Ser Leu Gly Cys Ser Ile Pro Ala Ile Leu Phe
50 55 60

Leu Pro Arg Lys Arg Ser Gln Ala Glu Leu Cys Ala Asp Pro Lys Glu
65 70 75 80

Leu Trp Val Gln Gln Leu Met Gln His Leu Asp Lys Thr Pro Ser Pro
85 90 95

Gln Lys Pro Ala Gln Gly Cys Arg Lys Asp Arg Gly Ala Ser Lys Thr
100 105 110

Gly Lys Lys Gly Lys Gly Ser Lys Gly Cys Lys Arg Thr Glu Arg Ser
115 120 125

Gln Thr Pro Lys Gly Pro
130











<210> 272 
<211> 878 
<212> DNA 

<213> Homo sapiens 
<400> 272 



atcccagccc acgcacagac ccccaacttg cagctgccca cctcaccctc agctctggcc 60 
tcttactcac cctctaccac agacatggct cagtcactgg ctctgagcct ccttatcctg 120 
gttctggcct ttggcatccc caggacccaa ggcagtgatg gaggggctca ggactgttgc 180 
ctcaagtaca gccaaaggaa gattcccgcc aaggttgtcc gcagctaccg gaagcaggaa 240
ccaagcttag gctgctccat cccagctatc ctgttcttgc cccgcaagcg ctctcaggca 300
gagctatgtg cagacccaaa ggagctctgg gtgcagcagc tgatgcagca tctggacaag 360
acaccatccc cacagaaacc agcccagggc tgcaggaagg acaggggggc ctccaagact 420
ggcaagaaag gaaagggctc caaaggctgc aagaggactg agcggtcaca gacccctaaa 480
gggccatagc ccagtgagca gcctggagcc ctggagaccc caccagcctc accagcgctt 540
gaagcctgaa cccaagatgc aagaaggagg ctatgctcag gggccctgga gcagccaccc 600
catgctggcc ttgccacact ctttctcctg ctttaaccac cccatctgca ttcccagctc 660 
taccctgcat ggctgagctg cccacagcag gccaggtcca gagagaccga ggagggagag 720 
tctcccaggg agcatgagag gaggcagcag gactgtcccc ttgaaggaga atcatcagga 780 
ccctggacct gatacggctc cccagtacac cccacctctt ccttgtaaat atgatttata 840 
cctaactgaa taaaaagctg ttctgtcttc ccacccaa 878 

<210> 273 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<400> 273 

Met Ala Leu Leu Leu Ala Leu Ser Leu Leu Val Leu Trp Thr Ser Pro

1 5 10 15

Ala Pro Thr Leu Ser Gly Thr Asn Asp Ala Glu Asp Cys Cys Leu Ser

20 25 30

Val Thr Gln Lys Pro Ile Pro Gly Tyr Ile Val Arg Asn Phe His Tyr

35 40 45

Leu Leu Ile Lys Asp Gly Cys Arg Val Pro Ala Val Val Phe Thr Thr

50 55 60

Leu Arg Gly Arg Gln Leu Cys Ala Pro Pro Asp Gln Pro Trp Val Glu
65 70 75 80

Arg Ile Ile Gln Arg Leu Gln Arg Thr Ser Ala Lys Met Lys Arg Arg
85 90 95

Ser Ser



<210> 274 

<211> 684 

<212> DNA 

<213> Homo sapiens 



<400> 274 

cattcccagc ctcacatcac tcacaccttg catttcaccc ctgcatccca gtcgccctgc 60
agcctcacac agatcctgca cacacccaga cagctggcgc tcacacattc accgttggcc 120
tgcctctgtt caccctccat ggccctgcta ctggccctca gcctgctggt tctctggact 180
tccccagccc caactctgag tggcaccaat gatgctgaag actgctgcct gtctgtgacc 240
cagaaaccca tccctgggta catcgtgagg aacttccact accttctcat caaggatggc 300
tgcagggtgc ctgctgtagt gttcaccaca ctgaggggcc gccagctctg tgcaccccca 360
gaccagccct gggtagaacg catcatccag agactgcaga ggacctcagc caagatgaag 420
cgccgcagca gttaacctat gaccgtgcag agggagcccg gagtccgagt caagcattgt 480
gaattattac ctaacctggg gaaccgagga ccagaaggaa ggaccaggct tccagctcct 540
ctgcaccaga cctgaccagc caggacaggg cctggggtgt gtgtgagtgt gagtgtgagc 600
gagagggtga gtgtggtcag agtaaagctg ctccaccccc agattgcaat gctaccaata 660
aagccgcctg gtgtttacaa ctaa 684



<210> 275 
<211> 125 
<212> PRT 

<213> Homo sapiens 
<400> 275 

Met Lys Lys Ser Gly Val Leu Phe Leu Leu Gly Ile Ile Leu Leu Val

1 5 10 15

Leu Ile Gly Val Gln Gly Thr Pro Val Val Arg Lys Gly Arg Cys Ser

20 25 30

Cys Ile Ser Thr Asn Gln Gly Thr Ile His Leu Gln Ser Leu Lys Asp

35 40 45

Leu Lys Gln Phe Ala Pro Ser Pro Ser Cys Glu Lys Ile Glu Ile Ile
50 55 60

Ala Thr Leu Lys Asn Gly Val Gln Thr Cys Leu Asn Pro Asp Ser Ala
65 70 75 80

Asp Val Lys Glu Leu Ile Lys Lys Trp Glu Lys Gln Val Ser Gln Lys
85 90 95

Lys Lys Gln Lys Asn Gly Lys Lys His Gln Lys Lys Lys Val Leu Lys
100 105 110

Val Arg Lys Ser Gln Arg Ser Arg Gln Lys Lys Thr Thr
115 120 125



<210> 276 

<211> 2545 

<212> DNA 

<213> Homo sapiens 

<400> 276 



tt'cc^cttgg 3 TcToltTtl g^g^S atr^T* ^aagaaaag tggtgttctt 60 
aagggtcgct gttcctgcat attggagtgc aaggaacccc agtagtgaga 120 

gaccttaaac aatttgcccc aaocccttcc " aag9gaCta tcca "taca atccttgaaa 180 
aagaatggag ttcaaacato tgcgagaaaa ttgaaatcat tgctacactg 240 

aagtggg^ aacaggtcag H""?" 9 \ tgtgaagga actgattaaa 300 

:s=s: sjsb ~ - - 

tgaaattgag ctggacctca ccaaactao? ^ 9 ? tgctggt 9 a 9 ccaagcagtt 84 0 
ctacaggcct cacacacaaf 9 gtggccatca acctctgtat ttgaatcagc 900 

ass ™ i E= ~- a. 
sssss: £™ ™ -™ -o 

agtgctttct tctcccaatt «tr^. ? tttctcatct tgtccccaac accccacaga 1200 

sis idhiz F"< ~£ ;«s isss 

km bs: Si ££= ~- iss 

aaaatcatat aatcttacaa t|aaaaggac t """^ cagtttaccg 1740 

ccaaccatac aaaaattcct tttrrri!n "tatagatc agccagtgac caaccttttc 1800 
tctaagatct aacaagatag TattT^ ttCtcaataa gcctcagctt 1860 

agttttattg tccgt?ta c ? tgtttcagag tttgtatt^t o^arcfr ? 9C " atat « 1920 
talt™ £~ "~ ^ 

™ 9 2S« ™ S== 2ES5 ^ 0 

tcccacccga acqtcttatc Li-r a fn» <=cgatgattt agagtgctgt ccggtggaga 2220 
aaaaatctaa gtg^tcara u actccctagt tccttcatgt aacttccctg 2280 

9tagacagta cataactaac a IToTtTttot S^" 9 Catctcacag "40 

tcatttatca tatatataca tacatgcata cactctcaaJ 9 * * cacgttataa 2400 

aaacagtatt gacttgtata ccttg^at? tgaa^attt ttttt tt^ tttCaCttca 2460 
tatcaataaa tagaccatta atcag ^"tattt tctttgttaa aatagaatgg 2520
<210> 277

<211> 98 

<212> PRT 

<213> Homo sapiens 



<400> 277 



Met Asn Gln Thr Ala Ile Leu Ile Cys Cys Leu Ile Phe Leu Thr Leu
1 5 10 15

Ser Gly Ile Gln Gly Val Pro Leu Ser Arg Thr Val Arg Cys Thr Cys
20 25 30

Ile Ser Ile Ser Asn Gln Pro Val Asn Pro Arg Ser Leu Glu Lys Leu
35 40 45

Glu Ile Ile Pro Ala Ser Gln Phe Cys Pro Arg Val Glu Ile Ile Ala
50 55 60

Thr Met Lys Lys Lys Gly Glu Lys Arg Cys Leu Asn Pro Glu Ser Lys
65 70 75 80

Ala Ile Lys Asn Leu Leu Lys Ala Val Ser Lys Glu Met Ser Lys Arg
85 90 95

Ser Pro































<210> 278 
<211> 1172 
<212> DNA 

<213> Homo sapiens 
<400> 278 

gagacattcc tcaattgctt agacatattc tgagcctaca gcagaggaac ctccagtctc 60 

agcaccatga atcaaactgc gattctgatt tgctgcctta tctttctgac tctaagtggc 120 

attcaaggag tacctctctc tagaaccgta cgctgtacct gcatcagcat tagtaatcaa 180 

cctgttaatc caaggtcttt agaaaaactt gaaattattc ctgcaagcca attttgtcca 240 

cgtgttgaga tcattgctac aatgaaaaag aagggtgaga agagatgtct gaatccagaa 300 

tcgaaggcca tcaagaattt actgaaagca gttagcaagg aaatgtctaa aagatctcct 360 

taaaaccaga ggggagcaaa atcgatgcag tgcttccaag gatggaccac acagaggctg 420 

cctctcccat cacttcccta catggagtat atgtcaagcc ataattgttc ttagtttgca 480 

gttacactaa aaggtgacca atgatggtca ccaaatcagc tgctactact cctgtaggaa 540 

ggttaatgtt catcatccta agctattcag taataactct accctggcac tataatgtaa 600 

gctctactga ggtgctatgt tcttagtgga tgttctgacc ctgcttcaaa tatttccctc 660 

acctttccca tcttccaagg gtactaagga atctttctgc tttggggttt atcagaattc 720 

tcagaatctc aaataactaa aaggtatgca atcaaatctg ctttttaaag aatgctcttt 780 

acttcatgga cttccactgc catcctccca aggggcccaa attctttcag tggctaccta 840 

catacaattc caaacacata caggaaggta gaaatatctg aaaatgtatg tgtaagtatt 900 

cttatttaat gaaagactgt acaaagtata agtcttagat gtatatattt cctatattgt 960 

tttcagtgta catggaataa catgtaatta agtactatgt atcaatgagt aacaggaaaa 1020 

ttttaaaaat acagatagat atatgctctg catgttacat aagataaatg tgctgaatgg 1080 

ttttcaaata aaaatgaggt actctcctgg aaatattaag aaagactatc taaatgttga 1140 

aagatcaaaa ggttaataaa gtaattataa ct 1172 

<210> 279 
<211> 166 
<212> PRT 

<213> Homo sapiens 
<400> 279 

Met Lys Tyr Thr Ser Tyr Ile Leu Ala Phe Gln Leu Cys Ile Val Leu
1 5 10 15

Gly Ser Leu Gly Cys Tyr Cys Gln Asp Pro Tyr Val Lys Glu Ala Glu
20 25 30

Asn Leu Lys Lys Tyr Phe Asn Ala Gly His Ser Asp Val Ala Asp Asn
35 40 45

Gly Thr Leu Phe Leu Gly Ile Leu Lys Asn Trp Lys Glu Glu Ser Asp
50 55 60

Arg Lys Ile Met Gln Ser Gln Ile Val Ser Phe Tyr Phe Lys Leu Phe
65 70 75 80

Lys Asn Phe Lys Asp Asp Gln Ser Ile Gln Lys Ser Val Glu Thr Ile
85 90 95

Lys Glu Asp Met Asn Val Lys Phe Phe Asn Ser Asn Lys Lys Lys Arg
100 105 110

Asp Asp Phe Glu Lys Leu Thr Asn Tyr Ser Val Thr Asp Leu Asn Val
115 120 125

Gln Arg Lys Ala Ile His Glu Leu Ile Gln Val Met Ala Glu Leu Ser
130 135 140

Pro Ala Ala Lys Thr Gly Lys Arg Lys Arg Ser Gln Met Leu Phe Gln
145 150 155 160

Gly Arg Arg Ala Ser Gln
165

<210> 280 
<211> 1193 
<212> DNA 

<213> Homo sapiens 
<400> 280 

tgaagatcag ctattagaag agaaagatca gttaagtcct ttggacctga tcagcttgat 60 
acaagaacta ctgatttcaa cttctttggc ttaattctct cggaaacgat gaaatataca 120 
agttatatct tggcttttca gctctgcatc gttttgggtt ctcttggctg ttactgccag 180 
gacccatatg taaaagaagc agaaaacctt aagaaatatt ttaatgcagg tcattcagat 240 
gtagcggata atggaactct tttcttaggc attttgaaga attggaaaga ggagagtgac 300 
agaaaaataa tgcagagcca aattgtctcc ttttacttca aactttttaa aaactttaaa 360 
gatgaccaga gcatccaaaa gagtgtggag accatcaagg aagacatgaa tgtcaagttt 420 
ttcaatagca acaaaaagaa acgagatgac ttcgaaaagc tgactaatta ttcggtaact 480 
gacttgaatg tccaacgcaa agcaatacat gaactcatcc aagtgatggc tgaactgtcg 540 
ccagcagcta aaacagggaa gcgaaaaagg agtcagatgc tgtttcaagg tcgaagagca 600 
tcccagtaat ggttgtcctg cctgcaatat ttgaatttta aatctaaatc tatttattaa 660 
tatttaacat tatttatatg gggaatatat ttttagactc atcaatcaaa taagtattta 720 
taatagcaac ttttgtgtaa tgaaaatgaa tatctattaa tatatgtatt atttataatt 780 
cctatatcct gtgactgtct cacttaatcc tttgttttct gactaattag gcaaggctat 840 
gtgattacaa ggctttatct caggggccaa ctaggcagcc aacctaagca agatcccatg 900 
ggttgtgtgt ttatttcact tgatgataca atgaacactt ataagtgaag tgatactatc 960 
cagttactgc cggtttgaaa atatgcctgc aatctgagcc agtgctttaa tggcatgtca 1020
gacagaactt gaatgtgtca ggtgaccctg atgaaaacat agcatctcag gagatttcat 1080 
gcctggtgct tccaaatatt gttgacaact gtgactgtac ccaaatggaa agtaactcat 1140 
ttgttaaaat tatcaatatc taatatatat gaataaagtg taagttcaca act 1193 

<210> 281 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 281 

gcggaatcat gggcaccaat gatgctgaag actg 34 

<210> 282 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 282 

gcgggatcct taactgctgc ggcgcttcat cttg 34 

<210> 283 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 283 

gccgaattca ccccagtagt gagaaagggt cgctg 35 

<210> 284 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 284 

cgcggatcct tatgtagtct tcttttgacg agaacgttg 39 

<210> 285 
<211> 36 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 285 

gccgaattcg tacctctctc tagaaccgta cgctgt 36 

<210> 286 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 286 

gcgggatcct taaggagatc ttttagacat ttccttgcta 40 

<210> 287 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 287 

gcggaatcat gtgttactgc caggacccat atg 33 
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<210> 288 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 288 

gcgggatcct tactgggatg ctcttcgacc ttg 33

<210> 289
<211> 91
<212> PRT

<213> Homo sapiens
<400> 289

Met Lys Val Ser Ala Ala Ala Leu Ala Val Ile Leu Ile Ala Thr Ala
1 5 10 15

Leu Cys Ala Pro Ala Ser Ala Ser Pro Tyr Ser Ser Asp Thr Thr Pro
20 25 30

Cys Cys Phe Ala Tyr Ile Ala Arg Pro Leu Pro Arg Ala His Ile Lys
35 40 45

Glu Tyr Phe Tyr Thr Ser Gly Lys Cys Ser Asn Pro Ala Val Val Phe
50 55 60

Val Thr Arg Lys Asn Arg Gln Val Cys Ala Asn Pro Glu Lys Lys Trp
65 70 75 80

Val Arg Glu Tyr Ile Asn Ser Leu Glu Met Ser
85 90



<210> 290 

<211> 1237 

<212> DNA 

<213> Homo sapiens 



<400> 290 

gctgcagagg attcctgcag aggatcaaga cagcacgtgg acctcgcaca gcctctccca 60
caggtaccat gaaggtctcc gcggcagccc tcgctgtcat cctcattgct actgccctct 120
gcgctcctgc atctgcctcc ccatattcct cggacaccac accctgctgc tttgcctaca 180
ttgcccgccc actgccccgt gcccacatca aggagtattt ctacaccagt ggcaagtgct 240
ccaacccagc agtcgtcttt gtcacccgaa agaaccgcca agtgtgtgcc aacccagaga 300
agaaatgggt tcgggagtac atcaactctt tggagatgag ctaggatgga gagtccttga 360
acctgaactt acacaaattt gcctgtttct gcttgctctt gtcctagctt gggaggcttc 420
ccctcactat cctaccccac ccgctccttg aagggcccag attctaccac acagcagcag 480
ttacaaaaac cttccccagg ctggacgtgg tggctcacgc ctgtaatccc agcactttgg 540
gaggccaagg tgggtggatc acttgaggtc aggagttcga gaccagcctg gccaacatga 600
tgaaacccca tctctactaa aaatacaaaa aattagccgg gcgtggtagc gggcgcctgt 660
agtcccagct actcgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 720
gcagtgagcc gagatcgcgc cactgcactc cagcctgggc gacagagcga gactccgtct 780
caaaaaaaaa aaaaaaaaaa aaaatacaaa aattagccgg gcgtggtggc ccacgcctgt 840
aatcccagct actcgggagg ctaaggcagg aaaattgttt gaacccagga ggtggaggct 900
gcagtgagct gagattgtgc cacttcactc cagcctgggt gacaaagtga gactccgtca 960
caacaacaac aacaaaaagc ttccccaact aaagcctaga agagcttctg aggcgctgct 1020
ttgtcaaaag gaagtctcta ggttctgagc tctggctttg ccttggcttt gccagggctc 1080
tgtgaccagg aaggaagtca gcatgcctct agaggcaagg aggggaggaa cactgcactc 1140
ttaagcttcc gccgtctcaa cccctcacag gagcttactg gcaaacatga aaaatcggct 1200
taccattaaa gttctcaatg caaccataaa aaaaaaa 1237

<210> 291
<211> 33
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 291 

cgcggatccg tgcagtcctg ctccgcctac ggc 33 

<210> 292 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 292 

ccgaattctt atgctggtac ttcaactatt tcaaagtag 39 

<210> 293 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> THAP antigenic peptide 
<400> 293 

Ala Val Arg Arg Lys Asn Phe Lys Pro Thr Lys Tyr Ser Ser Ile Cys
1 5 10 15 



<210> 294 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Control peptide 
<400> 294 

Gln Val Glu Lys Leu Arg Lys Lys Leu Lys Thr Ala Gln Gln Arg Cys
1 5 10 15 



<210> 295 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 295 

ccgaattcag gatggtgcag tcctgctccg cct 33 

<210> 296 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 296 

gcgctctaga ttatgctggt acttcaacta tttcaaagta g 41 

<210> 297 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 297 

gcgtctagaa tgagtgatgg aggggctcag gactgttg 38 

<210> 298 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 298 

gggcggccgc ctatggccct ttaggggtct gtgaccgc 38 

<210> 299 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 299 

gcgctcgagc tgcacctggg ccttctctgc cctgg 35 

<210> 300 
<211> 35 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Primer 
<400> 300 

cgaagcttac tgtgctcctt ttatctctgc ccaag 35 

<210> 301 
<211> 420 
<212> DNA 

<213> Homo sapiens 
<400> 301 

cttctctgcc ctgggccaag cctgcccgag cctctctgtc ctctgcctgc ccagctggac 60 
atctctgggc ctctctggag accagtgggg tgggctgtgg gggcgtcata ttgccctggc 120 
ttggcatccc tcttgtggct gtacccctcc cagcagcccc aggactagca agtccccgag 180 
atgggggtgg ggacagtggt tgatgccaaa ggttgtgggg gcaggggcgg ggcaggagca 240 
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ggaaggtccc ctgagttccc tcaccttggg cagagataaa aggagcacag ttccaggcgg 300 

ggctgagcta gggcgtagct gtgatttcag gggcacctct ggcggctgcc gtgatttgag 360 

aatctcgggt ctcttggctg actgatcctg ggagactgtg gatgaataat gctggtgagt 420 



<210> 302 
<211> 378 
<212> PRT 

<213> Homo sapiens 



<400> 302

Met Asp Leu Gly Lys Pro Met Lys Ser Val Leu Val Val Ala Leu Leu
1 5 10 15

Val Ile Phe Gln Val Cys Leu Cys Gln Asp Glu Val Thr Asp Asp Tyr
20 25 30

Ile Gly Asp Asn Thr Thr Val Asp Tyr Thr Leu Phe Glu Ser Leu Cys
35 40 45

Ser Lys Lys Asp Val Arg Asn Phe Lys Ala Trp Phe Leu Pro Ile Met
50 55 60

Tyr Ser Ile Ile Cys Phe Val Gly Leu Leu Gly Asn Gly Leu Val Val
65 70 75 80

Leu Thr Tyr Ile Tyr Phe Lys Arg Leu Lys Thr Met Thr Asp Thr Tyr
85 90 95

Leu Leu Asn Leu Ala Val Ala Asp Ile Leu Phe Leu Leu Thr Leu Pro
100 105 110

Phe Trp Ala Tyr Ser Ala Ala Lys Ser Trp Val Phe Gly Val His Phe
115 120 125

Cys Lys Leu Ile Phe Ala Ile Tyr Lys Met Ser Phe Phe Ser Gly Met
130 135 140

Leu Leu Leu Leu Cys Ile Ser Ile Asp Arg Tyr Val Ala Ile Val Gln
145 150 155 160

Ala Val Ser Ala His Arg His Arg Ala Arg Val Leu Leu Ile Ser Lys
165 170 175

Leu Ser Cys Val Gly Ile Trp Ile Leu Ala Thr Val Leu Ser Ile Pro
180 185 190

Glu Leu Leu Tyr Ser Asp Leu Gln Arg Ser Ser Ser Glu Gln Ala Met
195 200 205

Arg Cys Ser Leu Ile Thr Glu His Val Glu Ala Phe Ile Thr Ile Gln
210 215 220

Val Ala Gln Met Val Ile Gly Phe Leu Val Pro Leu Leu Ala Met Ser
225 230 235 240

Phe Cys Tyr Leu Val Ile Ile Arg Thr Leu Leu Gln Ala Arg Asn Phe
245 250 255

Glu Arg Asn Lys Ala Ile Lys Val Ile Ile Ala Val Val Val Val Phe
260 265 270

Ile Val Phe Gln Leu Pro Tyr Asn Gly Val Val Leu Ala Gln Thr Val
275 280 285

Ala Asn Phe Asn Ile Thr Ser Ser Thr Cys Glu Leu Ser Lys Gln Leu
290 295 300

Asn Ile Ala Tyr Asp Val Thr Tyr Ser Leu Ala Cys Val Arg Cys Cys
305 310 315 320

Val Asn Pro Phe Leu Tyr Ala Phe Ile Gly Val Lys Phe Arg Asn Asp
325 330 335

Leu Phe Lys Leu Phe Lys Asp Leu Gly Cys Leu Ser Gln Glu Gln Leu
340 345 350

Arg Gln Trp Ser Ser Cys Arg His Ile Arg Arg Ser Ser Met Ser Val
355 360 365

Glu Ala Glu Thr Thr Thr Thr Phe Ser Pro
370 375
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<210> 303 

<211> 1136 

<212> DNA 

<213> Homo sapiens 

<400> 303 



atggacctgg ggaaaccaat gaaaagcgtg ctggtggtgg ctctccttgt cattttccag 60 
gtatgcctgt gtcaagatga ggtcacggac gattacatcg gagacaacac cacagtggat 120 
acactttgtt cgagtctttg tgctccaaga aggacgtgcg gaactttaaa gcctggttcc 180 
tccctatcat gtactccatc atttgtttcg tgggcctact gggcaatggg ctggtcgtgt 240 
tgacctatat ctatttcaag aggctcaaga ccatgaccga tacctacctg ctcaacctgg 300 
cggtggcaga catcctcttc ctcctgaccc ttcccttctg ggcctacagc gcggccaagt 360
cctgggtctt cggtgtccac ttttgcaagc tcatctttgc catctacaag atgagcttct 420 
tcagtggcat gctcctactt ctttgcatca gcattgaccg ctacgtggcc atcgtccagg 480 
ctgtctcagc tcaccgccac cgtgcccgcg tccttctcat cagcaagctg tcctgtgtgg 540 
gcatctggat actagccaca gtgctctcca tcccagagct cctgtacagt gacctccaga 600 
ggagcagcag tgagcaagcg atgcgatgct ctctcatcac agagcatgtg gaggccttta 660 
tcaccatcca ggtggcccag atggtgatcg gctttctggt ccccctgctg gccatgagct 720 
tctgttacct tgtcatcatc cgcaccctgc tccaggcacg caactttgag cgcaacaagg 780 
ccatcaaggt gatcatcgct gtggtcgtgg tcttcatagt cttccagctg ccctacaatg 840 
gggtggtcct ggcccagacg gtggccaact tcaacatcac cagtagcacc tgtgagctca 900 
gtaagcaact caacatcgcc tacgacgtca cctacagcct ggcctgcgtc cgctgctgcg 960 
tcaacccttt cttgtacgcc ttcatcggcg tcaagttccg caacgatctc ttcaagctct 1020 
tcaaggacct gggctgcctc agccaggagc agctccggca gtggtcttcc tgtcggcaca 1080 
tccggcgctc ctccatgagt gtggaggccg agaccaccac caccttctcc ccatag 1136 

<210> 304 
<211> 368 
<212> PRT 
<213> Homo sapiens 

<400> 304 

Met Val Leu Glu Val Ser Asp His Gln Val Leu Asn Asp Ala Glu Val
1 5 10 15

Ala Ala Leu Leu Glu Asn Phe Ser Ser Ser Tyr Asp Tyr Gly Glu Asn
20 25 30

Glu Ser Asp Ser Cys Cys Thr Ser Pro Pro Cys Pro Gln Asp Phe Ser
35 40 45

Leu Asn Phe Asp Arg Ala Phe Leu Pro Ala Leu Tyr Ser Leu Leu Phe
50 55 60

Leu Leu Gly Leu Leu Gly Asn Gly Ala Val Ala Ala Val Leu Leu Ser
65 70 75 80

Arg Arg Thr Ala Leu Ser Ser Thr Asp Thr Phe Leu Leu His Leu Ala
85 90 95

Val Ala Asp Thr Leu Leu Val Leu Thr Leu Pro Leu Trp Ala Val Asn
100 105 110

Ala Ala Val Gln Trp Val Phe Gly Ser Gly Leu Cys Lys Val Ala Gly
115 120 125

Ala Leu Phe Asn Ile Asn Phe Tyr Ala Gly Ala Leu Leu Leu Ala Cys
130 135 140

Ile Ser Phe Asp Arg Tyr Leu Asn Ile Val His Ala Thr Gln Leu Tyr
145 150 155 160

Arg Arg Gly Pro Pro Ala Arg Val Thr Leu Thr Cys Leu Ala Val Trp
165 170 175

Gly Leu Cys Leu Leu Phe Ala Leu Pro Asp Phe Ile Phe Leu Ser Ala
180 185 190

His His Asp Glu Arg Leu Asn Ala Thr His Cys Gln Tyr Asn Phe Pro
195 200 205

Gln Val Gly Arg Thr Ala Leu Arg Val Leu Gln Leu Val Ala Gly Phe
210 215 220

Leu Leu Pro Leu Leu Val Met Ala Tyr Cys Tyr Ala His Ile Leu Ala
225 230 235 240

Val Leu Leu Val Ser Arg Gly Gln Arg Arg Leu Arg Ala Met Arg Leu
245 250 255

Val Val Val Val Val Val Ala Phe Ala Leu Cys Trp Thr Pro Tyr His
260 265 270

Leu Val Val Leu Val Asp Ile Leu Met Asp Leu Gly Ala Leu Ala Arg
275 280 285

Asn Cys Gly Arg Glu Ser Arg Val Asp Val Ala Lys Ser Val Thr Ser
290 295 300

Gly Leu Gly Tyr Met His Cys Cys Leu Asn Pro Leu Leu Tyr Ala Phe
305 310 315 320

Val Gly Val Lys Phe Arg Glu Arg Met Trp Met Leu Leu Leu Arg Leu
325 330 335

Gly Cys Pro Asn Gln Arg Gly Leu Gln Arg Gln Pro Ser Ser Ser Arg
340 345 350

Arg Asp Ser Ser Trp Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu
355 360 365



<210> 305 

<211> 1670 

<212> DNA 

<213> Homo sapiens 

<400> 305 

ccaaccacaa gcaccaaagc agaggggcag gcagcacacc acccagcagc cagagcacca 60
gcccagccat ggtccttgag gtgagtgacc accaagtgct aaatgacgcc gaggttgccg 120
ccctcctgga gaacttcagc tcttcctatg actatggaga aaacgagagt gactcgtgct 180
gtacctcccc gccctgccca caggacttca gcctgaactt cgaccgggcc ttcctgccag 240
ccctctacag cctcctcttt ctgctggggc tgctgggcaa cggcgcggtg gcagccgtgc 300
tgctgagccg gcggacagcc ctgagcagca ccgacacctt cctgctccac ctagctgtag 360
cagacacgct gctggtgctg acactgccgc tctgggcagt ggacgctgcc gtccagtggg 420
tctttggctc tggcctctgc aaagtggcag gtgccctctt caacatcaac ttctacgcag 480
gagccctcct gctggcctgc atcagctttg accgctacct gaacatagtt catgccaccc 540
agctctaccg ccgggggccc ccggcccgcg tgaccctcac ctgcctggct gtctgggggc 600
tctgcctgct tttcgccctc ccagacttca tcttcctgtc ggcccaccac gacgagcgcc 660
tcaacgccac ccactgccaa tacaacttcc cacaggtggg ccgcacggct ctgcgggtgc 720
tgcagctggt ggctggcttt ctgctgcccc tgctggtcat ggcctactgc tatgcccaca 780
tcctggccgt gctgctggtt tccaggggcc agcggcgcct gcgggccatg cggctggtgg 840
tggtggtcgt ggtggccttt gccctctgct ggacccccta tcacctggtg gtgctggtgg 900
acatcctcat ggacctgggc gctttggccc gcaactgtgg ccgagaaagc agggtagacg 960
tggccaagtc ggtcacctca ggcctgggct acatgcactg ctgcctcaac ccgctgctct 1020
atgcctttgt aggggtcaag ttccgggagc ggatgtggat gctgctcttg cgcctgggct 1080
gccccaacca gagagggctc cagaggcagc catcgtcttc ccgccgggat tcatcctggt 1140
ctgagacctc agaggcctcc tactcgggct tgtgaggccg gaatccgggc tcccctttcg 1200
cccacagtct gacttccccg cattccaggc tcctccctcc ctctgccggc tctggctctc 1260
cccaatatcc tcgctcccgg gactcactgg cagccccagc accaccaggt ctcccgggaa 1320
gccaccctcc cagctctgag gactgcacca ttgctgctcc ttagctgcca agccccatcc 1380
tgccgcccga ggtggctgcc tggagcccca ctgcccttct catttggaaa ctaaaacttc 1440
atcttcccca agtgcgggga gtacaaggca tggcgtagag ggtgctgccc catgaagcca 1500
cagcccaggc ctccagctca gcagtgactg tggccatggt ccccaagacc tctatatttg 1560
ctcttttatt tttatgtcta aaatcctgct taaaactttt caataaacaa gatcgtcagg 1620
accaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1670

<210> 306
<211> 11
<212> DNA

<213> Artificial Sequence
<220>

<223> THRE consensus sequence

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<221> misc_feature 
<222> (1)...(1)
<223> N=A or T

<221> misc_feature
<222> (2)...(2)
<223> N=G, C or A

<221> misc_feature
<222> (4)...(4)
<223> N=A or G

<221> misc_feature
<222> (5)...(5)
<223> N=T, C or A

<221> misc_feature
<222> (6)...(6)
<223> N=G or T

<221> misc_feature
<222> (11)...(11)
<223> N=A, T, G or C

<400> 306
nntnnnggca n 11

<210> 307
<211> 11
<212> DNA

<213> Artificial Sequence
<220>

<223> THRE variant one

<400> 307
agtaagggca a 11

<210> 308
<211> 11
<212> DNA

<213> Artificial Sequence
<220>

<223> Mutant THRE

<400> 308
agtaatttca a 11

<210> 309
<211> 11
<212> DNA

<213> Artificial Sequence
<220>

<223> Mutant THRE

<400> 309
agtaaggtca a 11

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<210> 310 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Mutant THRE 

<400> 310 
agtaagtgca a 11

<210> 311
<211> 11
<212> DNA

<213> Artificial Sequence
<220>

<223> Mutant THRE

<400> 311
agtaagggcc a 11

<210> 312
<211> 11
<212> DNA

<213> Artificial Sequence
<220>

<223> Mutant THRE

<400> 312
agtaagggaa a 11

<210> 313
<211> 25
<212> DNA

<213> Artificial Sequence
<220>

<223> Sequence containing THRE variant
<400> 313

agcaagtaag ggcaaactac ttcat 25

<210> 314
<211> 25
<212> DNA

<213> Artificial Sequence
<220>

<223> Sequence containing THRE mutant
<400> 314

agcaagtaat ttcaaactac ttcat 25

<210> 315
<211> 319
<212> DNA

<213> Homo sapiens
<400> 315

cgtcgctggg tgcaccgcga ccacgggcag agccacgcgg cgggaggact acaactcccg 60
gcacaccccg cgccgccccg cctctactcc cagaaggccg cggggggtgg accgcctaag 120
agggcgtgcg ctcccgacat gccccgcggc gcgccattaa ccgccagatt tgaatcgcgg 180
gacccgttgg cagaggtggc ggcggcggca tgggtgcccc gacgttgccc cctgcctggc 240
agccctttct caaggaccac cgcatctcta cattcaagaa ctggcccttc ttggagggct 300
gcgcctgcac cccggagcg 319



<210> 316 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DR-5-type element 

<221> misc_feature 

<222> (7)...(7)
<223> N=A, T, G, or C

<221> misc_feature
<222> (8)...(8)
<223> N=A, T, G, or C

<221> misc_feature
<222> (9)...(9)
<223> N=A, T, G, or C

<221> misc_feature
<222> (10)...(10)
<223> N=A, T, G, or C

<400> 316

gggcaannnn ngggcac 17

<210> 317
<211> 600
<212> DNA
<213> Homo sapiens

<400> 317

tggtgccgtg cgcctgtagt cccagctact cgggaggctg aggcagaaga atgcactcca 60
gcctgggcga cagagggata ttccgtctca aacaaacaaa aaatcactcg ctgcgttttt 120
tattctgaca tggtgcagga aggtaaattc aagacaactt aggtactcag ttttagaagt 180
cgacaggaca gaattacgga aacaaattta agcgttcccc cttttagctc caaatataat 240
gtgttccaga aaggtaacca tctaggaaac tccaaggctc agaccaccac cggatgccca 300
cacttcagga gcatttatat aacttcgtgg ttatgtcaga gacgagaaaa cccattgaca 360
accaaacccc taaacccgaa catccggcgc aagccgcacg caggcgcaga tttactagcg 420
tcagagccga tggtcccggg aggtgggggt ggggtggtgg tggcctagcc acttcccata 480
atgccgcgtt ccggaagtta ttgctttcca ggggtcactc tggcttcgac tccgtcgctc 540
tcaattcgtc accaggagga agacggagct ggctgcccag cccaaaggcc catgagggga 600



<210> 318 
<211> 11 
<212> DNA 

<213> Artificial Sequence
<220>

<223> THRE-type element

<400> 318
agtgtgggca t 11

<210> 319
<211> 25
<212> DNA

<213> Artificial Sequence
<220>

<223> Sequence containing THRE mutant
<400> 319

agcaagtaag gtcaaactac ttcat 25

<210> 320
<211> 25
<212> DNA

<213> Artificial Sequence
<220>

<223> Sequence containing THRE mutant
<400> 320

agcaagtaag tgcaaactac ttcat 25

<210> 321
<211> 25
<212> DNA

<213> Artificial Sequence
<220>

<223> Sequence containing THRE mutant
<400> 321

agcaagtaag ggccaactac ttcat 25

<210> 322
<211> 25
<212> DNA

<213> Artificial Sequence
<220>

<223> Sequence containing THRE mutant
<400> 322

agcaagtaag ggaaaactac ttcat 25

<210> 323
<211> 94
<212> PRT

<213> Homo sapiens
<400> 323

Met Ser Val Lys Gly Met Ala Ile Ala Leu Ala Val Ile Leu Cys Ala
1 5 10 15

Thr Val Val Gln Gly Phe Pro Met Phe Lys Arg Gly Arg Cys Leu Cys
20 25 30

Ile Gly Pro Gly Val Lys Ala Val Lys Val Ala Asp Ile Glu Lys Ala
35 40 45

Ser Ile Met Tyr Pro Ser Asn Asn Cys Asp Lys Ile Glu Val Ile Ile
50 55 60

Thr Leu Lys Glu Asn Lys Gly Gln Arg Cys Leu Asn Pro Lys Ser Lys
65 70 75 80

Gln Ala Arg Leu Ile Ile Lys Lys Val Glu Arg Lys Asn Phe
85 90



<210> 324 
<211> 1493 
<212> DNA 

<213> Homo sapiens 
<400> 324 

ttcctttcat gttcagcatt tctactcctt ccaagaagag cagcaaagct gaagtagcag 60 
caacagcacc agcagcaaca gcaaaaaaca aacatgagtg tgaagggcat ggctatagcc 120 
ttggctgtga tattgtgtgc tacagttgtt caaggcttcc ccatgttcaa aagaggacgc 180 
tgtctttgca taggccctgg ggtaaaagca gtgaaagtgg cagatattga gaaagcctcc 240 
ataatgtacc caagtaacaa ctgtgacaaa atagaagtga ttattaccct gaaagaaaat 300 
aaaggacaac gatgcctaaa tcccaaatcg aagcaagcaa ggcttataat caaaaaagtt 360 
gaaagaaaga atttttaaaa atatcaaaac atatgaagtc ctggaaaagg gcatctgaaa 420 
aacctagaac aagtttaact gtgactactg aaatgacaag aattctacag taggaaactg 480 
agacttttct atggttttgt gactttcaac ttttgtacag ttatgtgaag gatgaaaggt 540 
gggtgaaagg accaaaaaca gaaatacagt cttcctgaat gaatgacaat cagaattcca 600 
ctgcccaaag gagtccagca attaaatgga tttctaggaa aagctacctt aagaaaggct 660 
ggttaccatc ggagtttaca aagtgctttc acgttcttac ttgttgtatt atacattcat 720 
gcatttctag gctagagaac cttctagatt tgatgcttac aactattctg ttgtgactat 780 
gagaacattt ctgtctctag aagttatctg tctgtattga tctttatgct atattactat 840 
ctgtggttac agtggagaca ttgacattat tactggagtc aagcccttat aagtcaaaag 900 
catctatgtg tcgtaaagca ttcctcaaac attttttcat gcaaatacac acttctttcc 960 
ccaaatatca tgtagcacat caatatgtag ggaaacattc ttatgcatca tttggtttgt 1020 
tttataacca attcattaaa tgtaattcat aaaatgtact atgaaaaaaa ttatacgcta 1080 
tgggatactg gcaacagtgc acatatttca taaccaaatt agcagcaccg gtcttaattt 1140 
gatgtttttc aacttttatt cattgagatg ttttgaagca attaggatat gtgtgtttac 1200 
tgtacttttt gttttgatcc gtttgtataa atgatagcaa tatcttggac acatttgaaa 1260 
tacaaaatgt ttttgtctac caaagaaaaa tgttgaaaaa taagcaaatg tatacctagc 1320 
aatcactttt actttttgta attctgtctc ttagaaaaat acataatcta atcaatttct 1380 
ttgttcatgc ctatatactg taaaatttag gtatactcaa gactagttta aagaatcaaa 1440 
gtcatttttt tctctaataa actaccacaa cctttctttt ttaaaaaaaa aaa 1493 

<210> 325 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 325 

ggggaattct tccccatgtt caaaagagga c 31 

<210> 326 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 326 

ggggatcctt aaaaattctt tctttcaac 29 

<210> 327 
<211> 35 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 327 

ccgaattccc accatgaaga aaagtggtgt tcttt 35 

<210> 328 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 328 

ccggatcctg tagtcttctt ttgacgagaa cgttg 35 

<210> 329 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 329 

cctctagacc accatggtcc ttgaggtgag tgac 34 



<210> 330 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 330 

cccgcggccg ctcacaagcc cgagtaggag gc 32

<210> 331
<211> 34
<212> DNA

<213> Artificial Sequence



<220> 

<223> Primer 
<400> 331 

ggcattcaat tgctcgagtt taaacgcggc cgcg 34 

<210> 332 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 




<400> 332 

aatccgcggc cgcgtttaaa ctcgagcaat tgaatgcc 38

<210> 333 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 333 

atgaagtgcc ttttgtactt agcctt 26

<210> 334
<211> 33
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 334

tcataaaaat taaaaactca aatataattg agg 33

<210> 335
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 335

atgggccaga ctgttaccac tc 22

<210> 336
<211> 16
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer

<400> 336
ttagggggcc tcgcgg 16

<210> 337
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Primer
<400> 337

atggtgcagt cctgctccgc 20

<210> 338 
<211> 31 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 338 

gccaattgtt atgctggtac ttcaactatt t 31 

<210> 339 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 339 

atgagtgatg gaggggctca gg 22 

<210> 340 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 340 

ggaattccta tggcccttta ggg 23 

<210> 341 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 341 

atgaccccag tagtgagaaa gggtc 25 

<210> 342 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 342 

ggaattctta tgtagtcttc ttttgacgag a 31 

<210> 343 
<211> 142 
<212> PRT 

<213> Homo sapiens 
<400> 343 

Met Gly Ala Pro Thr Leu Pro Pro Ala Trp Gln Pro Phe Leu Lys Asp
1 5 10 15

His Arg Ile Ser Thr Phe Lys Asn Trp Pro Phe Leu Glu Gly Cys Ala
20 25 30

Cys Thr Pro Glu Arg Met Ala Glu Ala Gly Phe Ile His Cys Pro Thr
35 40 45

Glu Asn Glu Pro Asp Leu Ala Gln Cys Phe Phe Cys Phe Lys Glu Leu
50 55 60

Glu Gly Trp Glu Pro Asp Asp Asp Pro Ile Glu Glu His Lys Lys His
65 70 75 80

Ser Ser Gly Cys Ala Phe Leu Ser Val Lys Lys Gln Phe Glu Glu Leu
85 90 95

Thr Leu Gly Glu Phe Leu Lys Leu Asp Arg Glu Arg Ala Lys Asn Lys
100 105 110

Ile Ala Lys Glu Thr Asn Asn Lys Lys Lys Glu Phe Glu Glu Thr Ala
115 120 125

Lys Lys Val Arg Arg Ala Ile Glu Gln Leu Ala Ala Met Asp
130 135 140



<210> 344 

<211> 1619 

<212> DNA 

<213> Homo sapiens 



<400> 344 

^^ att t9 ? atc 9 c 99 gacccgttgg cagaggtggc ggcggcggca tgggtgcccc 60 
ctggcccttc SI 99 ? agCCC " tCt -aggaccac cgcatctcta cScaa aa 120 
ccactqcccc ' ^cgcctgcac cccggagcgg atggccgagg ctggcttcat 180 

actgagaacg agccagactt ggcccagtgt ttcttctgct tcaaggagct 24 0 

q 9 9 t tr a t agatg aCgaCCCCat a^ggaacat aaaaagcatt cgtccggrtg 300 

cgctttcctt tctgtcaaga agcagtttga agaattaacc cttggtgaat ttttqaaact 360 
lalTa 9 , aga t agagCCaaga -caaaattgc aaaggaaacc aacaataaga agaalgaa^t 4 2 S 
tgaggaaact gcgaagaaag tgcgccgtgc catcgagcag ctggctgcca tggat?gagg 480 
gtgccacc^ ITtlltT gtCCCagagt ^tgcacca cttccagggt tL'ttccct 540 
caaattao^ Stt 9 ! 9 ggCCCCtta 9 caatgtctta ggaaaggaga tcaacatttt 600 
f aaa " 3gat 9 tttcaa =tg tgctcctgtt ttgtcttgaa agtggcacca gaggtgcttc 660 
tgcctgtgca gcgggtgctg ctggtaacag tggctgcttc tctctctctc ?c??t?t"? 720 
llllll 71 ""Sctgtt ttgattcccg ggcttaccag gtgagaagtg agggaggaag 780 
^ cc Ctttt g Ct agagctgaca gctttgttcg cgtgggcaga gcStccaca 840 
gtgaatgtgt ctggacctca tgttgttgag gctgtcacag tcctgagtgt ggacttgqca 900 
ggtgcctgtt gaatctgagc tgcaggttcc ttatctgtca cacccgtgcc rcctcaqaqq 960 
acagtttttt tgttgttgtg tttttttgtt tttttttttt ggtagatgca Jgactrgtgt 1020 
tTttlTtl TalTJr* g ! gtCCCtgg ctcctctact gtttaacaac arggctt^ct 080 
aoccattcta " ttcaca ^at agcacaaact acaattaaaa ctaagcacaa 1140 

999 gaaacggggt gaacttcagg tggatgagga gacagaatag 1200 
llllll a 9 c 9 tct 93 c agatactcct tttgccactg ctgtgtgatt agacaggccc 1260 

lll q t aCa l g ct gg c ^tc ctccctcaga aaaaggcagt ggcctaaatc 1320 
^!"" aaat ^ ctt ^cto gatgctgtgg gggactggct gggctgctgc aggccgtgtg 1380 
S ScScttto 0 3tCtgtCaCg "ctccacac gggggagaga c?Iag?ccgc 1440 
altaalttt^ Ifl "ctttg gaggcagcag ctcccgcagg gctgaagtct ggcgtaagat 1500 
qctqqaai^ a " c 9 cc <=tc ctccctgtca tagagctgca gggtggattg tlacagcttc 1560 
gctggaaacc tctggaggtc atctcggctg ttcctgagaa ataaaaagcc tgtcatttc 1619 

<210> 345 
<211> 202 
<212> PRT 
<213> Homo sapiens 

<400> 345 

Met Ala Thr Leu Ile Tyr Val Asp Lys Glu Asn Gly Glu Pro Gly Thr
1 5 10 15

Arg Val Val Ala Lys Asp Gly Leu Lys Leu Gly Ser Gly Pro Ser Ile
20 25 30

Lys Ala Leu Asp Gly Arg Ser Gln Val Ser Thr Pro Arg Phe Gly Lys
35 40 45

Thr Phe Asp Ala Pro Pro Ala Leu Pro Lys Ala Thr Arg Lys Ala Leu
50 55 60

Gly Thr Val Asn Arg Ala Thr Glu Lys Ser Val Lys Thr Lys Gly Pro
65 70 75 80

Leu Lys Gln Lys Gln Pro Ser Phe Ser Ala Lys Lys Met Thr Glu Lys
85 90 95

Thr Val Lys Ala Lys Ser Ser Val Pro Ala Ser Asp Asp Ala Tyr Pro
100 105 110

Glu Ile Glu Lys Phe Phe Pro Phe Asn Pro Leu Asp Phe Glu Ser Phe
115 120 125

Asp Leu Pro Glu Glu His Gln Ile Ala His Leu Pro Leu Ser Gly Val
130 135 140

Pro Leu Met Ile Leu Asp Glu Glu Arg Glu Leu Glu Lys Leu Phe Gln
145 150 155 160

Leu Gly Pro Pro Ser Pro Val Lys Met Pro Ser Pro Pro Trp Glu Ser
165 170 175

Asn Leu Leu Gln Ser Pro Ser Ser Ile Leu Ser Thr Leu Asp Val Glu
180 185 190

Leu Pro Pro Val Cys Cys Asp Ile Asp Ile
195 200



<210> 346 
<211> 727 
<212> DNA 

<213> Homo sapiens 
<400> 346 

cggcctcaga tgaatgcggc tgttaagacc tgcaataatc cagaatggct actctgatct 60 

atgttgataa ggaaaatgga gaaccaggca cccgtgtggt tgctaaggat gggctgaagc 120 

tggggtctgg accttcaatc aaagccttag atgggagatc tcaagtttca acaccacgtt 180 

ttggcaaaac gttcgatgcc ccaccagcct tacctaaagc tactagaaag gctttgggaa 240 

ctgtcaacag agctacagaa aagtctgtaa agaccaaggg acccctcaaa caaaaacagc 300 

caagcttttc tgccaaaaag atgactgaga agactgttaa agcaaaaagc tctgttcctg 360 

cctcagatga tgcctatcca gaaatagaaa aattctttcc cttcaatcct ctagactttg 420 

agagttttga cctgcctgaa gagcaccaga ttgcgcacct ccccttgagt ggagtgcctc 480 

tcatgatcct tgacgaggag agagagcttg aaaagctgtt tcagctgggc cccccttcac 540 

ctgtgaagat gccctctcca ccatgggaat ccaatctgtt gcagtctcct tcaagcattc 600 

tgtcgaccct ggatgttgaa ttgccacctg tttgctgtga catagatatt taaatttctt 660

agtgcttcag agtttgtgtg tatttgtatt aataaagcat tcttcaacag aaaaaaaaaa 720 
aaaaaaa 727 

<210> 347 
<211> 79 
<212> PRT 

<213> Homo sapiens 
<400> 347 

Met Ser His Lys Gln Ile Tyr Tyr Ser Asp Lys Tyr Asp Asp Glu Glu
1 5 10 15

Phe Glu Tyr Arg His Val Met Leu Pro Lys Asp Ile Ala Lys Leu Val
20 25 30

Pro Lys Thr His Leu Met Ser Glu Ser Glu Trp Arg Asn Leu Gly Val
35 40 45

Gln Gln Ser Gln Gly Trp Val His Tyr Met Ile His Glu Pro Glu Pro
50 55 60

His Ile Leu Leu Phe Arg Arg Pro Leu Pro Lys Lys Pro Lys Lys
65 70 75
<210> 348
<211> 717
<212> DNA
<213> Homo sapiens

<400> 348

agagcgatca tgtcgcacaa acaaatttac tattcggaca aatacgacga cgaggagttt 60
gagtatcgac atgtcatgct gcccaaggac atagccaagc tggtccctaa aacccatctg 120
atgtctgaat ctgaatggag gaatcttggc gttcagcaga gtcagggatg ggtccattat 180
atgatccatg aaccagaacc tcacatcttg ctgttccggc gcccactacc caagaaacca 240
aagaaatgaa gctggcaagc tacttttcag cctcaagctt tacacagctg tccttacttc 300
ctaacatctt tctgataaca ttattatgtt gccttcttgt ttctcacttt gatatttaaa 360
agatgttcaa tacactgttt gaatgtgctg gtaactgctt tgcttcttga gtagagccac 420
caccaccata gcccagccag atgagtgctc tgtggaccca cagcctaagc tgagtgtgac 480
cccagaagcc acgatgtgct ctgtatccag aacacacttg gcagatggag gaagcatctg 540
agtttgagac catggctgtt acagggatca tgtaaacttg ctgtttttgt tttttctgcc 600
gggtgttgta tgtgtggtga cttgcggatt tatgtttcag tgtactggaa actttccatt 660
ttattcaaga aatctgttca tgttaaaagc cttgattaaa gaggaagttt ttataat 717



<210> 349 
<211> 205 
<212> PRT 
<213> Homo sapiens 

<400> 349 

Met Ala Leu Gln Leu Ser Arg Glu Gln Gly Ile Thr Leu Arg Gly Ser
1 5 10 15

Ala Glu Ile Val Ala Glu Phe Phe Ser Phe Gly Ile Asn Ser Ile Leu
20 25 30

Tyr Gln Arg Gly Ile Tyr Pro Ser Glu Thr Phe Thr Arg Val Gln Lys
35 40 45

Tyr Gly Leu Thr Leu Leu Val Thr Thr Asp Leu Glu Leu Ile Lys Tyr
50 55 60

Leu Asn Asn Val Val Glu Gln Leu Lys Asp Trp Leu Tyr Lys Cys Ser
65 70 75 80

Val Gln Lys Leu Val Val Val Ile Ser Asn Ile Glu Ser Gly Glu Val
85 90 95

Leu Glu Arg Trp Gln Phe Asp Ile Glu Cys Asp Lys Thr Ala Lys Asp
100 105 110

Asp Ser Ala Pro Arg Glu Lys Ser Gln Lys Ala Ile Gln Asp Glu Ile
115 120 125

Arg Ser Val Ile Arg Gln Ile Thr Ala Thr Val Thr Phe Leu Pro Leu
130 135 140

Leu Glu Val Ser Cys Ser Phe Asp Leu Leu Ile Tyr Thr Asp Lys Asp
145 150 155 160

Leu Val Val Pro Glu Lys Trp Glu Glu Ser Gly Pro Gln Phe Ile Thr
165 170 175

Asn Ser Glu Glu Val Arg Leu Arg Ser Phe Thr Thr Thr Ile His Lys
180 185 190

Val Asn Ser Met Val Ala Tyr Lys Ile Pro Val Asn Asp
195 200 205



<210> 350 

<211> 1390 

<212> DNA 

<213> Homo sapiens 

<400> 350 

gggaagtgct gttggagccg ctgtggttgc tgtccgcgga gtggaagcgc gtgcttttgt 60 
ttgtgtccct ggccatggcg ctgcagctct cccgggagca gggaatcacc ctgcgcggga 120
gcgccgaaat cgtggccgag ttcttctcat tcggcatcaa cagcatttta tatcagcgtg 180
gcatatatcc atctgaaacc tttactcgag tgcagaaata cggactcacc ttgcttgtaa 240
ctactgatct tgagctcata aaatacctaa ataatgtggt ggaacaactg aaagattggt 300
tatacaagtg ttcagttcag aaactggttg tagttatctc aaatattgaa agtggtgagg 360
tcctggaaag atggcagttt gatattgagt gtgacaagac tgcaaaagat gacagtgcac 420
ccagagaaaa gtctcagaaa gctatccagg atgaaatccg ttcagtgatc agacagatca 480
cagctacggt gacatttctg ccactgttgg aagtttcttg ttcatttgat ctgctgattt 540
atacagacaa agatttggtt gtacctgaaa aatgggaaga gtcgggacca cagtttatta 600
ccaattctga ggaagtccgc cttcgttcat ttactactac aatccacaaa gtaaatagca 660
tggtggccta caaaattcct gtcaatgact gaggatgaca tgaggaaaat aatgtaattg 720
taattttgaa atgtggtttt cctgaaatca ggtcatctat agttgatatg ttttatttca 780
ttggttaatt tttacatgga gaaaaccaaa atgatactta ctgaactgtg tgtaattgtt 840
cctttatttt tttggtacct atttgactta ccatggagtt aacatcatga atttattgca 900
cattgttcaa aaggaaccag gaggtttttt tgtcaacatt gtgatgtata ttcctttgaa 960
gatagtaact gtagatggaa aaacttgtgc tataaagcta gatgctttcc taaatcagat 1020
gttttggtca agtagtttga ctcagtatag gtagggagat atttaagtat aaaatacaac 1080
aaaggaagtc taaatattca gaatctttgt taaggtcctg aaagtaactc ataatctata 1140
aacaatgaaa tattgctgta tagctccttt tgaccttcat ttcatgtata gttttcccta 1200
ttgaatcagt ttccaattat ttgactttaa tttatgtaac ttgaacctat gaagcaatgg 1260
atatttgtac tgtttaatgt tctgtgatac agaactctta aaaatgtttt ttcatgtgtt 1320
ttataaaatc aagttttaag tgaaagtgag gaaataaagt taagtttgtt ttaaaaaaaa 1380
aaaaaaaaaa 1390



<210> 351 
<211> 823 
<212> PRT 

<213> Homo sapiens 



<400> 351 

Met Gly Lys Lys Arg Thr Lys Gly Lys Thr Val Pro Ile Asp Asp Ser
1 5 10 15

Ser Glu Thr Leu Glu Pro Val Cys Arg His Ile Arg Lys Gly Leu Glu
20 25 30

Gln Gly Asn Leu Lys Lys Ala Leu Val Asn Val Glu Trp Asn Ile Cys
35 40 45

Gln Asp Cys Lys Thr Asp Asn Lys Val Lys Asp Lys Ala Glu Glu Glu
50 55 60

Thr Glu Glu Lys Pro Ser Val Trp Leu Cys Leu Lys Cys Gly His Gln
65 70 75 80

Gly Cys Gly Arg Asn Ser Gln Glu Gln His Ala Leu Lys His Tyr Leu
85 90 95

Thr Pro Arg Ser Glu Pro His Cys Leu Val Leu Ser Leu Asp Asn Trp
100 105 110

Ser Val Trp Cys Tyr Val Cys Asp Asn Glu Val Gln Tyr Cys Ser Ser
115 120 125

Asn Gln Leu Gly Gln Val Val Asp Tyr Val Arg Lys Gln Ala Ser Ile
130 135 140

Thr Thr Pro Lys Pro Ala Glu Lys Asp Asn Gly Asn Ile Glu Leu Glu
145 150 155 160

Asn Lys Lys Leu Glu Lys Glu Ser Lys Asn Glu Gln Glu Arg Glu Lys
165 170 175

Lys Glu Asn Met Ala Lys Glu Asn Pro Pro Met Asn Ser Pro Cys Gln
180 185 190

Ile Thr Val Lys Gly Leu Ser Asn Leu Gly Asn Thr Cys Phe Phe Asn
195 200 205

Ala Val Met Gln Asn Leu Ser Gln Thr Pro Val Leu Arg Glu Leu Leu
210 215 220

Lys Glu Val Lys Met Ser Gly Thr Ile Val Lys Ile Glu Pro Pro Asp
225 230 235 240

Leu Ala Leu Thr Glu Pro Leu Glu Ile Asn Leu Glu Pro Pro Gly Pro
245 250 255

Leu Thr Leu Ala Met Ser Gln Phe Leu Asn Glu Met Gln Glu Thr Lys
260 265 270

Lys Gly Val Val Thr Pro Lys Glu Leu Phe Ser Gln Val Cys Lys Lys
275 280 285

Ala Val Arg Phe Lys Gly Tyr Gln Gln Gln Asp Ser Gln Glu Leu Leu
290 295 300

Arg Tyr Leu Leu Asp Gly Met Arg Ala Glu Glu His Gln Arg Val Ser
305 310 315 320

Lys Gly Ile Leu Lys Ala Phe Gly Asn Ser Thr Glu Lys Leu Asp Glu
325 330 335

Glu Leu Lys Asn Lys Val Lys Asp Tyr Glu Lys Lys Lys Ser Met Pro
340 345 350

Ser Phe Val Asp Arg Ile Phe Gly Gly Glu Leu Thr Ser Met Ile Met
355 360 365

Cys Asp Gln Cys Arg Thr Val Ser Leu Val His Glu Ser Phe Leu Asp
370 375 380

Leu Ser Leu Pro Val Leu Asp Asp Gln Ser Gly Lys Lys Ser Val Asn
385 390 395 400

Asp Lys Asn Leu Lys Lys Thr Val Glu Asp Glu Asp Gln Asp Ser Glu
405 410 415

Glu Glu Lys Asp Asn Asp Ser Tyr Ile Lys Glu Arg Ser Asp Ile Pro
420 425 430

Ser Gly Thr Ser Lys His Leu Gln Lys Lys Ala Lys Lys Gln Ala Lys
435 440 445

Lys Gln Ala Lys Asn Gln Arg Arg Gln Gln Lys Ile Gln Gly Lys Val
450 455 460

Leu His Leu Asn Asp Ile Cys Thr Ile Asp His Pro Glu Asp Ser Glu
465 470 475 480

Tyr Glu Ala Glu Met Ser Leu Gln Gly Glu Val Asn Ile Lys Ser Asn
485 490 495

His Ile Ser Gln Glu Gly Val Met His Lys Glu Tyr Cys Val Asn Gln
500 505 510

Lys Asp Leu Asn Gly Gln Ala Lys Met Ile Glu Ser Val Thr Asp Asn
515 520 525

Gln Lys Ser Thr Glu Glu Val Asp Met Lys Asn Ile Asn Met Asp Asn
530 535 540

Asp Leu Glu Val Leu Thr Ser Ser Pro Thr Arg Asn Leu Asn Gly Ala
545 550 555 560

Tyr Leu Thr Glu Gly Ser Asn Gly Glu Val Asp Ile Ser Asn Gly Phe
565 570 575

Lys Asn Leu Asn Leu Asn Ala Ala Leu His Pro Asp Glu Ile Asn Ile
580 585 590

Glu Ile Leu Asn Asp Ser His Thr Pro Gly Thr Lys Val Tyr Glu Val
595 600 605

Val Asn Glu Asp Pro Glu Thr Ala Phe Cys Thr Leu Ala Asn Arg Glu
610 615 620

Val Phe Asn Thr Asp Glu Cys Ser Ile Gln His Cys Leu Tyr Gln Phe
625 630 635 640

Thr Arg Asn Glu Lys Leu Arg Asp Ala Asn Lys Leu Leu Cys Glu Val
645 650 655

Cys Thr Arg Arg Gln Cys Asn Gly Pro Lys Ala Asn Ile Lys Gly Glu
660 665 670

Arg Lys His Val Tyr Thr Asn Ala Lys Lys Gln Met Leu Ile Ser Leu
675 680 685

Ala Pro Pro Val Leu Thr Leu His Leu Lys Arg Phe Gln Gln Ala Gly
690 695 700

Phe Asn Leu Arg Lys Val Asn Lys His Ile Lys Phe Pro Glu Ile Leu
705 710 715 720

Asp Leu Ala Pro Phe Cys Thr Leu Lys Cys Lys Asn Val Ala Glu Glu
725 730 735

Asn Thr Arg Val Leu Tyr Ser Leu Tyr Gly Val Val Glu His Ser Gly
740 745 750

Thr Met Arg Ser Gly His Tyr Thr Ala Tyr Ala Lys Ala Arg Thr Ala
755 760 765

Asn Ser His Leu Ser Asn Leu Val Leu His Gly Asp Ile Pro Gln Asp
770 775 780

Phe Glu Met Glu Ser Lys Gly Gln Trp Phe His Ile Ser Asp Thr His
785 790 795 800

Val Gln Ala Val Pro Thr Thr Lys Val Leu Asn Ser Gln Ala Tyr Leu
805 810 815

Leu Phe Tyr Glu Arg Ile Leu
820



<210> 352 

<211> 2903 

<212> DNA 

<213> Homo sapiens 



<400> 352 

ctcaattcgt caccaggagg aagacggagc tggctgccca gcccaaaggc ccatgagggg 60
atgcagttat gggctctgtc gccgtggatt gttattttgt gtcagtaagt aatccataaa 120
gtgccaacat gggaaagaaa cggacaaagg gaaaaactgt tccaatcgat gattcctctg 180
aaactttaga acctgtgtgc agacacatta gaaaaggatt ggaacaaggt aatttgaaaa 240
aggctttagt gaatgtggaa tggaatatct gccaagactg taagactgac aataaagtga 300
aagataaagc tgaagaagaa acagaagaaa agccttcagt ttggctgtgt cttaaatgtg 360
gccatcaggg ctgtggcaga aattctcagg agcagcatgc cttgaagcac tatctgacgc 420
caagatctga acctcactgt ctggttctta gtttggacaa ctggagtgta tggtgttacg 480
tatgtgataa tgaggtccag tattgtagtt caaaccagtt gggtcaagtg gttgattatg 540
tcagaaaaca agccagcatt acaactccaa agccagcaga gaaagataat ggaaatattg 600
aacttgaaaa taaaaaatta gaaaaagaga gtaagaatga acaagagaga gaaaagaagg 660
aaaacatggc taaagagaat cctcccatga attctccttg ccaaataacc gtgaaaggac 720
tcagtaattt gggaaacaca tgtttcttca atgcagttat gcagaacttg tcacaaacac 780
cagtgcttag agaactacta aaagaagtga aaatgtctgg aacaattgta aaaattgaac 840
cacctgattt ggcattaaca gaaccattag aaataaacct tgagcctcca ggccctctta 900
ctttagccat gagccagttt cttaatgaga tgcaagagac caaaaagggg gttgtgacac 960
cgaaagaact cttttctcag gtctgtaaaa aagcagtgcg gtttaaaggc tatcagcagc 1020
aagacagcca ggagctgctt cgctacttat tggatgggat gagagcagaa gaacaccaaa 1080
gagtgagtaa aggaatactt aaagcatttg gtaattctac tgaaaagttg gatgaagaac 1140
taaaaaataa agttaaagat tatgagaaga aaaaatcaat gccaagtttt gttgaccgca 1200
tctttggtgg tgaactaact agtatgatca tgtgtgatca atgcagaact gtctccttgg 1260
ttcatgaatc tttccttgat ttgtccctcc cagttttaga tgatcagagt ggtaagaaaa 1320
gtgtaaatga taaaaatctg aaaaagacag tggaggatga agatcaagat agtgaggaag 1380
aaaaagataa cgacagttac ataaaagaga gaagtgatat tccttctgga acaagtaagc 1440
acttacagaa aaaagcaaag aaacaagcca aaaagcaagc caagaaccaa cgaagacaac 1500
aaaaaattca aggaaaagtt cttcatttaa atgatatttg tactattgac catcctgaag 1560
acagtgaata tgaagctgaa atgtcacttc aaggagaagt aaatattaaa tccaaccata 1620
tttcacaaga gggtgttatg cataaagaat attgtgtcaa ccagaaagat ttgaatggcc 1680
aagcaaaaat gatcgaaagt gtaactgaca atcaaaaatc cacagaggaa gtagatatga 1740
aaaatatcaa catggataat gatctggagg ttttaacatc ttctcccact aggaatttaa 1800
atggtgccta cctaacggaa gggagcaatg gagaagtgga catttccaat ggtttcaaaa 1860
acctaaattt gaatgctgct cttcatcctg atgaaataaa tatagagatt ctgaatgata 1920
gtcatactcc tggaacaaag gtgtatgagg ttgtaaatga agatccagaa actgctttct 1980
gtactcttgc aaacagggaa gttttcaata ctgatgagtg ttcaatccaa cattgtttat 2040
atcagttcac ccgtaatgag aaacttcgag atgcgaataa actgctttgt gaagtatgca 2100
cacggagaca gtgtaatgga ccaaaggcaa atataaaagg tgaaaggaag catgtttaca 2160
ccaatgccaa aaagcagatg ctaatttctc ttgctcctcc tgttcttact cttcatttaa 2220
agagatttca gcaggctggt tttaacctac gcaaagttaa caaacacata aagtttccgg 2280
aaatcttaga tttggctcct ttttgcaccc ttaaatgtaa gaatgttgca gaagaaaata 2340
caagggtact ctattcctta tatggagttg ttgaacacag tggtactatg aggtcggggc 2400
attacactgc ctatgccaag gcaagaaccg caaatagtca tctctctaat cttgttcttc 2460
acggtgatat tccacaagat tttgaaatgg aatcaaaagg gcagtggttt cacatcagcg 2520
acacacatgt gcaagctgtg cctacaacta aagtactaaa ctcacaagcg tacctcctat 2580
tttatgagag aatactgtaa taatatcaaa agcacttttt ctggaaacac atttatggct 2640
tttataatgg ctgaaataac gataaaaaaa gactaattaa aatcatgttc acttaacatt 2700
aaatacatgc cagaagaaat catgtttatt taaatattga agggaaaaat acctaaaaat 2760
gtacaaaggt tttatattgt catagtggtt tttattcctg ctttgtttct ggaaaggaaa 2820
tcctgaatta cttaagtact ttgtgtttaa tatatctggg tgatggatca caacacatca 2880
ataaactgac ttaccctaaa atc 2903

<210> 353
<211> 724
<212> PRT
<213> Homo sapiens

<400> 353

Met Ser Phe Pro Lys Ala Pro Leu Lys Arg Phe Asn Asp Pro Ser Gly
1 5 10 15

Cys Ala Pro Ser Pro Gly Ala Tyr Asp Val Lys Thr Leu Glu Val Leu
20 25 30

Lys Gly Pro Val Ser Phe Gln Lys Ser Gln Arg Phe Lys Gln Gln Lys
35 40 45

Glu Ser Lys Gln Asn Leu Asn Val Asp Lys Asp Thr Thr Leu Pro Ala
50 55 60

Ser Ala Arg Lys Val Lys Ser Ser Glu Ser Lys Glu Ser Gln Lys Asn
65 70 75 80

Asp Lys Asp Leu Lys Ile Leu Glu Lys Glu Ile Arg Val Leu Leu Gln
85 90 95

Glu Arg Gly Ala Gln Asp Ser Arg Ile Gln Asp Leu Glu Thr Glu Leu
100 105 110

Glu Lys Met Glu Ala Arg Leu Asn Ala Ala Leu Arg Glu Lys Thr Ser
115 120 125

Leu Ser Ala Asn Asn Ala Thr Leu Glu Lys Gln Leu Ile Glu Leu Thr
130 135 140

Arg Thr Asn Glu Leu Leu Lys Ser Lys Phe Ser Glu Asn Gly Asn Gln
145 150 155 160

Lys Asn Leu Arg Ile Leu Ser Leu Glu Leu Met Lys Leu Arg Asn Lys
165 170 175

Arg Glu Thr Lys Met Arg Gly Met Met Ala Lys Gln Glu Gly Met Glu
180 185 190

Met Lys Leu Gln Val Thr Gln Arg Ser Leu Glu Glu Ser Gln Gly Lys
195 200 205

Ile Ala Gln Leu Glu Gly Lys Leu Val Ser Ile Glu Lys Glu Lys Ile
210 215 220

Asp Glu Lys Ser Glu Thr Glu Lys Leu Leu Glu Tyr Ile Glu Glu Ile
225 230 235 240

Ser Cys Ala Ser Asp Gln Val Glu Lys Tyr Lys Leu Asp Ile Ala Gln
245 250 255

Leu Glu Glu Asn Leu Lys Glu Lys Asn Asp Glu Ile Leu Ser Leu Lys
260 265 270

Gln Ser Leu Glu Glu Asn Ile Val Ile Leu Ser Lys Gln Val Glu Asp
275 280 285

Leu Asn Val Lys Cys Gln Leu Leu Glu Lys Glu Lys Glu Asp His Val
290 295 300

Asn Arg Asn Arg Glu His Asn Glu Asn Leu Asn Ala Glu Met Gln Asn
305 310 315 320

Leu Lys Gln Lys Phe Ile Leu Glu Gln Gln Glu Arg Glu Lys Leu Gln
325 330 335

Gln Lys Glu Leu Gln Ile Asp Ser Leu Leu Gln Gln Glu Lys Glu Leu
340 345 350

Ser Ser Ser Leu His Gln Lys Leu Cys Ser Phe Gln Glu Glu Met Val
355 360 365

Lys Glu Lys Asn Leu Phe Glu Glu Glu Leu Lys Gln Thr Leu Asp Glu
370 375 380

Leu Asp Lys Leu Gln Gln Lys Glu Glu Gln Ala Glu Arg Leu Val Lys
385 390 395 400

Gln Leu Glu Glu Glu Ala Lys Ser Arg Ala Glu Glu Leu Lys Leu Leu
405 410 415

Glu Glu Lys Leu Lys Gly Lys Glu Ala Glu Leu Glu Lys Ser Ser Ala
420 425 430

Ala His Thr Gln Ala Thr Leu Leu Leu Gln Glu Lys Tyr Asp Ser Met
435 440 445

Val Gln Ser Leu Glu Asp Val Thr Ala Gln Phe Glu Ser Tyr Lys Ala
450 455 460

Leu Thr Ala Ser Glu Ile Glu Asp Leu Lys Leu Glu Asn Ser Ser Leu
465 470 475 480

Gln Glu Lys Ala Ala Lys Ala Gly Lys Asn Ala Glu Asp Val Gln His
485 490 495

Gln Ile Leu Ala Thr Glu Ser Ser Asn Gln Glu Tyr Val Arg Met Leu
500 505 510

Leu Asp Leu Gln Thr Lys Ser Ala Leu Lys Glu Thr Glu Ile Lys Glu
515 520 525

Ile Thr Val Ser Phe Leu Gln Lys Ile Thr Asp Leu Gln Asn Gln Leu
530 535 540

Lys Gln Gln Glu Glu Asp Phe Arg Lys Gln Leu Glu Asp Glu Glu Gly
545 550 555 560

Arg Lys Ala Glu Lys Glu Asn Thr Thr Ala Glu Leu Thr Glu Glu Ile
565 570 575

Asn Lys Trp Arg Leu Leu Tyr Glu Glu Leu Tyr Asn Lys Thr Lys Pro
580 585 590

Phe Gln Leu Gln Leu Asp Ala Phe Glu Val Glu Lys Gln Ala Leu Leu
595 600 605

Asn Glu His Gly Ala Ala Gln Glu Gln Leu Asn Lys Ile Arg Asp Ser
610 615 620

Tyr Ala Lys Leu Leu Gly His Gln Asn Leu Lys Gln Lys Ile Lys His
625 630 635 640

Val Val Lys Leu Lys Asp Glu Asn Ser Gln Leu Lys Ser Glu Val Ser
645 650 655

Lys Leu Arg Cys Gln Leu Ala Lys Lys Lys Gln Ser Glu Thr Lys Leu
660 665 670

Gln Glu Glu Leu Asn Lys Val Leu Gly Ile Lys His Phe Asp Pro Ser
675 680 685

Lys Ala Phe His His Glu Ser Lys Glu Asn Phe Ala Leu Lys Thr Pro
690 695 700

Leu Lys Glu Gly Asn Thr Asn Cys Tyr Arg Ala Pro Met Glu Cys Gln
705 710 715 720

Glu Ser Trp Lys



























<210> 354 

<211> 3002 

<212> DNA 

<213> Homo sapiens 

<400> 354 

gccagtcacc ttcagtttct ggagctggcc gtcaacatgt cctttcctaa ggcgcccttg 60
aaacgattca atgacccttc tggttgtgca ccatctccag gtgcttatga tgttaaaact 120
ttagaagtat tgaaaggacc agtatccttt cagaaatcac aaagatttaa acaacaaaaa 180
gaatctaaac aaaatcttaa tgttgacaaa gatactacct tgcctgcttc agctagaaaa 240
gttaagtctt cggaatcaaa ggaatctcaa aagaatgata aagatttgaa gatattagag 300
aaagagattc gtgttcttct acaggaacgt ggtgcccagg acagccggat ccaggatctg 360
gaaactgagt tggaaaagat ggaagcaagg ctaaatgctg cactaaggga aaaaacatct 420
ctctctgcaa ataatgctac actggaaaaa caacttattg aattgaccag gactaatgaa 480
ctactaaaat ctaagttttc tgaaaatggt aaccagaaga atttgagaat tctaagcttg 540
gagttgatga aacttagaaa caaaagagaa acaaagatga ggggtatgat ggctaagcaa 600
gaaggcatgg agatgaagct gcaggtcacc caaaggagtc tcgaagagtc tcaagggaaa 660
atagcccaac tggagggaaa acttgtttca atagagaaag aaaagattga tgaaaaatct 720
gaaacagaaa aactcttgga atacatcgaa gaaattagtt gtgcttcaga tcaagtggaa 780
aaatacaagc tagatattgc ccagttagaa gaaaatttga aagagaagaa tgatgaaatt 840
ttaagcctta agcagtctct tgaggagaat attgttatat tatctaaaca agtagaagat 900
ctaaatgtga aatgtcagct gcttgaaaaa gaaaaagaag accatgtcaa caggaataga 960
gaacacaacg aaaatctaaa tgcagagatg caaaacctaa aacagaagtt tattcttgaa 1020
caacaggaac gtgaaaagct tcaacaaaaa gaattacaaa ttgattcact tctgcaacaa 1080
gagaaagaat tatcttcgag tcttcatcag aagctctgtt cttttcaaga ggaaatggtt 1140
aaagagaaga atctgtttga ggaagaatta aagcaaacac tggatgagct tgataaatta 1200
cagcaaaagg aggaacaagc tgaaaggctg gtcaagcaat tggaagagga agcaaaatct 1260
agagctgaag aattaaaact cctagaagaa aagctgaaag ggaaggaggc tgaactggag 1320
aaaagtagtg ctgctcatac ccaggccacc ctgcttttgc aggaaaagta tgacagtatg 1380
gtgcaaagcc ttgaagatgt tactgctcaa tttgaaagct ataaagcgtt aacagccagt 1440
gagatagaag atcttaagct ggagaactca tcattacagg aaaaagcggc caaggctggg 1500
aaaaatgcag aggatgttca gcatcagatt ttggcaactg agagctcaaa tcaagaatat 1560
gtaaggatgc ttctagatct gcagaccaag tcagcactaa aggaaacaga aattaaagaa 1620
atcacagttt cttttcttca aaaaataact gatttgcaga accaactcaa gcaacaggag 1680
gaagacttta gaaaacagct ggaagatgaa gaaggaagaa aagctgaaaa agaaaataca 1740
acagcagaat taactgaaga aattaacaag tggcgtctcc tctatgaaga actatataat 1800
aaaacaaaac cttttcagct acaactagat gcttttgaag tagaaaaaca ggcattgttg 1860
aatgaacatg gtgcagctca ggaacagcta aataaaataa gagattcata tgctaaatta 1920
ttgggtcatc agaatttgaa acaaaaaatc aagcatgttg tgaagttgaa agatgaaaat 1980
agccaactca aatcggaagt atcaaaactc cgctgtcagc ttgctaaaaa aaaacaaagt 2040
gagacaaaac ttcaagagga attgaataaa gttctaggta tcaaacactt tgatccttca 2100
aaggcttttc atcatgaaag taaagaaaat tttgccctga agaccccatt aaaagaaggc 2160
aatacaaact gttaccgagc tcctatggag tgtcaagaat catggaagta aacatctgag 2220
aaacctgttg aagattattt cattcgtctt gttgttattg atgttgctgt tattatattt 2280
gacatgggta ttttataatg ttgtatttaa ttttaactgc caatccttaa atatgtgaaa 2340
ggaacatttt ttaccaaagt gtcttttgac attttatttt ttcttgcaaa tacctcctcc 2400
ctaatgctca cctttatcac ctcattctga accctttcgc tggctttcca gcttagaatg 2460
catctcatca acttaaaagt cagtatcata ttattatcct cctgttctga aaccttagtt 2520
tcaagagtct aaaccccaga ttcttcagct tgatcctgga ggtcttttct agtctgagct 2580
tctttagcta ggctaaaaca ccttggcttg ttattgcctc tactttgatt ctgataatgc 2640
tcacttggtc ctacctatta tccttctact tgtccagttc aaataagaaa taaggacaag 2700
cctaacttca tagaaacctc tctattttta atcagttgtt taataattta caggttctta 2760
ggctccatcc tgtttgtatg aaattataat ctgtggattg gcctttaagc ctgcattctt 2820
aacaaactct tcagttaatt cttagataca ctaaaaatct gagaaactct acatgtaact 2880
atttcttcag agtttgtcat atactgcttg tcatctgcat gtctactcag catttgatta 2940
acatttgtgt aatatgaaat aaaattacac agtaagtcat ttaaccaaaa aaaaaaaaaa 3000
aa 3002



<210> 355 
<211> 846 
<212> PRT 

<213> Homo sapiens 
<400> 355 

Met Ser Ser Ser His Phe Ala Ser Arg His Arg Lys Asp He Ser Thr 
1 5 10 15 

Glu Met He Arg Thr Lys He Ala His Arg Lys Ser Leu Ser Gin Lys 
20 ' 25 30 

Glu Asn Arg His Lys Glu Tyr Glu Arg Asn Arg His Phe Gly Leu Lys 
35 40 45 

Asp Val Asn He Pro Thr Leu Glu Gly Arg He Leu Val Glu Leu Asp 
, 50 55 60 

Glu Thr Ser Gin Glu Leu Val Pro Glu Lys Thr Asn Val Lys Pro Arg 

65 70 75 80 



-125- 



BNSDOCID: <WO__2004055050A2_I_> 



WO 2004/055050 



PCT/IB2003/006434 



Ala Met Lys Thr 

Tyr Lys Glu Glu 
100 

Ala Lys Arg Gly 
115 

Cys Phe Leu Leu 
130 

Ala lie Pro Ser 
145 

Met Glu Gin Thr 

Pro Gly Pro Arg 
180 

Lys Val Val Gin 
195 

Ala Thr Gin Ala 

210 

Ala Arg Lys Pro 

225 

Lys Val Pro Ser 

Asp Lys Gly lie 
260 

Ser Gin Thr Asn 
275 

Lys Met Glu Asn 
290 

Asn Ser Phe Ala 
305 

Lys Thr Tyr Gin 

Leu Thr Pro Ser 
340 

Ser Gin Ala Thr 

355 

Thr Lys Thr lie 
370 

Pro Leu Thr Val 
385 

Thr Thr Lys Asn 

Glu Arg Asn Glu 
420 

Phe Arg Asn lie 
435 

Phe Glu Trp Asp 
4 50 

Asp Leu lie Arg 
465 

Arg Phe Lys Gin 

Gly lie Lys Glu 
500 

Val Ser Phe Gin 
515 

Lys Leu Glu Glu 
530 

Met Asn Lys Asn 
545 

Lys Pro Lys Gin 



He Leu Gly Asp 
85 

Lys Gin Leu Gin 

He Phe Lys Val 
120 

Ser Asn Gin Asn 
135 

Ser Val Arg He 
150 

Lys He Asp Asn 
165 

Gin Thr Ser Glu 

Pro Val Met Pro 
200 

Ala Lys Gin Val 
215 

Val Thr Arg Ala 
230 

Lys Gly Arg Pro 
245 

Ser Cys Lys Val 

Ala Thr Ser Gly 
280 

Leu Pro Glu He 
295 

Pro Lys Asp Phe 
310 

Val Thr Pro Met 
325 

Tyr Thr Trp Thr 

Lys Glu He Leu 
360 

Gin Gin Asp Ser 
375 

Trp His Glu Glu 
390 

Leu Asn Gly Leu 
405 

Gly Arg He Ala 

Leu Gin Ser Glu 
440 

Arg Lys Leu Glu 
455 

Thr Ala Val Gly 
470 

Phe Glu Gly Leu 
485 

Thr Thr Cys Thr 

He Glu Asp Val 
520 

Ser Gly Trp Gin 
535 

Val Phe Arg Lys 
550 

Asp Asp Ala Gly 



Gin 


Arg 


Lys 


(j±n 




y U 






Lys 


Leu 


Lys 


blU 


105 








Gly 


Arg 


Tyr 


Arg 


Ala 


val 


Lys 


Ala 








140 


Thr 


Arg 


Ser 


Lys 






ice 
lDO 




Glu 


Ser 


Asp 


Val 




170 






Lys 


Lys 


Val 


Ser 


185 








Thr 


Ser 


Leu 


Arg 


Pro 


Arg 


Thr 


Val 








220 


Ala 


Asn 


Glu 


Asn 






235 




Ala 


Lys 


Asn 


Val 




250 






Asp 


Ser 


Glu 


Glu 


265 








Met 


Asn 


Pro 


Asp 


Asn 


Thr 


Ala 


Lys 








300 


Met 


Phe 


Gin 


Pro 






315 




Thr 


Pro 


Arg 


Ser 




330 






Pro 


Leu 


Lys 


Thr 


34 5 








Ala 


Gin 


Lys 


Cys 


Asn 


Lys 


Leu 


Pro 








380 


His 


Val 


Leu 


Asn 






one 

395 




Pro 


He 


Lys 


Glu 




410 






Gin 


Pro 


His 


His 


4 ZD 








Thr 


Glu 


Lys 


Leu 


Leu 


Asp 


He 


Pro 








460 


Gin 


Thr 


Arg 


Leu 






475 




Val 


Asp 


Asp 


Cys 




490 






Asp 


Leu Asp 


Gly 


DUD 








He 


His 


Lys 


Phe 


Val 


Asn 


Asn 


Asn 








540 


Lys 


Val 


Val 


Ser 






555 




Arg 


He 


Ala 


Ala 



Met Leu Gin Lys 
95 

Gin Arg Glu Lys 
110 

Pro Asp Met Pro 
125 

Glu Pro Lys Lys 

Ala Lys Asp Gin 
160 

Arg Ala lie Arg 
175 

Asp Lys Glu Lys 
190 

Met Thr Arg Ser 
205 

Ser Ser Thr Thr 

Glu Pro Glu Gly 
240 

Glu Thr Lys Pro 
255 

Asn Thr Leu Asn 
270 

Gly Val Leu Ser 
285 

lie Lys Gly Lys 

Leu Asp Gly Leu 
320 

Ala Asn Ala Phe 

. 335 

Glu Val Asp Glu 
350 

Lys Thr Tyr Ser 

365 

Cys Pro Leu Gly 

Lys Asn Glu Ala 
400 

Val Pro Ser Leu 
415 

Gly Val Pro Tyr 
4 30 

Thr Ser His Cys 
445 

Asp Asp Ala Lys 

Leu Met Lys Glu 
480 

Glu Tyr Lys Arg 
4 95 

Phe Trp Asp Met 
510 

Asn Asn Leu He 
525 

Met Asn His Asn 

Gly He Ala Ser 
560 

Arg Asn Arg Leu 
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Ala Ala He Lys 
580 

Ala Glu Thr Ala 
595 

Phe Asp Ala Gly 
610 

Gly Leu Ser Val 
625 

Lys Ser Val Asn 

Pro Gin Gin Thr 
660 

Ser Glu His Val 
675 

Ser Ser He Glu 
690 

Glu Asn His Val 
705 

Ser Glu Arg Met 

He Asn Thr Asn 
740 

Glu Leu Asn Ser 
755 

Glu Lys Asn Thr 
770 

Lys He Ser Gin 
785 

Cys His Leu Leu 

Gin Leu Glu Arg 
820 

Gly Asn Leu He 
835 



565 

Asn Ala Met Arg 

Val Ser Val He 
600 

Phe Phe Arg Val 
615 

Ser Ser Glu Gly 
630 

Lys Ala Val Ser 
645 

Thr Ser Pro Glu 

Lys Lys Thr Leu 
680 

Asp Ala Gin Cys 
695 

Val Asn Lys Thr 
710 

Ser Leu Pro Leu 
725 

Lys Lys Glu Gly 

Ser He Thr Ser 
760 

Ala Ser Gin Asn 
775 

Ser Glu Leu Phe 
790 

Asp Ser Pro Gly 
805 

Arg His Gin Glu 

Thr Phe Ser Pro 
840 



570 

Glu Arg He Arg 
585 

Pro Lys Glu Val 

Glu Ser Pro Val 
620 

Pro Ser Gln Arg
635

Gln Ser Arg Asn
650

Asn Ala Gly Pro
665

Phe Leu Ser Ile

Pro Gly Leu Pro 
700 

Asp Leu Lys Val 
715 

Leu Ala Gly Gly 
730 

Ile Ser Asp Val
745

Gln Asp Val Leu

Ser Ile Leu Glu
780 

Asp Asn Lys Ser 
795 

Leu Asn Cys Ser 
810 

His Ala Arg His 
825 

Leu Gln Pro Gly



575 

Gln Glu Glu Cys
590

Asp Lys Ile Val
605 

Lys Leu Phe Ser 

Leu Gly Thr Pro 
640 

Glu Met Gly Ile
655

Gln Asn Thr Lys
670

Pro Glu Ser Arg
685

Asp Leu Ile Glu

Asp Cys Leu Ser 
720 

Val Ala Asp Asp 
735 

Val Glu Gly Met 
750 

Met Ser Ser Pro 
765

Glu Gly Glu Thr 

Leu Thr Thr Glu 
800 

Asn Pro Phe Thr 
815 

Ile Ser Phe Gly

830 
Glu Phe 
845 



<210> 356 
<211> 2979 
<212> DNA 

<213> Homo sapiens 



<400> 356 

agcaaaccaa tcgcaagcct cgttgagtgg 
ttaaagcccc tccaatcagc ggctcggtgc 
tgtgagggtt cctgcttcgg agtcggcggt 
tgtttggttg aggtttcacg ctagaaggtg 
tcgacacagg aaggatataa gtactgaaat 
actgtctcag aaagaaaata gacataagga 
agatgtaaac attccaacct tggaaggtag 
agagcttgtt ccagaaaaga ccaatgttaa 
tcaacgaaaa cagatgctcc aaaaatacaa 
gcagagagag aaagctaaac gaggaatatt 
ttgttttctt ttatcaaacc agaatgctgt 
ttctgtacgg attacaaggt caaaggccaa 
cgagagtgat gttcgagcaa tccgacctgg 
agacaaagag aaaaaagttg tgcagcctgt 
agctactcaa gcagcaaagc aggttcccag 
agtcacaaga gctgctaatg aaaacgaacc 
tgccaaaaat gtagaaacaa aacccgacaa 
aaatactttg aattcacaaa ctaatgcaac 



aaggggtggg atcttccccg gaagttttgg 60 
ggcaagtttg aatttcgtgg aggctcgggt 120 
ggtcgtccag accgagtgtt ctttactttt 180 
gctcaggatg tcttcatcac attttgccag 240 
gattagaact aaaattgctc ataggaaatc 300 
atacgaacga aatagacact ttggtttgaa 360 
aattcttgtt gaattagatg agacatctca 420 
gccaagggca atgaaaacta ttctaggtga 480 
agaagaaaag caacttcaaa aattgaaaga 540 
taaagtgggt cgttatagac ctgatatgcc 600 
gaaagctgag ccaaaaaagg ctattccatc 660 
agaccaaatg gagcagacta agattgataa 720 
tccaagacaa acttctgaaa agaaagtgtc 780 
aatgcccacg tcgttgagaa tgactcgatc 840 
aacagtctca tctaccacag caagaaagcc 900 
agaaggaaag gtgccaagta aaggaagacc 960 
gggtatttct tgtaaagtcg atagtgaaga 1020 
aagtggaatg aatccagatg gagtcttatc 1080 
aaaaatggaa aacttacctg agataaatac tgcaaaaata aaagggaaga attccttcgc 1140
acctaaggat tttatgtttc agccactgga tggtctgaag acctatcaag taacacctat 1200 
gactcccaga agtgccaatg cttttttgac acccagttac acctggactc ctttaaaaac 1260 
agaagttgat gagtctcaag caacaaaaga aattttggca caaaaatgta aaacttactc 1320 
taccaagaca atacagcaag attcaaataa attgccatgt cctttgggtc ctctaactgt 1380 
ttggcatgaa gaacatgttt taaataaaaa tgaagctact actaaaaatt taaatggcct 1440 
tccaataaaa gaagtcccat cacttgaaag aaatgaaggt cgaattgctc agccccacca 1500 
tggtgtgcca tatttcagaa atatcctcca gtcagaaact gagaaattaa cttcacattg 1560 
cttcgagtgg gacaggaaac ttgaattgga cattccagat gatgctaaag atcttattcg 1620 
cacagcagtt ggtcaaacaa gactccttat gaaggaaagg tttaaacagt ttgaaggact 1680 
ggttgatgat tgtgaatata aacgaggtat aaaggagact acctgtacag atctggatgg 1740 
attttgggat atggttagtt ttcagataga agatgtaatc cacaaattca acaatctgat 1800 
caaacttgag gaatctgggt ggcaagtcaa taataatatg aatcataata tgaacaaaaa 1860 
tgtctttagg aaaaaagttg tctcaggtat agcaagtaaa ccaaaacagg atgatgctgg 1920 
aagaattgca gcgagaaatc gcctagctgc cataaaaaat gcaatgagag agagaattag 1980 
gcaggaagaa tgtgctgaaa cagcagtttc tgtgatacca aaggaagttg ataaaatagt 2040 
gttcgatgct ggatttttca gagttgaaag tcctgttaaa ttattctcag gactttctgt 2100 
ctcttctgaa ggcccttctc aaagacttgg aacacctaag tctgtcaaca aagctgtatc 2160 
tcagagtaga aatgagatgg gcattccaca acaaactaca tcaccagaaa atgccggtcc 2220 
tcagaatacg aaaagtgaac atgtgaagaa gactttgttt ttgagtattc ctgaaagcag 2280 
gagcagcata gaagatgctc agtgtcctgg attaccagat ttaattgaag aaaaccatgt 2340 
tgtaaataag acagacttga aggtggattg tttatccagt gagagaatga gtttgcctct 2400 
tcttgctggt ggagtagcag atgatattaa tactaacaaa aaagaaggaa tttcagatgt 2460 
tgtggaagga atggaactga attcttcaat tacatcacag gatgttttga tgagtagccc 2520 
tgaaaaaaat acagcttcac aaaatagcat cttagaagaa ggggaaacta aaatttctca 2580 
gtcagaacta tttgataata aaagtctcac tactgaatgc caccttcttg attcaccagg 2640 
tctaaactgc agtaatccat ttactcagct ggagaggaga catcaagaac atgccagaca 2700 
catttctttt ggtggtaacc tgattacttt ttcacctcta caaccaggag aattttgaat 2760 
ttaaaaataa atccaaacat tttccttcat attatcaatg cttatatatt ccttagacta 2820 
ttgaaatttt ggagaaaatg tatttgtgtt cacttctata gcatataatg ttttaatatt 2880 
ctgtgttcat caaagtgtat tttagatata ctctttctca agggaagtgg ggatattttg 2940 
tacattttca acacagaata aaaaatgtac tgtgccttg 2979 

<210> 357 
<211> 191 
<212> PRT 

<213> Homo sapiens 



<400> 357 



Met 


Ala 


Thr 


Leu 


Ile


Tyr 


Val 


Asp 


Lys 


Glu 


Ile


Gly 


Glu 


Pro 


Gly 


Thr 


1 








5 










10 










15 




Arg 


Val 


Ala 


Ala 


Lys 


Asp 


Val 


Leu 


Lys 


Leu 


Glu 


Ser 


Arg 


Pro 


Ser 


Ile








20 










25 










30 






Lys 


Ala 


Leu 


Asp 


Gly 


Ile


Ser


Gln


Val 


Leu 


Thr 


Pro 


Arg 


Phe 


Gly Lys 






35 










40 










45 








Thr 


Tyr 


Asp Ala 


Pro 


Ser 


Ala 


Leu 


Pro 


Lys 


Ala 


Thr 


Arg 


Lys 


Ala 


Leu 




50 










55 










60 










Gly 


Thr 


Val 


Asn 


Arg 


Ala 


Thr 


Glu 


Lys 


Ser 


Val 


Lys 


Thr 


Asn 


Gly 


Pro 


65 










70 










75 










80 


Arg 


Lys 


Gln


Lys


Gln


Pro 


Ser 


Phe 


Ser 


Ala 


Lys 


Lys 


Met 


Thr 


Glu 


Lys 










85 










90 










95 




Thr 


Val 


Lys 


Thr 


Lys 


Ser 


Ser 


Val 


Pro 


Ala 


Ser 


Asp 


Asp Ala 


Tyr 


Pro 








100 










105 










110 






Glu 


Ile


Glu 


Lys 


Phe 


Phe 


Pro 


Phe 


Asn 


Leu 


Leu 


Asp 


Phe 


Glu 


Ser 


Phe 






115 










120 










125 








Asp 


Leu 


Pro 


Glu 


Glu 


Arg 


Gln


Ile


Ala 


His 


Leu 


Pro 


Leu 


Ser 


Gly 


Val 




130 










135 










140 








Pro 


Leu 


Met 


Ile


Leu 


Asp 


Glu 


Glu 


Gly 


Glu 


Leu 


Glu 


Lys 


Leu 


Phe 


Gln


145 










150 










155 








160 


Leu 


Gly 


Pro 


Pro 


Ser 


Pro 


Val 


Lys 


Met 


Pro 


Ser 


Pro 


Pro 


Trp 


Glu 


Cys 



165 170 175 
Asn Leu Phe Ala Val Ser Phe Lys His Ser Val Asp Pro Gly Cys
180 185 190



<210> 358 
<211> 576 
<212> DNA 
<213> Homo sapiens 



<400> 358 

atggctactc 

aaggatgtgc 

gttttaacac 

agaaaggctt 

agaaaacaaa 

aaaagttctg 

aatcttctag 

ttgagtggag 

ctgggccccc 

gtctccttca 



tgatctacgt 
tgaagctgga 
cacgttttgg 
tgggcactgt 
aacagccaag 
ttcctgcctc 
actttgagag 
tgcctctcat 
cttcacctgt 
agcattctgt 



tgataaggaa 
gtctagacct 
caaaacatac 
caacagagct 
cttttctgcc 
agatgacgcc 
ttttgacctg 
gatccttgat 
gaaaatgccc 
cgaccctgga 



attggagaac 
tcaatcaaag 
gatgctccat 
acagaaaagt 
aaaaagatga 
tatccagaaa 
cctgaagagc 
gaggagggag 
tctccaccat 
tgttga 



caggcacccg 
cattagatgg 
cagccttacc 
cagtaaagac 
ccgagaagac 
tagaaaaatt 
gccagattgc 
agcttgaaaa 
gggaatgcaa 



tgtggctgcc 
gatatctcaa 
taaagctacc 
caatggaccc 
tgttaaaaca 
ctttcccttc 
acacctcccc 
gctgtttcag 
tctgtttgca 



60
120 
180 
240 
300 
360 
420 
480 
540 
576 



<210> 359 
<211> 202 
<212> PRT 
<213> Homo sapiens 



<400> 359 
Met Ala Thr 
1 

Leu Val Ala 

Lys Ala Leu 

35 

Thr Phe Asp 
50 

Gly Thr Val 

65 

Leu Lys Gln

Thr Val Lys 

Glu Ile Glu
115 

Asp Leu Pro 

130 
Pro Leu Met 
145 

Leu Gly Pro 

Asn Leu Leu 

Leu Pro Pro 
195 



Leu 

Thr 
20 
Asp 

Ala 

Asn 

Lys 

Ala 
100
Lys 

Glu 

Ile

Pro 

Gln
180 
Val 



Ile Tyr Val
5 

Lys Asp Gly 

Gly Arg Ser 

Pro Thr Ser 
55 

Arg Ala Thr 
70 

Gln Pro Ser
85 

Lys Asn Ser 

Leu Phe Pro 

Glu His Gln
135 

Leu Asp Glu 
150 

Ser Pro Leu 
165 

Ser Pro Leu 
Cys Cys Asp 



Asp 

Leu 

Gln

40 

Leu 

Glu 

Phe 

Val 

Phe 
120 
Ile

Glu 

Lys 

Ser 

Ile
200 



Lys 

Lys 

25 

Val 

Pro 

Lys 

Ser 

Pro 
105 
Asn 

Ala 

Arg 

Met 

Ile
185 
Asp 



Glu Asn 
10 

Leu Gly 

Ser Ile

Lys Ala 

Ser Val 

75 
Ala Lys 
90 

Ala Ser 

Pro Leu 

His Leu 

Glu Leu 
155 
Pro Ser 
170 

Leu Leu 



Ile



Glu Glu Pro 

Ser Gly Pro 
30 

Ser Cys Phe 
45 

Thr Arg Lys 
60 

Lys Thr Asn 

Lys Met Thr 

Asp Asp Gly 
110 

Gly Phe Glu 
125 

Pro Leu Ser 
140 

Glu Lys Leu 

Pro Pro Trp 

Thr Leu Asp 
190 



Gly Ile
15

Ser Ile

Gly Lys 

Ala Leu 

Gly Pro 

80 
Glu Lys 
95 

Tyr Pro 

Ser Phe 

Glu Val 

Phe Gln
160 
Lys Ser 
175 

Val Glu 



<210> 360 
<211> 609 
<212> DNA 
<213> Homo sapiens 

<400> 360 
atggctactc
aaggatgggc 
gtttcaatat 
agaaaggctt 
ctcaaacaaa 
aaaaactctg 
aatcctctag 
ttgagtgaag 
ctgggccccc 
tctcctttaa 
gatatttaa 



tgatctatgt 
tgaagctggg 
catgttttgg 
tgggaactgt 
aacagccaag 
ttcctgcctc 
gcttcgagag
tgcctctcat 
cttcaccttt 
gcattctgtt 



tgataaggaa 
gtctggacct 
caaaacattc 
caacagagct 
cttttctgcc
agatgatggc 
ttttgacctg 
gatacttgat 
gaagatgccc
gaccctggat 



<210> 361 
<211> 450 
<212> PRT 

<213> Homo sapiens 



aacgaagaac 
tcaatcaaag 
gatgctccca 
acagaaaagt 
aaaaagatga 
tatccagaaa 
cctgaagagc 
gaggagagag 
tctccaccat 
gttgaattgc 



caggcatcct 
ccttagatgg 
catccttacc 
cagtaaagac 
ctgagaagac 
tagaaaaatt 
accagattgc 
agcttgaaaa 
ggaaatccaa 
cacctgtttg 



tgtggctaca 
gagatctcaa 
taaagctacc 
caatggaccc 
tgttaaagca 
atttcccttc 
acatctcccc 
gctgtttcag 
tctgttgcag 
ctgtgacata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

609 



<400> 361 



Met 


Asp 


Ala 


Arg 


Arg 


Val 


Pro 


Gln


Lys 


Asp 


Leu 


Arg 


Val 


Lys 


Lys 


Asn 


1 








5 










10 










15 




Leu 


Lys 


Lys 


Phe 


Arg 


Tyr 


Val 


Lys 


Leu 


Ile


Ser 


Met 


Glu 


Thr 


Ser 


Ser 








20 










25 










30 






Ser 


Ser 


Asp 


Asp 


Ser 


Cys 


Asp 


Ser 


Phe 


Ala 


Ser 


Asp 


Asn 


Phe 


Ala 


Asn 






35 










40 










45 








Thr 


Lys 


Pro 


Lys 


Phe 


Arg 


Ser 


Asp 


Ile


Ser 


Glu 


Glu 


Leu 


Ala 


Ser 


Val 




50 










55 










60 










Phe 


Tyr 


Glu 


Asp 


Ser 


Asp 


Asn 


Glu 


Ser 


Phe 


Cys 


Gly 


Phe 


Ser 


Glu 


Ser 


65










70 










75 










80 


Glu 


Val 


Gln


Asp 


Val 


Leu 


Asp 


His 


Cys 


Gly 


Phe 


Leu 


Gln


Lys 


Pro 


Arg 










85 










90 










95 




Pro 


Asp 


Val 


Thr 


Asn 


Glu 


Leu 


Ala 


Gly 


Ile


Phe 


His 


Ala 


Asp 


Ser 


Asp 








100 










105 










110 






Asp 


Glu 


Ser 


Phe 


Cys 


Gly 


Phe 


Ser 


Glu 


Ser 


Glu 


Ile


Gln


Asp 


Gly 


Met 






115 










120 










125 








Arg 


Leu 


Gln


Ser 


Val 


Arg 


Glu 


Gly 


Cys 


Arg 


Thr 


Arg 


Ser 


Gln


Cys 


Arg 




JL 










1 Jo 




















His 


Ser 


Gly 


Pro 


Leu 


Arg 


Val 


Ala 


Met 


Lys 


Phe 


Pro 


Ala 


Arg 


Ser 


Thr 


145 










150 










155 










160 


Arg 


Gly 


Ala 


Thr 


Asn 


Lys 


Lys 


Ala 


Glu 


Ser 


Arg 


Gln


Pro 


Ser 


Glu 


Asn 










165 










170 










175 




Ser 


Val 


Thr 


Asp 


Ser 


Asn 


Ser 


Asp 


Ser 


Glu 


Asp 


Glu 


Ser 


Gly 


Met 


Asn 








180 










185 










190 






Phe 


Leu 


Glu 


Lys 


Arg 


Ala 


Leu 


Asn 


Ile


Lys


Gln


Asn 


Lys 


Ala 


Met 


Leu 






195 










200 










205 








Ala 


Lys 


Leu 


Met 


Ser 


Glu 


Leu 


Glu 


Ser 


Phe 


Pro 


Gly 


Ser 


Phe 


Arg 


Gly 




210 










215 










220 










Arg 


His 


Pro 


Leu 


Pro 


Gly 


Ser 


Asp 


Ser 


Gln


Ser 


Arg 


Arg 


Pro 


Arg 


Arg 


225 










230 










235 










240 


Arg 


Thr 


Phe 


Pro 


Gly 


Val 


Ala 


Ser 


Arg 


Arg 


Asn 


Pro 


Glu 


Arg 


Arg 


Ala 










245 










250 










255 




Arg 


Pro 


Leu 


Thr 


Arg 


Ser 


Arg 


Ser 


Arg 


Ile


Leu 


Gly 


Ser 


Leu 


Asp 


Ala 








260 










265 










270 






Leu 


Pro 


Met 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu Asp Lys 


Tyr 


Met 


Leu 


Val 


Arg 






275 










280 










285 








Lys 


Arg 


Lys 


Thr 


Val 


Asp 


Gly 


Tyr 


Met 


Asn 


Glu 


Asp 


Asp 


Leu 


Pro 


Arg 




290 










295 










300 










Ser 


Arg 


Arg 


Ser 


Arg 


Ser 


Ser 


Val 


Thr 


Leu 


Pro 


His 


Ile


Ile


Arg 


Pro 


305 










310 










315 










320 


Val 


Glu 


Glu 


Ile


Thr 


Glu 


Glu 


Glu 


Leu 


Glu 


Asn 


Val 


Cys 


Ser 


Asn 


Ser 










325 










330 










335 




Arg 


Glu 


Lys 


Ile


Tyr 


Asn 


Arg 


Ser 


Leu 


Gly 


Ser 


Thr 


Cys 


His 


Gln


Cys 
340










Arg 


Gln


Lys 

355 


Thr 


Ile


Asp 


Thr 


Lys 
360 


Trp 


Gly 


Val 


Arg 


Gly 


Gln


Phe 


Cys 




370 










375 


Gly 


Glu 


Glu 


Val 


Arg 


Asp 


Ala 


Leu 


385 










390 






Pro 


Cys 


Arg 


Gly 


Ile
405 


Cys 


Asn 


Cys 


Arg 


Cys 


Ala 


Thr 
420 


Gly 


Val 


Leu 


Val 


Gly 


Asn 


Val 
435 


His 


Ala 


Tyr 


Leu 


Lys 
440 



Gln Ala



450 



345 










350 






Thr 


Asn 


Cys 


Arg 


Asn 


Pro 


Asp 


Cys










365 






Gly 


Pro 


Cys 


Leu 


Arg 


Asn 


Arg 


Tyr








380 








Leu 


Asp 


Pro 


Asn 


Trp 


His 


Cys 


Pro 






395 








400 


Ser 


Phe 


Cys 


Arg 


Gln


Arg 


Asp 


Gly 




410 










415 


Tyr 


Leu 


Ala 


Lys 


Tyr 


His 


Gly 


Phe 


425 










430 




Ser 


Leu 


Lys 


Gln


Glu 


Phe 


Glu 


Met 



445 



<210> 362 

<211> 2824 

<212> DNA 

<213> Homo sapiens 

<400> 362 

cccgagcccc gcccctccgg gcccgggtcg gcgcgcccag cctgccagcc gcgctgctgc 60 
tgctcctcct gctgtgggac cgctgaccgc gcggctgctc cgctctcccc gctccaagcg 120
ccgatctggg cacccgccac cagcatggac gctcgccgcg tgccgcagaa agatctcaga 180 
gtaaagaaga acttaaagaa attcagatat gtgaagttga tttccatgga aacctcgtca 240 
tcctctgatg acagttgtga cagctttgct tctgataatt ttgcaaacac gaaacctaaa 300 
ttcaggtcag atatcagtga agaactggca agtgtttttt atgaggactc tgataatgaa 360 
tctttctgcg gcttttcaga aagtgaggtg caagatgtat tagaccattg tggattttta 420 
cagaaaccaa ggccagatgt cactaacgaa ctggccggta tttttcatgc cgactctgac 480 
gatgaatcat tttgcggttt ctcagagagt gagatacaag atggaatgag gctgcagtca 540 
gttcgggaag gctgtaggac ccgcagccag tgcaggcact ctggacctct cagggtggcg 600 
atgaagtttc cagcgcggag taccagggga gcaaccaaca aaaaagcaga gtcccgccag 660 
ccctcagaga attctgtgac tgattccaac tccgattcag aagatgaaag tggaatgaat 720 
tttttggaga aaagggcttt aaatataaag caaaacaaag caatgcttgc aaaactcatg 780 
tctgaattag aaagcttccc tggctcgttc cgtggaagac atcccctccc aggctccgac 840 
tcacaatcaa ggagaccgcg aaggcgtaca ttcccgggtg ttgcttccag gagaaaccct 900 
gaacggagag ctcgtcctct taccaggtca aggtcccgga tcctcgggtc ccttgacgct 960 
ctacccatgg aggaggagga ggaagaggat aagtacatgt tggtgagaaa gaggaagacc 1020 
gtggatggct acatgaatga agatgacctg cccagaagcc gtcgctccag atcatccgtg 1080 
acccttccgc atataattcg cccagtggaa gaaattacag aggaggagtt ggagaacgtc 1140 
tgcagcaatt ctcgagagaa gatatataac cgttcactgg gctctacttg tcatcaatgc 1200 
cgtcagaaga ctattgatac caaaacaaac tgcagaaacc cagactgctg gggcgttcga 1260 
ggccagttct gtggcccctg ccttcgaaac cgttatggtg aagaggtcag ggatgctctg 1320 
ctggatccga actggcattg cccgccttgt cgaggaatct gcaactgcag tttctgccgg 1380 
cagcgagatg gacggtgtgc gactggggtc cttgtgtatt tagccaaata tcatggcttt 1440 
gggaatgtgc atgcctactt gaaaagcctg aaacaggaat ttgaaatgca agcataatat 1500 
ctggaaaatt tgctgcctgc cttctacttc tcaaatcttt cttgtaaaag tttccaattt 1560 
tttcactgaa acctgagtta aaaatcttga tgatcagcct gtttcataag aaactccaat 1620 
caagttaatc ttagcagaca tgtgtttctg gagcatcaca gaaggtatat tgctagttac 1680 
actttgccct cctgcagttt cttctctgct cccaaccccc atctcatagc atccccctct 1740 
atttccaatg ctcctctcca accgcttagt ttctgaattt cttttaaatt acagttttat 1800 
gaaagcatat tttatttact tggtgttgaa atagccctca taaaacctaa gcacttggaa 1860 
acacaataat agtattaact aactagatct attgaatttc agagaagagc cttctaactt 1920 
gtttacacaa aaacgagtat gatttagcat tcatactagt tgaaattttt aatagaatca 1980 
aggcacaaaa gtcttaaaac catgtggaaa aattaggtaa ttattgcaga ttgatgtctc 2040 
tcaatcccat gtattgcgct tatgttacaa gttgttgtca cagttgagac ttaatttctc 2100 
ctaatttctt ctgcccgaag ggtaagtggt gcgtccagct tacacaatca taattcaaag 2160 
gttggtgggc aatgtaatac ttaattaaaa taatgatgga agagctatct ggagattatg 2220 
agtaagctga tttgaatttt cagtataaaa ctttagtata attgtagttt gcaaagttta 2280 
tttcagttca
atattaaaaa 
ggttcaagtt 
acttttagca 
tgtcagtatt 
tgtttgtatg 
gcataattgg 
acaaaagttt 
tcttactgta 
ttct 



catgtaaggt 
ctagtctgtg 
tagattttaa 
gtttgttaac 
ccccctcctc 
tccaatttac
accttgaatc 
attttaataa 
cttaattcta 



attgcaaata 
gttctttgca 
gcacttttat 
ctgacatctc 
tttgcattaa 
ttgcatatgt 
gataagtgta 
taaaatgttt 
attttttcct 



aattcttgga 
gtttcttgta 
aacaatgata 
tgccagtcta 
tcaaggtatt 
aaaccattgc 
aatacagctt 
gttctaactt 
catatttaaa 



caattttgta 
aatttataaa 
agtgcctttt 
gtttctgggc 
tggtagaggt 
tgtgccattc 
ttgatctgta
gtctgctttt 
taaaaggcca 



tggaaacttg 
ccaggcacaa 
tggagatgta 
aggtttcctg 
ggaatctaag 
aatgtttgat 
atgcttttat 
ttaaaaataa 
tttccacctt



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2824 



<210> 363 

<211> 371 

<212> PRT 

<213> Homo sapiens 



<400> 363 



Met 


Asp 


Ala 


Arg 


Arg 


Val 


Pro 


Gln


1 








5








Leu 


Lys 


Lys 


Phe 


Arg 


Tyr 


Val


Lys 








20 










Ser 


Ser


Asp


Asp


Ser


Cys


Asp 


Ser 






35 










40


Thr 


Arg 


Leu 


Gln


Ser 


Val 


Arg 


Glu 




a 










c c 

33 




Arg 


His


Ser 


Gly 


Pro 


Leu 


Arg 


Val












70






Thr


Arg


Gly Ala


Thr


Asn 


Lys 


Lys 










85








Asn 


Ser


Val


Thr


Asp


Ser


Asn


Ser








100 










Asn 


Phe


Leu 


Glu 


Lys 


Arg 


Ala


Leu 






115 










120


Leu 


Ala


Lys 


Leu 


Met


Ser




Leu 




130










135




Gly


Arg 


His 


Pro 


Leu


Pro


Gly


Ser


145
















Arg 


Arg 


Thr 


Phe 


Pro 


Gly 


Val 


Ala 










165 








Ala 


Arg 


Pro 


Leu 


Thr 


Arg 


Ser 


Arg 








180 










Ala 


Leu 


Pro 


Met 


Glu 


Glu 


Glu 


Glu 






195 










200 


Arg 


Lys 


Arg 


Lys 


Thr 


Val 


Asp 


Gly 




210 










215 




Arg 


Ser 


Arg 


Arg 


Ser 


Arg 


Ser 


Ser 


225 










230 






Pro 


Val 


Glu 


Glu 


Ile


Thr 


Glu 


Glu 










245 








Ser 


Arg 


Glu 


Lys 


Ile


Tyr 


Asn 


Arg 








260 










Cys 


Arg 


Gln


Lys 


Thr 


Ile


Asp 


Thr 






275 










280 


Cys 


Trp 


Gly Val 


Arg 


Gly 


Gln


Phe 




290 










295 




Tyr 


Gly 


Glu 


Glu 


Val 


Arg 


Asp 


Ala 


305 










310 






Pro 


Pro 


Cys 


Arg 


Gly 


Ile


Cys 


Asn 










325 








Gly 


Arg 


Cys 


Ala 


Thr 


Gly 


Val 


Leu 








340 











Lys 


Asp 


Leu 


Arg 


Val


Lys 


Lys 


Asn 




10 










15 




Leu 


Ile


Ser 


Met 


Glu


Thr


Ser


Ser


25 










30 






Phe 


Ala 


Ser 


Asp 


Asn 


Phe 


Ala 


Asn 










45








Gly Cys Arg 


Thr 


Arg 


Ser 


Gln


Cys 








60










Ala 


Met 


Lys 


Phe


Pro 


Ala 


Arg 


Ser 






75 










80


Ala 


Glu 


Ser 


Arg 


Gln


Pro


Ser


Glu




90 










95




Asp 


Ser 


Glu 


Asp 


Glu


Ser


Gly


Met 


105










110






Asn 


Ile


Lys 


Gln


Asn 


Lys 


Ala


Met


















Glu 


Ser 


Phe 






Ser


Phe


Arg 








140










Asp 


Ser 


Gln


Ser


Arg 


Arg 


Pro 


Arg 






155 










160


Ser 


Arg Arg 


Asn 


Pro 


Glu 


Arg 


Arg 




170 










175 




Ser Arg 


Ile


Leu 


Gly 


Ser 


Leu 


Asp 


185 










190 






Glu 


Glu 


Asp 


Lys 


Tyr 


Met 


Leu 


Val 










205 








Tyr 


Met 


Asn 


Glu 


Asp 


Asp 


Leu 


Pro 








220 










Val 


Thr 


Leu 


Pro 


His 


Ile


Ile


Arg 






235 










240 


Glu 


Leu 


Glu 


Asn 


Val 


Cys 


Ser 


Asn 




250 










255 




Ser 


Leu Gly 


Ser 


Thr 


Cys 


His 


Gln


265 










270 






Lys 


Thr 


Asn 


Cys 


Arg 


Asn 


Pro 


Asp 










285 








Cys 


Gly 


Pro 


Cys 


Leu 


Arg 


Asn 


Arg 








300 










Leu 


Leu 


Asp 


Pro 


Asn 


Trp 


His 


Cys 






315 










320 


Cys 


Ser 


Phe 


Cys 


Arg 


Gln


Arg 


Asp 




330 










335 




Val 


Tyr 


Leu 


Ala 


Lys 


Tyr 


His 


Gly 


345 










350 
355 360 365

Met Gln Ala

370 



<210> 364 
<211> 2587 
<212> DNA 

<213> Homo sapiens 
<400> 364 



rt^n 9 t CC t 9C ^ CCtCc ^ gcccgggtcg gcgcgcccag cctgccagcc gcgctgctgc 60 
tgctcctcct gctgtgggac cgctgaccgc gcggctgctc cgctctcccc gc^ccaagcg 120 

gtaaaaaaaa 9 acttaf" 0 CagCatggaC ^cgccgcg tgccgcagaa agatc'caga' 80 
gtaaagaaga acttaaagaa attcagatat gtgaagttga tttccatgga aacctcgtca 240 
tcctctgatg acagttgtga cagctttgct tctgataatt ttgcaaacac gaggctgcag 300 
tcagttcggg aaggctgtag gacccgcagc cagtgcaggc actctggacc tc^agqqtq 360 
gcgatgaagt ttccagcgcg gagtaccagg ggagcaacca acaaaaaagc agagj^cgc 420 
aattSr 9 agaattCtgt g-ctgattcc aactccgatt cagaagatga aagtggaa?g 480 
aattttttgg agaaaagggc tttaaatata aagcaaaaca aagcaatgct tgcaaaactc 540 
gactcaSt aa ^t "ctggctcg ttccgtggaa gac'atccLt cccaggctcc 00 

^^! Caat caa W a <3*<=c gcgaaggcgt acattcccgg gtgttgcttc caggagaaac 660 

gctctaccca co^af" tCttaCCagg ^caaggtccc ggatcctcgg gteL?^ 120 
gctctaccca tggaggagga ggaggaa g ag gataagtaca tgttggtgag aaaqaqqaao 780 

ZTJV*? , tgaagatgaC -tgcccagaa glcgllgltl cTgTtlTtol 8 0 
gtctgcaoca " tCgC< = Cagt g ^agaaatta cagaggagga gttggagaac 900 

gtctgcagca attctcgaga gaagatatat aaccgttcac tgggctctac ttqtcatcaa 960 
tgccgtcaga agactattga taccaaaaca aactgcagaa acccagactg ctggggcqt? 1020 
cgaggccagt tctgtggccc ctgccttcga aaccgttatg gtgaagaggt caggqatqcJ JSeO 
ctgctggatc cgaactggca ttgcccgcct tgtcgaggaa tc?gcaactg cagt c q 11*0 
t??a a aa 9a9 atggaCggtg tgcgactggg gtccttgtgt att'agccaa atatcatggc lloo 
S? a 9 attt^r^ f tgaaaagC ctgaaacagg aatttgaaat gcaagcataa 1260^ 
tatctggaaa atttgctgcc tgccttctac ttctcaaatc tttcttgtaa aagtttccaa 1320 
aatcaaqtta 'tcttf" " aaaaatCt tgatgatcag cctgtt?cat aa aS X3B0 
ttr- a tcttagcag acatgtgttt ctggagcatc acagaaggta tattgctagt 1440 

cctatttcca TattllTt tttCttCtCt ^tcccaacc cccatctcat agcaLcc X500 
tctatttcca atgctcctct ccaaccgctt agtttctgaa tttcttttaa attacaqttt 1560 
tatgaaagca tattttattt acttggtgtt gaaatagccc tcataaaacc taagcacttg llto 

cttqtttaca oa aa a L tatta aC ^ actaga tctattgaat ttcagagaag agccttccaa itto 
cttgtttaca caaaaacgag tatgatttag cattcatact agttgaaatt tttaataqaa 1740 
tcaaggcaca aaagtcttaa aaccatgtgg aaaaattagg taatcattgc a a ^ 

Ttoltatttt c'tctar 9 " 9Ctta , tgtta "agttgttg tcacagttga gacttaaS 860 
tlaattnnln CttCtgc "9 aagggtaagt ggtgcgtcca gcttacacaa tcataattca 1920 
aaggttggtg ggcaatgtaa tacttaatta aaataatgat ggaagagcta tctqqagatt 1980 
atgagtaagc tgatttgaat tttcagtata aaactttagt ataactgtag tttgSal^ 11™ 

ttgatSaa tllTTt g f attgCaa ^aaattctt ggacaa?tt? gtatggaaac 2X00 
f aaactagtct gtggttcttt gcagtttctt gtaaatttat aaaccaggca 2160 
caaggttcaa gtttagattt taagcacttt tataacaatg ataagtgcct ttttqqaqa? 2220 
cJatoJi" 3 g ^ gtttgtt aacctgacat ctctgccag? ctagcttctg ggcaggt^c lllo 
ctgtgtcagt attccccctc ctctttgcat taatcaaggt atttggtaga qqtqqaatct 2340 
aagtgtttgt atgtccaatt tacttgcata tgtaaaccat tgctg?g«a ??cal?a££ 24^0 
gatgcataat tggaccttga atcgataagt gtaaatacag c?tt?g«c? gtaatqc^tJ 2460 
tatacaaaag tttattttaa taataaaatg tttgttctaa cttgt^g" 2 \\f 0 

taatcttact gtacttaatt ctaatttttt cctcatattt aaaLaaagg ccat^tccac 2580 

2587 

<210> 365 
<211> 709 
<212> PRT 

<213> Homo sapiens 
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<400> 365 



Met 


Ser 


Phe 


Pro 


Lys 


Ala 


Pro 


Leu 


Lys 


Arg


Phe 


Asn 


Asp 


Pro 


Ser 


Gly 


1 








5 










10 










15 




Cys


Ala 


Pro 


Ser 


Pro 


Gly


Ala 


Tyr


Asp 


Val 


Lys


Thr 


Leu 


Glu 


Val 


Leu 








20 










25 










30 






Lys 


Gly


Pro 


Val 


Ser 


Phe 


Gln


Lys 


Ser 


Gln


Arg 


Phe 


Lys 


Gln


Gln


Lys






35 










40 










45








Glu 


Ser 




Gln


Asn 


Leu 


Asn 


Val 


Asp 


Lys 


Asp 


Thr 


Thr 


Leu 


Pro 


Ala 




50 










55 










60 










Ser 


Ala 


Arg


Lys 


Val 


Lys 


Ser 


Ser 


Glu 


Ser 


Lys


Ile


Arg 


Val 


Leu 


Leu 


65 










70 










75 










80 


Gln


Glu 


Arg 


Gly


Ala 


Gln


Asp 


Ser 


Arg 


Ile


Gln


Asp 


Leu 


Glu 


Thr 


Glu 










85 










90 










95 




Leu 


Glu 


Lys 


Met 


Glu 


Ala 


Arg 


Leu 


Asn 


Ala 


Ala 


Leu 


Arg 


Glu 


Lys 


Thr 








100 










105 










110 






Ser 


Leu 


Ser 


Ala 


Asn 


Asn 


Ala 


Thr 


Leu 


Glu 


Lys 


Gln


Leu 


Ile


Glu 


Leu 






115 










120 










125 








Thr 


Arg 


Thr 


Asn 


Glu 


Leu 


Leu 


Lys 


Ser 


Lys 


Phe 


Ser 


Glu 


Asn 


Gly


Asn 




130 










135 










140










Gln


Lys 


Asn 


Leu 


Arg 


Ile


Leu 


Ser 


Leu


Glu 


Leu 


Met 


Lys 


Leu


Arg 


Asn 


145 










150 










155 










160 


Lys 


Arg 


Glu 


Thr 


Lys 


Met 


Arg 


Gly 


Met 


Met 


Ala 


Lys 


Gln


Glu 


Gly 


Met 










165 










170 










175 




Glu 


Met 


Lys 


Leu 


Gln


Val 


Thr 


Gln


Arg 


Ser 


Leu 


Glu 


Glu 


Ser 


Gln


Gly 








180 










185 










190 






Lys 


Ile


Ala 


Gln


Leu 


Glu 


Gly 


Lys 


Leu 


Val 


Ser 


Ile


Glu 


Lys 


Glu 


Lys 






195 










200 










205 








Ile


Asp 


Glu 


Lys 


Ser 


Glu 


Thr 


Glu 


Lys 


Leu 


Leu 


Glu 




Ile


Glu 


Glu 




210 










215 










220










Ile


Ser 


Cys


Ala 


Ser 


Asp 


Gln


Val 


Glu 


Lys 


Tyr


Lys 


Leu 


Asp 


Ile


Ala 


225 










230 










235 










240 


Gln


Leu 


Glu 


Glu 


Asn 


Leu 


Lys


Glu 


Lys 


Asn 


Asp 


Glu 


Ile


Leu 


Ser 


Leu 










245 










250 










255 




Lys 


Gln


Ser 


Leu 


Glu 


Glu 


Asn 


Ile


Val


Ile


Leu 


Ser 


Lys 


Gln


Val 


Glu 








260 










265 










270 






Asp 


Leu 


Asn 


Val 


Lys 


Cys 


Gln


Leu 


Leu 


Glu 


Lys 


Glu 


Lys 


Glu 


Asp 


His 






275 










280 










285 








Val 


Asn 


Arg 


Asn 


Arg 


Glu 


His 


Asn 


Glu 


Asn 


Leu 


Asn 


Ala 


Glu 


Met 


Gln




290 










295 










300 










Asn 


Leu 


Lys 


Gln


Lys 


Phe 


Ile


Leu 


Glu 


Gln


Gln


Glu 


Arg 


Glu 


Lys 


Leu 


305 










310 










315 










320 


Gln


Gln


Lys 


Glu 


Leu 


Gln


Ile


Asp 


Ser 


Leu 


Leu 


Gln


Gln


Glu 


Lys 


Glu 










325 










330 










335 




Leu 


Ser 


Ser 


Ser 


Leu 


His 


Gln


Lys 


Leu 


Cys 


Ser 


Phe 


Gln


Glu 


Glu 


Met 








340 










345 










350 






Val 


Lys 


Glu 


Lys 


Asn 


Leu 


Phe 


Glu 


Glu 


Glu 


Leu 


Lys 


Gln


Thr 


Leu 


Asp 






355 










360 










365 








Glu 


Leu 


Asp 


Lys 


Leu 


Gln


Gln


Lys 


Glu 


Glu 


Gln


Ala 


Glu 


Arg 


Leu 


Val 




370 










375 










380 










Lys


Gln


Leu 


Glu 


Glu 


Glu 


Ala 


Lys 


Ser 


Arg 


Ala 


Glu 


Glu 


Leu 


Lys 


Leu 


385 










390 










395 










400 


Leu 


Glu 


Glu 


Lys 


Leu 


Lys 


Gly 


Lys 


Glu 


Ala 


Glu 


Leu 


Glu 


Lys 


Ser 


Ser 










405 










410 










415 




Ala 


Ala 


His 


Thr 


Gln


Ala 


Thr 


Leu 


Leu 


Leu 


Gln


Glu 


Lys 


Tyr Asp 


Ser 








420 










425 










430 






Met 


Val 


Gln


Ser 


Leu 


Glu 


Asp 


Val 


Thr 


Ala 


Gln


Phe 


Glu 


Ser 


Tyr 


Lys 






435 










440 










445 








Ala 


Leu 


Thr 


Ala 


Ser 


Glu 


Ile


Glu 


Asp 


Leu 


Lys 


Leu 


Glu 


Asn 


Ser 


Ser 




450 










455 










460










Leu 


Gln


Glu 


Lys 


Ala 


Ala 


Lys 


Ala 


Gly 


Lys 


Asn 


Ala 


Glu 


Asp 


Val 


Gln
465

His Gln Ile Leu

Leu Leu Asp Leu
500

Glu Ile Thr Val
515

Leu Lys Gln Gln
530

Gly Arg Lys Ala
545

Ile Asn Lys Trp

Pro Phe Gln Leu
580 

Leu Asn Glu His 
595 

Ser Tyr Ala Lys 
610 

His Val Val Lys 
625 

Ser Lys Leu Arg 

Leu Gln Glu Glu
660 

Ser Lys Ala Phe 
675 

Pro Leu Lys Glu 
690 

Gln Glu Ser Trp
705 



470 

Ala Thr Glu Ser 
485 

Gln Thr Lys Ser

Ser Phe Leu Gln
520 

Glu Glu Asp Phe 
535 

Glu Lys Glu Asn 
550 

Arg Leu Leu Tyr 
565 

Gln Leu Asp Ala

Gly Ala Ala Gln
600 

Leu Leu Gly His 
615 

Leu Lys Asp Glu 
630 

Cys Gln Leu Ala
645 

Leu Asn Lys Val 

His His Glu Ser 
680 

Gly Asn Thr Asn 
695 

Lys 



475 

Ser Asn Gln Glu
490 

Ala Leu Lys Glu 
505 

Lys Ile Thr Asp

Arg Lys Gln Leu
540 

Thr Thr Ala Glu 
555 

Glu Glu Leu Tyr 
570 

Phe Glu Val Glu 
585 

Glu Gln Leu Asn

Gln Asn Leu Lys
620

Asn Ser Gln Leu
635

Lys Lys Lys Gln
650

Leu Gly Ile Lys
665 

Lys Glu Asn Phe 

Cys Tyr Arg Ala 
700 



480 

Tyr Val Arg Met 
495

Thr Glu Ile Lys
510

Leu Gln Asn Gln
525 

Glu Asp Glu Glu 

Leu Thr Glu Glu 
560 

Asn Lys Thr Lys 
575

Lys Gln Ala Leu
590

Lys Ile Arg Asp
605

Gln Lys Ile Lys

Lys Ser Glu Val 
640 

Ser Glu Thr Lys 
655 

His Phe Asp Pro 
670 

Ala Leu Lys Thr 
685 

Pro Met Glu Cys 



<210> 366 

<211> 2957 

<212> DNA 

<213> Homo sapiens 

<400> 366 



gccagtcacc ttcagtttct ggagctggcc gtcaacatgt cctttcctaa ggcgcccttg 60 
«:S:2£ f gacCCttc tggttgtgca ccatctccag gtgcttatga tJttaaSc? 12 



120 
180 
240 



ttagaagtat tgaaaggacc agtatccttt cagaaatcac aaagatttaa acaacaaaaa 
attaactc^ r* aat f taa ^"Qacaaa gatactacct tgcctgcttc agctagaaaa *« 0 
c 9 =9gaatcaaa gattcgtgtt cttctacagg aacgtggtgc ccaggacagc 300 

•!° t * g ! a " C tgagttggaa aagatggaag caaggctaaa tgctgcacL 360 
3 catctctctc tgcaaataat gctacactgg aaaaacaact tattgaattg 420 
'"^ atgaactact aaaatctaag ttttctgaaa atggtaacca gaagaatttg 480 

3 gcttggagtt gatgaaactt agaaacaaaa gagaaacaaa gatgagggg? 540 
agcaagaagg catggagatg aagctgcagg tcacccaaag gagtctcgaa 600 
a g ? a C !! 3 9 ggaaaatagc ccaactggag ggaaaacttg tttcaataga gaaagaaaag 660 
tltattl aatctgaaac agaaaaactc ttggaataca tcgaagaaat tagttgtgct 720 
aaaaatatta iT.ttt CaagCtagat attgcccagt tagaagaaaa tttgaaagag 780 
lllt-ttnl 9 aaa "" aag cc "aagcag tctcttgagg agaatattgt tatattatct 840 
aagatctaaa tgtgaaatgt cagctgcttg aaaaagaaaa agaagaccat 900 
aaattt Jr a " agagaaCa caacgaaaat ctaaatgcag agatgcaaaa cttaaaacag 960 
tca^^ ^V*^**** ggaacgtgaa aagcttcaac aaaaagaatt acaaattgat 1020 
tcacttctgc aacaagagaa agaattatct tcgagtcttc atcagaagct ctgttctttt 1080 
aaa^nT t99 " aaaga 9 aa 3 aa ^tg tttgaggaag aattaaagca aacactggat 1140 
gagcttgata aattacagca aaaggaggaa caagctgaaa ggctggtcaa gcaattggaa 1200 
aatCtagagC tgaagaatta aaactcctag aagaaaagct gaaaggg^ag 1260 
gaggctgaac tggagaaaag tagtgctgct catacccagg ccaccctgct tttgcaggaa 1320 
aagtatgaca
gcgttaacag 
gcggccaagg 
tcaaatcaag 
acagaaatta 
ctcaagcaac 
gaaaaagaaa 
gaagaactat 
aaacaggcat 
tcatatgcta 
ttgaaagatg 
aaaaaaaaac 
cactttgatc 
ccattaaaag 
aagtaaacat 
gctgttatta 
cttaaatatg 
gcaaatacct 
ttccagctta 
tctgaaacct 
tttctagtct 
tgattctgat 
agaaataagg 
atttacaggt 
taagcctgca 
actctacatg 
ctcagcattt 
caaaaaaaaa 



gtatggtgca 
ccagtgagat 
ctgggaaaaa 
aatatgtaag 
aagaaatcac 
aggaggaaga 
atacaacagc 
ataataaaac 
tgttgaatga 
aattattggg 
aaaatagcca 
aaagtgagac 
cttcaaaggc 
aaggcaatac 
ctgagaaacc 
tatttgacat 
tgaaaggaac 
cctccctaat 
gaatgcatct 
tagt ttcaag 
gagcttcttt 
aatgctcact 
acaagcctaa 
tcttaggctc 
ttcttaacaa 
taactatttc 
gattaacatt 
aaaaaaa 



aagccttgaa 
agaagatctt 
tgcagaggat 
gatgcttcta 
agtttctttt 
ctttagaaaa 
agaattaact 
aaaacctttt 
acatggtgca 
tcatcagaat 
actcaaatcg 
aaaacttcaa 
ttttcatcat 
aaactgttac 
tgttgaagat 
gggtatttta 
attttttacc 
gctcaccttt 
catcaactta 
agtctaaacc 
agctaggcta 
tggtcctacc 
cttcatagaa 
catcctgttt 
actcttcagt 
ttcagagttt 
tgtgtaatat 



gatgttactg 
aagctggaga 
gttcagcatc 
ga tctgcaga 
cttcaaaaaa 
cagctggaag 
gaagaaatta 
cagctacaac 
gctcaggaac 
ttgaaacaaa 
gaagtatcaa 
gaggaattga 
gaaagtaaag 
cgagctccta 
tatttcattc 
taatgttgta 
aaagtgtctt 
atcacctcat 
aaagtcagta 
ccagattctt 
aaacaccttg 
tattatcctt 
acctctctat 
gtatgaaatt 
taattct tag 
gtcatatact 
gaaataaaat 



ctcaatttga 
actcatcatt 
agattttggc 
ccaagtcagc 
taactgattt 
atgaagaagg 
acaagtggcg 
tagatgcttt 
agctaaataa 
aaat caagca 
aactccgctg 
ataaagttct 
aaaat tttgc 
tggagtgtca 
gtcttgttgt 
tttaatttta 
ttgacatttt 
tctgaaccct 
tcatattatt 
cagcttgatc 
gcttgttatt 
ctacttgtcc 
ttttaatcag 
ataatctgtg 
atacactaaa 
gcttgtcatc 
tacacagtaa 



aagctataaa 
acaggaaaaa 
aactgagagc 
actaaaggaa 
gcagaaccaa 
aagaaaagct 
tctcctctat 
tgaagtagaa 
aataagagat 
tgttgtgaag 
tcagcttgct 
aggtatcaaa 
cctgaagacc 
agaatcatgg 
tattgatgtt 
actgccaatc 
attttttctt 
ttcgctggct 
atcctcctgt 
ctggaggtct 
gcctctactt 
agttcaaata 
ttgtttaata 
gat tggcctt 
aatctgagaa 
tgcatgtcta 
gtcatttaac 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2957 



<210> 367 
<211> 79 
<212> PRT 

<213> Homo sapiens 
<400> 367 

Met Ser His Lys Gln Ile Tyr Tyr Ser Asp Lys Tyr Asp Asp Glu Glu 

1 5 10 15 

Phe Glu Tyr Arg His Val Met Leu Pro Lys Asp Ile Ala Lys Leu Val 

20 25 30 

Pro Lys Thr His Leu Met Ser Glu Ser Glu Trp Arg Asn Leu Gly Val 

35 40 45 

Gln Gln Ser Gln Gly Trp Val His Tyr Met Ile His Glu Pro Glu Pro 

50 55 60 

His Ile Leu Leu Phe Arg Arg Pro Leu Pro Lys Lys Pro Lys Lys 
65 70 75 



<210> 368 
<211> 717 
<212> DNA 

<213> Homo sapiens 
<400> 368 

agagcgatca tgtcgcacaa acaaatttac tattcggaca aatacgacga cgaggagttt 60 
gagtatcgac atgtcatgct gcccaaggac atagccaagc tggtccctaa aacccatctg 120 
atgtctgaat ctgaatggag gaatcttggc gttcagcaga gtcagggatg ggtccattat 180 
atgatccatg aaccagaacc tcacatcttg ctgttccggc gcccactacc caagaaacca 240 
aagaaatgaa gctggcaagc tacttttcag cctcaagctt tacacagctg tccttacttc 300 
ctaacatctt tctgataaca ttattatgtt gccttcttgt ttctcacttt gatatttaaa 360 
agatgttcaa tacactgttt gaatgtgctg gtaactgctt tgcttcttga gtagagccac 420 
caccaccata gcccagccag atgagtgctc tgtggaccca cagcctaagc tgagtgtgac 480 
ttattcaaga aatctgttca tgttaaaagc cttgattaaa gaggaagttt ttataat 717 



<210> 369 
<211> 191 
<212> PRT 
<213> Homo sapiens 



<400> 369 
Met Ala Thr Leu Ile Tyr Val Asp Lys Glu Ile Gly Glu Pro Gly Thr 
1 5 10 15 

Arg Val Ala Ala Lys Asp Val Leu Lys Leu Glu Ser Arg Pro Ser Ile 
20 25 30 

Lys Ala Leu Asp Gly Ile Ser Gln Val Leu Thr Pro Arg Phe Gly Lys 
35 40 45 

Thr Tyr Asp Ala Pro Ser Ala Leu Pro Lys Ala Thr Arg Lys Ala Leu 
50 55 60 

Gly Thr Val Asn Arg Ala Thr Glu Lys Ser Val Lys Thr Asn Gly Pro 
65 70 75 80 

Arg Lys Gln Lys Gln Pro Ser Phe Ser Ala Lys Lys Met Thr Glu Lys 
85 90 95 

Thr Val Lys Thr Lys Ser Ser Val Pro Ala Ser Asp Asp Ala Tyr Pro 
100 105 110 

Glu Ile Glu Lys Phe Phe Pro Phe Asn Leu Leu Asp Phe Glu Ser Phe 
115 120 125 

Asp Leu Pro Glu Glu Arg Gln Ile Ala His Leu Pro Leu Ser Gly Val 
130 135 140 

Pro Leu Met Ile Leu Asp Glu Glu Gly Glu Leu Glu Lys Leu Phe Gln 
145 150 155 160 

Leu Gly Pro Pro Ser Pro Val Lys Met Pro Ser Pro Pro Trp Glu Cys 
165 170 175 

Asn Leu Phe Ala Val Ser Phe Lys His Ser Val Asp Pro Gly Cys 
180 185 190 



<210> 370 
<211> 576 
<212> DNA 
<213> Homo sapiens 



<400> 370 

atggctactc tgatctacgt tgataaggaa attggagaac caggcacccg tgtggctgcc 60 
aaggatgtgc tgaagctgga gtctagacct tcaatcaaag cattagatgg gatatctcaa 120 
gttttaacac cacgttttgg caaaacatac gatgctccat cagccttacc taaagctacc 180 
agaaaggctt tgggcactgt caacagagct acagaaaagt cagtaaagac caatggaccc 240 
agaaaacaaa aacagccaag cttttctgcc aaaaagatga ccgagaagac tgttaaaaca 300 
aaaagttctg ttcctgcctc agatgacgcc tatccagaaa tagaaaaatt ctttcccttc 360 
aatcttctag actttgagag ttttgacctg cctgaagagc gccagattgc acacctcccc 420 
ttgagtggag tgcctctcat gatccttgat gaggagggag agcttgaaaa gctgtttcag 480 
ctgggccccc cttcacctgt gaaaatgccc tctccaccat gggaatgcaa tctgtttgca 540 
gtctccttca agcattctgt cgaccctgga tgttga 576 



<210> 371 
<211> 212 
<212> PRT 
<213> Homo sapiens 

<400> 371 

Met Lys Pro Pro Ser Ser Ile Gln Thr Ser Glu Phe Asp Ser Ser Asp 
1 5 10 15 

Gly Arg Ser Cys Leu Val Ala Ala Cys Leu Leu Leu Tyr Leu Ser Asp 
145 150 155 160 

Thr Ile Ser Pro Glu Gln Ala Ile Asp Ser Leu Arg Asp Leu Arg Gly 
165 170 175 

Ser Gly Ala Ile Gln Thr Ile Lys Gln Tyr Asn Tyr Leu His Glu Phe 
180 185 190 

Arg Asp Lys Leu Ala Ala His Leu Ser Ser Arg Asp Ser Gln Ser Arg 
195 200 205 

Ser Val Ser Arg 
210 



<210> 372 
<211> 836 
<212> DNA 
<213> Homo sapiens 

<400> 372 

ccggtgagtc gccggcgctg cagagggagg cggcactggt ctcgacgtgg ggcggccagc 60 
gatgaagccg cccagttcaa tacaaacaag tgagtttgac tcatcagatg aagagcctat 120 
tgaagatgaa cagactccaa ttcatatatc atggctatct ttgtcacgag tgaattgttc 180 
tcagtttctc ggtttatgtg ctcttccagg ttgtaaattt aaagatgtta gaagaaatgt 240 
ccaaaaagat acagaagaac taaagagctg tggtatacaa gacatatttg ttttctgcac 300 
cagaggggaa ctgtcaaaat atagagtccc aaaccttctg gatctctacc agcaatgtgg 360 
aattatcacc catcatcatc caatcgcaga tggagggact cctgacatag ccagctgctg 420 
tgaaataatg gaagagctta caacctgcct taaaaattac cgaaaaacct taatacactg 480 
ctatggagga cttgggagat cttgtcttgt agctgcttgt ctcctactat acctgtctga 540 
cacaatatca ccagagcaag ccatagacag cctgcgagac ctaagaggat ccggggcaat 600 
acagaccatc aagcaataca attatcttca tgagtttcgg gacaaattag ctgcacatct 660 
atcatcaaga gattcacaat caagatctgt atcaagataa aggaattcaa atagcatata 720 
tatgaccatg tctgaaatgt cagttctcta gcataatttg tattgaaatg aaaccaccag 780 
tgttatcaac ttgaatgtaa atgtacatgt gcagatattc ctaaagtttt attgac 836 

<210> 373 

<211> 1085 

<212> PRT 

<213> Homo sapiens 

<400> 373 

Met Asp Thr Pro Glu Asn Val Leu Gln Met Leu Glu Ala His Met Gln 

1 5 10 15 

Ser Tyr Lys Gly Asn Asp Pro Leu Gly Glu Trp Glu Arg Tyr Ile Gln 

20 25 30 

Trp Val Glu Glu Asn Phe Pro Glu Asn Lys Glu Tyr Leu Ile Thr Leu 




35 40 45 

Leu Glu His Leu Met Lys Glu Phe Leu Asp Lys Lys Lys Tyr His Asn 

50 55 60 

Asp Pro Arg Phe Ile Ser Tyr Cys Leu Lys Phe Ala Glu Tyr Asn Ser 

65 70 75 80 

Asp Leu His Gln Phe Phe Glu Phe Leu Tyr Asn His Gly Ile Gly Thr 

85 90 95 

Leu Ser Ser Pro Leu Tyr Ile Ala Trp Ala Gly His Leu Glu Ala Gln 

100 105 110 

Gly Glu Leu Gln His Ala Ser Ala Val Leu Gln Arg Gly Ile Gln Asn 

115 120 125 

Gln Ala Glu Pro Arg Glu Phe Leu Gln Gln Gln Tyr Arg Leu Phe Gln 

130 135 140 

Thr Arg Leu Thr Glu Thr His Leu Pro Ala Gln Ala Arg Thr Ser Glu 

145 150 155 160 

Pro Leu His Asn Val Gln Val Leu Asn Gln Met Ile Thr Ser Lys Ser 

165 170 175 

Asn Pro Gly Asn Asn Met Ala Cys Ile Ser Lys Asn Gln Gly Ser Glu 

180 185 190 

Leu Ser Gly Val Ile Ser Ser Ala Cys Asp Lys Glu Ser Asn Met Glu 

195 200 205 

Arg Arg Val Ile Thr Ile Ser Lys Ser Glu Tyr Ser Val His Ser Ser 

210 215 220 

Leu Ala Ser Lys Val Asp Val Glu Gln Val Val Met Tyr Cys Lys Glu 

225 230 235 240 

Lys Leu Ile Arg Gly Glu Ser Glu Phe Ser Phe Glu Glu Leu Arg Ala 

245 250 255 

Gln Lys Tyr Asn Gln Arg Arg Lys His Glu Gln Trp Val Asn Glu Asp 

260 265 270 

Arg His Tyr Met Lys Arg Lys Glu Ala Asn Ala Phe Glu Glu Gln Leu 

275 280 285 

Leu Lys Gln Lys Met Asp Glu Leu His Lys Lys Leu His Gln Val Val 

290 295 300 

Glu Thr Ser His Glu Asp Leu Pro Ala Ser Gln Glu Arg Ser Glu Val 

305 310 315 320 

Asn Pro Ala Arg Met Gly Pro Ser Val Gly Ser Gln Gln Glu Leu Arg 

325 330 335 

Ala Pro Cys Leu Pro Val Thr Tyr Gln Gln Thr Pro Val Asn Met Glu 

340 345 350 

Lys Asn Pro Arg Glu Ala Pro Pro Val Val Pro Pro Leu Ala Asn Ala 

355 360 365 

Ile Ser Ala Ala Leu Val Ser Pro Ala Thr Ser Gln Ser Ile Ala Pro 

370 375 380 

Pro Val Pro Leu Lys Ala Gln Thr Val Thr Asp Ser Met Phe Ala Val 

385 390 395 400 

Ala Ser Lys Asp Ala Gly Cys Val Asn Lys Ser Thr His Glu Phe Lys 

405 410 415 

Pro Gln Ser Gly Ala Glu Ile Lys Glu Gly Cys Glu Thr His Lys Val 

420 425 430 

Ala Asn Thr Ser Ser Phe His Thr Thr Pro Asn Thr Ser Leu Gly Met 

435 440 445 

Val Gln Ala Thr Pro Ser Lys Val Gln Pro Ser Pro Thr Val His Thr 

450 455 460 

Lys Glu Ala Leu Gly Phe Ile Met Asn Met Phe Gln Ala Pro Thr Leu 

465 470 475 480 

Pro Asp Ile Ser Asp Asp Lys Asp Glu Trp Gln Ser Leu Asp Gln Asn 

485 490 495 

Glu Asp Ala Phe Glu Ala Gln Phe Gln Lys Asn Val Arg Ser Ser Gly 

500 505 510 

Ala Trp Gly Val Asn Lys Ile Ile Ser Ser Leu Ser Ser Ala Phe His 

515 520 525 
1010 1015 1020 

Leu Asp Met Trp Asn Glu Phe Phe His Val Met Leu Asn Ile Pro Asp 

1025 1030 1035 1040 

Cys His His Leu Pro Ser Leu Asp Leu Leu Arg Gln Lys Leu Lys Lys 

1045 1050 1055 

Val Phe Gln Gln His Tyr Thr Asn Lys Ile Arg Ala Leu Arg Asn Arg 

1060 1065 1070 

Leu Ile Val Leu Leu Leu Glu Cys Lys Arg Ser Arg Lys 
1075 1080 1085 



<210> 374 
<211> 3446 
<212> DNA 

<213> Homo sapiens 



<400> 374 
ttctagtttg cggttcaggt ttgccgctgc cggccagcgt cctctggcca tggacacccc 60 
ggaaaatgtc cttcagatgc ttgaagccca catgcagagc tacaagggca atgaccctct 120 
tggtgaatgg gaaagataca tacagtgggt agaagagaat tttcctgaga ataaagaata 180 
cttgataact ttactagaac atttaatgaa ggaattttta gataagaaga aataccacaa 240 
tgacccaaga ttcatcagtt attgtttaaa atttgctgag tacaacagtg acctccatca 300 
attttttgag tttctgtaca accatgggat tggaaccctg tcatcccctc tgtacattgc 360 
ctgggcgggg catctggaag cccaaggaga gctgcagcat gccagtgctg tccttcagag 420 
aggaattcaa aaccaggctg aacccagaga gttcctgcaa caacaataca ggttatttca 480 
gacacgcctc actgaaaccc atttgccagc tcaagctaga acctcagaac ctctgcataa 540 
tgttcaggtt ttaaatcaaa tgataacatc aaaatcaaat ccaggaaata acatggcctg 600 
catttctaag aatcagggtt cagagctttc tggagtgata tcttcagctt gtgataaaga 660 
gtcaaatatg gaacgaagag tgatcacgat ttctaaatca gaatattctg tgcactcatc 720 
tttggcatcc aaagttgatg ttgagcaggt tgttatgtat tgcaaggaga agcttattcg 780 
tggggaatca gaattttcct ttgaagaatt gagagcccag aaatacaatc aacggagaaa 840 
gcatgagcaa tgggtaaatg aagacagaca ttatatgaaa aggaaagaag caaatgcttt 900 
tgaagaacag ctattaaaac agaaaatgga tgaacttcat aagaagttgc atcaggtggt 960 
ggagacatcc catgaggatc tgcccgcttc ccaggaaagg tccgaggtta atccagcacg 1020 
tatggggcca agtgtaggct cccagcagga actgagagcg ccatgtcttc cagtaaccta 1080 
tcagcagaca ccagtgaaca tggaaaagaa cccaagagag gcacctcctg ttgttcctcc 1140 
tttggcaaat gctatttctg cagctttggt gtccccagcc accagccaga gcattgctcc 1200 
tcctgttcct ttgaaagccc agacagtaac agactccatg tttgcagtgg ccagcaaaga 1260 
tgctggatgt gtgaataaga gtactcatga attcaagcca cagagtggag cagagatcaa 1320 
agaagggtgt gaaacacata aggttgccaa cacaagttct tttcacacaa ctccaaacac 1380 
atcactggga atggttcagg caacgccatc caaagtgcag ccatcaccca ccgtgcacac 1440 
aaaagaagca ttaggtttca tcatgaatat gtttcaggct cctacacttc ctgatatttc 1500 
tgatgacaaa gatgaatggc aatctctaga tcaaaatgaa gatgcatttg aagcccagtt 1560 
tcaaaaaaat gtaaggtcat ctggggcttg gggagtcaat aagatcatct cttctttgtc 1620 
atctgctttt catgtgtttg aagatggaaa caaagaaaat tatggattac cacagcctaa 1680 
aaataaaccc acaggagcca ggacctttgg agaacgctct gtcagcagac ttccttcaaa 1740 
accaaaggag gaagtgcctc atgctgaaga gtttttggat gactcaactg tatggggtat 1800 
tcgctgcaac aaaaccctgg cacccagtcc taagagccca ggagacttca catctgctgc 1860 
acaacttgcg tctacaccat tccacaagct tccagtggag tcagtgcaca ttttagaaga 1920 
taaagaaaat gtggtagcaa aacagtgtac ccaggcgact ttggattctt gtgaggaaaa 1980 
catggtggtg ccttcaaggg atggaaaatt cagtccaatt caagagaaaa gcccaaaaca 2040 
ggccttgtcg tctcacatgt attcagcatc cttacttcgt ctgagccagc ctgctgcagg 2100 
tggggtactt acctgtgagg cagagttggg cgttgaggct tgcagactca cagacactga 2160 
cgctgccatt gcagaagatc caccagatgc tattgctggg ctccaagcag aatggatgca 2220 
gatgagttca cttgggactg ttgatgctcc aaacttcatt gttgggaacc catgggatga 2280 
taagctgatt ttcaaacttt tatctgggct ttctaaacca gtgagttcct atccaaatac 2340 
ttttgaatgg caatgtaaac ttccagccat caagcccaag actgaatttc aattgggttc 2400 
taagctggtc tatgtccatc accttcttgg agaaggagcc tttgcccagg tgtacgaagc 2460 
tacccaggga gatctgaatg atgctaaaaa taaacagaaa tttgttttaa aggtccaaaa 2520 
gcctgccaac ccctgggaat tctacattgg gacccagttg atggaaagac taaagccatc 2580 
tatgcagcac atgtttatga agttctattc tgcccactta ttccagaatg gcagtgtatt 2640 
agtaggagag ctctacagct atggaacatt attaaatgcc attaacctct ataaaaatac 2700 
ccctgaaaaa gtgatgcctc aaggtcttgt catctctttt gctatgagaa tgctttacat 2760 
gattgagcaa gtgcatgact gtgaaatcat tcatggagac attaaaccag acaatttcat 2820 
acttggaaac ggatttttgg aacaggatga tgaagatgat ttatctgctg gcttggcact 2880 
gattgacctg ggtcagagta tagatatgaa actttttcca aaaggaacta tattcacagc 2940 
aaagtgtgaa acatctggtt ttcagtgtgt tgagatgctc agcaacaaac catggaacta 3000 
ccagatcgat tactttgggg ttgctgcaac agtatattgc atgctctttg gcacttacat 3060 
gaaagtgaaa aatgaaggag gagagtgtaa gcctgaaggt ctttttagaa ggcttcctca 3120 
tttggatatg tggaatgaat tttttcatgt tatgttgaat attccagatt gtcatcatct 3180 
tccatctttg gatttgttaa ggcaaaagct gaagaaagta tttcaacaac actatactaa 3240 
caagattagg gccctacgta ataggctaat tgtactgctc ttagaatgta agcgttcacg 3300 
aaaataaaat ttggatatag acagtcctta aaaatcacac tgtaaatatg aatctgctca 3360 
ctttaaacct gttttttttt catttattgt ttatgtaaat gtttgttaaa aataaatccc 3420 
atggaatatt tccatgtaaa aaaaaa 3446 



<210> 375 
<211> 724 
<212> PRT 
<213> Homo sapiens 



<400> 375 



Met Ser Phe Pro Lys Ala Pro Leu Lys Arg Phe Asn Asp Pro Ser Gly 
1 5 10 15 

Cys Ala Pro Ser Pro Gly Ala Tyr Asp Val Lys Thr Leu Glu Val Leu 
20 25 30 

Lys Gly Pro Val Ser Phe Gln Lys Ser Gln Arg Phe Lys Gln Gln Lys 
35 40 45 

Glu Ser Lys Gln Asn Leu Asn Val Asp Lys Asp Thr Thr Leu Pro Ala 
50 55 60 

Ser Ala Arg Lys Val Lys Ser Ser Glu Ser Lys Glu Ser Gln Lys Asn 
65 70 75 80 

Asp Lys Asp Leu Lys Ile Leu Glu Lys Glu Ile Arg Val Leu Leu Gln 
85 90 95 

Glu Arg Gly Ala Gln Asp Ser Arg Ile Gln Asp Leu Glu Thr Glu Leu 
100 105 110 

Glu Lys Met Glu Ala Arg Leu Asn Ala Ala Leu Arg Glu Lys Thr Ser 
115 120 125 

Leu Ser Ala Asn Asn Ala Thr Leu Glu Lys Gln Leu Ile Glu Leu Thr 
130 135 140 

Arg Thr Asn Glu Leu Leu Lys Ser Lys Phe Ser Glu Asn Gly Asn Gln 
145 150 155 160 

Lys Asn Leu Arg Ile Leu Ser Leu Glu Leu Met Lys Leu Arg Asn Lys 
165 170 175 

Arg Glu Thr Lys Met Arg Gly Met Met Ala Lys Gln Glu Gly Met Glu 
180 185 190 

Met Lys Leu Gln Val Thr Gln Arg Ser Leu Glu Glu Ser Gln Gly Lys 
195 200 205 

Ile Ala Gln Leu Glu Gly Lys Leu Val Ser Ile Glu Lys Glu Lys Ile 
210 215 220 

Asp Glu Lys Ser Glu Thr Glu Lys Leu Leu Glu Tyr Ile Glu Glu Ile 
225 230 235 240 

Ser Cys Ala Ser Asp Gln Val Glu Lys Tyr Lys Leu Asp Ile Ala Gln 
245 250 255 

Leu Glu Glu Asn Leu Lys Glu Lys Asn Asp Glu Ile Leu Ser Leu Lys 
260 265 270 

Gln Ser Leu Glu Glu Asn Ile Val Ile Leu Ser Lys Gln Val Glu Asp 
275 280 285 

Leu Asn Val Lys Cys Gln Leu Leu Glu Lys Glu Lys Glu Asp His Val 
290 295 300 

Asn Arg Asn Arg Glu His Asn Glu Asn Leu Asn Ala Glu Met Gln Asn 
305 310 315 320 
Leu Lys Gln Lys Phe Ile Leu Glu Gln Gln Glu Arg Glu Lys Leu Gln 
325 330 335 

Gln Lys Glu Leu Gln Ile Asp Ser Leu Leu Gln Gln Glu Lys Glu Leu 
340 345 350 

Ser Ser Ser Leu His Gln Lys Leu Cys Ser Phe Gln Glu Glu Met Val 
355 360 365 

Lys Glu Lys Asn Leu Phe Glu Glu Glu Leu Lys Gln Thr Leu Asp Glu 
370 375 380 

Leu Asp Lys Leu Gln Gln Lys Glu Glu Gln Ala Glu Arg Leu Val Lys 
385 390 395 400 

Gln Leu Glu Glu Glu Ala Lys Ser Arg Ala Glu Glu Leu Lys Leu Leu 
405 410 415 

Glu Glu Lys Leu Lys Gly Lys Glu Ala Glu Leu Glu Lys Ser Ser Ala 
420 425 430 

Ala His Thr Gln Ala Thr Leu Leu Leu Gln Glu Lys Tyr Asp Ser Met 
435 440 445 

Val Gln Ser Leu Glu Asp Val Thr Ala Gln Phe Glu Ser Tyr Lys Ala 
450 455 460 

Leu Thr Ala Ser Glu Ile Glu Asp Leu Lys Leu Glu Asn Ser Ser Leu 
465 470 475 480 

Gln Glu Lys Ala Ala Lys Ala Gly Lys Asn Ala Glu Asp Val Gln His 
485 490 495 

Gln Ile Leu Ala Thr Glu Ser Ser Asn Gln Glu Tyr Val Arg Met Leu 
500 505 510 

Leu Asp Leu Gln Thr Lys Ser Ala Leu Lys Glu Thr Glu Ile Lys Glu 
515 520 525 

Ile Thr Val Ser Phe Leu Gln Lys Ile Thr Asp Leu Gln Asn Gln Leu 
530 535 540 

Lys Gln Gln Glu Glu Asp Phe Arg Lys Gln Leu Glu Asp Glu Glu Gly 
545 550 555 560 

Arg Lys Ala Glu Lys Glu Asn Thr Thr Ala Glu Leu Thr Glu Glu Ile 
565 570 575 

Asn Lys Trp Arg Leu Leu Tyr Glu Glu Leu Tyr Asn Lys Thr Lys Pro 
580 585 590 

Phe Gln Leu Gln Leu Asp Ala Phe Glu Val Glu Lys Gln Ala Leu Leu 
595 600 605 

Asn Glu His Gly Ala Ala Gln Glu Gln Leu Asn Lys Ile Arg Asp Ser 
610 615 620 

Tyr Ala Lys Leu Leu Gly His Gln Asn Leu Lys Gln Lys Ile Lys His 
625 630 635 640 

Val Val Lys Leu Lys Asp Glu Asn Ser Gln Leu Lys Ser Glu Val Ser 
645 650 655 

Lys Leu Arg Cys Gln Leu Ala Lys Lys Lys Gln Ser Glu Thr Lys Leu 
660 665 670 

Gln Glu Glu Leu Asn Lys Val Leu Gly Ile Lys His Phe Asp Pro Ser 
675 680 685 

Lys Ala Phe His His Glu Ser Lys Glu Asn Phe Ala Leu Lys Thr Pro 
690 695 700 

Leu Lys Glu Gly Asn Thr Asn Cys Tyr Arg Ala Pro Met Glu Cys Gln 
705 710 715 720 

Glu Ser Trp Lys 



<210> 376 
<211> 3002 
<212> DNA 

<213> Homo sapiens 
<400> 376 

gccagtcacc ttcagtttct ggagctggcc gtcaacatgt cctttcctaa ggcgcccttg 60 
aaacgattca atgacccttc tggttgtgca ccatctccag gtgcttatga tgttaaaact 120 
ttagaagtat tgaaaggacc agtatccttt cagaaatcac aaagatttaa acaacaaaaa 180 
gaatctaaac aaaatcttaa tgttgacaaa gatactacct tgcctgcttc agctagaaaa 240 
gttaagtctt cggaatcaaa ggaatctcaa aagaatgata aagatttgaa gatattagag 300 
aaagagattc gtgttcttct acaggaacgt ggtgcccagg acagccggat ccaggatctg 360 
gaaactgagt tggaaaagat ggaagcaagg ctaaatgctg cactaaggga aaaaacatct 420 
ctctctgcaa ataatgctac actggaaaaa caacttattg aattgaccag gactaatgaa 480 
ctactaaaat ctaagttttc tgaaaatggt aaccagaaga atttgagaat tctaagcttg 540 
gagttgatga aacttagaaa caaaagagaa acaaagatga ggggtatgat ggctaagcaa 600 
gaaggcatgg agatgaagct gcaggtcacc caaaggagtc tcgaagagtc tcaagggaaa 660 
atagcccaac tggagggaaa acttgtttca atagagaaag aaaagattga tgaaaaatct 720 
gaaacagaaa aactcttgga atacatcgaa gaaattagtt gtgcttcaga tcaagtggaa 780 
aaatacaagc tagatattgc ccagttagaa gaaaatttga aagagaagaa tgatgaaatt 840 
ttaagcctta agcagtctct tgaggagaat attgttatat tatctaaaca agtagaagat 900 
ctaaatgtga aatgtcagct gcttgaaaaa gaaaaagaag accatgtcaa caggaataga 960 
gaacacaacg aaaatctaaa tgcagagatg caaaacttaa aacagaagtt tattcttgaa 1020 
caacaggaac gtgaaaagct tcaacaaaaa gaattacaaa ttgattcact tctgcaacaa 1080 
gagaaagaat tatcttcgag tcttcatcag aagctctgtt cttttcaaga ggaaatggtt 1140 
aaagagaaga atctgtttga ggaagaatta aagcaaacac tggatgagct tgataaatta 1200 
cagcaaaagg aggaacaagc tgaaaggctg gtcaagcaat tggaagagga agcaaaatct 1260 
agagctgaag aattaaaact cctagaagaa aagctgaaag ggaaggaggc tgaactggag 1320 
aaaagtagtg ctgctcatac ccaggccacc ctgcttttgc aggaaaagta tgacagtatg 1380 
gtgcaaagcc ttgaagatgt tactgctcaa tttgaaagct ataaagcgtt aacagccagt 1440 
gagatagaag atcttaagct ggagaactca tcattacagg aaaaagcggc caaggctggg 1500 
aaaaatgcag aggatgttca gcatcagatt ttggcaactg agagctcaaa tcaagaatat 1560 
gtaaggatgc ttctagatct gcagaccaag tcagcactaa aggaaacaga aattaaagaa 1620 
atcacagttt cttttcttca aaaaataact gatttgcaga accaactcaa gcaacaggag 1680 
gaagacttta gaaaacagct ggaagatgaa gaaggaagaa aagctgaaaa agaaaataca 1740 
acagcagaat taactgaaga aattaacaag tggcgtctcc tctatgaaga actatataat 1800 
aaaacaaaac cttttcagct acaactagat gcttttgaag tagaaaaaca ggcattgttg 1860 
aatgaacatg gtgcagctca ggaacagcta aataaaataa gagattcata tgctaaatta 1920 
ttgggtcatc agaatttgaa acaaaaaatc aagcatgttg tgaagttgaa agatgaaaat 1980 
agccaactca aatcggaagt atcaaaactc cgctgtcagc ttgctaaaaa aaaacaaagt 2040 
gagacaaaac ttcaagagga attgaataaa gttctaggta tcaaacactt tgatccttca 2100 
aaggcttttc atcatgaaag taaagaaaat tttgccctga agaccccatt aaaagaaggc 2160 
aatacaaact gttaccgagc tcctatggag tgtcaagaat catggaagta aacatctgag 2220 
aaacctgttg aagattattt cattcgtctt gttgttattg atgttgctgt tattatattt 2280 
gacatgggta ttttataatg ttgtatttaa ttttaactgc caatccttaa atatgtgaaa 2340 
ggaacatttt ttaccaaagt gtcttttgac attttatttt ttcttgcaaa tacctcctcc 2400 
ctaatgctca cctttatcac ctcattctga accctttcgc tggctttcca gcttagaatg 2460 
catctcatca acttaaaagt cagtatcata ttattatcct cctgttctga aaccttagtt 2520 
tcaagagtct aaaccccaga ttcttcagct tgatcctgga ggtcttttct agtctgagct 2580 
tctttagcta ggctaaaaca ccttggcttg ttattgcctc tactttgatt ctgataatgc 2640 
tcacttggtc ctacctatta tccttctact tgtccagttc aaataagaaa taaggacaag 2700 
cctaacttca tagaaacctc tctattttta atcagttgtt taataattta caggttctta 2760 
ggctccatcc tgtttgtatg aaattataat ctgtggattg gcctttaagc ctgcattctt 2820 
aacaaactct tcagttaatt cttagataca ctaaaaatct gagaaactct acatgtaact 2880 
atttcttcag agtttgtcat atactgcttg tcatctgcat gtctactcag catttgatta 2940 
acatttgtgt aatatgaaat aaaattacac agtaagtcat ttaaccaaaa aaaaaaaaaa 3000 
aa 3002 

<210> 377 
<211> 246 
<212> PRT 

<213> Homo sapiens 
<400> 377 

Met Asn Asp Trp Met Pro Ile Ala Lys Glu Tyr Asp Pro Leu Lys Ala 

1 5 10 15 

Gly Ser Ile Asp Gly Thr Asp Glu Asp Pro His Asp Arg Ala Val Trp 
20 25 30 

Arg Ala Met Leu Ala Arg Tyr Val Pro Asn Lys Gly Val Ile Gly Asp 
35 40 45 

Pro Leu Leu Thr Leu Phe Val Ala Arg Leu Asn Leu Gln Thr Lys Glu 
50 55 60 

Asp Lys Leu Lys Glu Val Phe Ser Arg Tyr Gly Asp Ile Arg Arg Leu 
65 70 75 80 

Arg Leu Val Arg Asp Leu Val Thr Gly Phe Ser Lys Gly Tyr Ala Phe 
85 90 95 

Ile Glu Tyr Lys Glu Glu Arg Ala Val Ile Lys Ala Tyr Arg Asp Ala 
100 105 110 

Asp Gly Leu Val Ile Asp Gln His Glu Ile Phe Val Asp Tyr Glu Leu 
115 120 125 

Glu Arg Thr Leu Lys Gly Trp Ile Pro Arg Arg Leu Gly Gly Gly Leu 
130 135 140 

Gly Gly Lys Lys Glu Ser Gly Gln Leu Arg Phe Gly Gly Arg Asp Arg 
145 150 155 160 

Pro Phe Arg Lys Pro Ile Asn Leu Pro Val Val Lys Asn Asp Leu Tyr 
165 170 175 

Arg Glu Gly Lys Arg Glu Arg Arg Glu Arg Ser Arg Ser Arg Glu Arg 
180 185 190 

His Trp Asp Ser Arg Thr Arg Asp Arg Asp His Asp Arg Gly Arg Glu 
195 200 205 

Lys Arg Trp Gln Glu Arg Glu Pro Thr Arg Val Trp Pro Asp Asn Asp 
210 215 220 

Trp Glu Arg Glu Arg Asp Phe Arg Asp Asp Arg Ile Lys Gly Arg Glu 
225 230 235 240 

Lys Lys Glu Arg Gly Lys 
245 



<210> 378 

<211> 1509 

<212> DNA 

<213> Homo sapiens 

<400> 378 



tSS ScSg a a a g q g q C a T g9C ^^-^^ 60 
gagtcgccca ggctggagag cggtggcgcg aagCtttgca gagagacagg 120 

ggtcaagcga tcctccagcc tcggcctctc L ?! ° ^tgcagcgt cgacatttcg 180 
gcgtgcctgg ccaaaaa^tt tc?aaat?tg ! ? r J a "»«TO<= Qtgagccagc 240 
atagacacca tatgatctaa tcttcacaaf tttttt t tataatt 9 ta atgcattttt 300 
cacaataaga atcctgaagc tc^SS" ctgaccfcaa* Scf " * gCtaCattt 3 ™ 
ggctgaggca ggcgggtcat ctgacatcag gagtSaoa 39 cac "tggga 420 

aaatcctgtc tgtactaaaa atacaaaaJ III I 9 ccagcttggc caacatggtg 480 
cccagctact cggcctccca aa'tg^ggg ^ Jggtggtgtg tgtctgtaa? 540 

cagccaaggt ttttaagtaa catatttcaa f tgagccaccg cgtctggcct 600 

cgattggatg cccatcgcca agglgtatga tcca^caaa -^catgaa 660 

cgatgaagac ccacacgacc gcgcgqtcta ataanrtt gCg 9gcagca ttgatggcac 720 
caaaggtgtc ataggagatc cccSctcac cc ot ! 9 Ct ^«cgat atgtccccaa 780 
caaggaggac aaattaaagg aagtcttttc ccqcta^oa? 9CCagaCtaa ^cttgcagac 840 
ggtcagggac ttggtcacag gt?tttcaaa gggctacg?c S ^ ^ Ct ^ct 900 
gcgtgccgtg atcaaagctt accgagatqc taatoa^ " CatCgaat a^aaggagga 960 
atttgtggac tacgagctgg aaaggacS Q 9 S"attgacc agcatgagat 1020 

cggtcttggg ggaaaaaagg agtctgggca actaaL^? atCcctc 99<= gacttggagg 1080 
tcgaaaacct attaacttgc cagttg??aa a " ^ggggacggg accggccttt 1140 

aaggcgggag cgatctcgat ccjgjgaaag ^"ct'gggac 'f"^ ^ aa -cggga 1200 
ccatgacagg ggccgggaga agagatggca agaaagaaaa llZl"™** gggatcga 9 a 1260 
caatgactgg gagagagaga gggact?cag aga^fcaao ^ gaCCaggg tgtggcccga 1320 
ggaaagaggc aagtagaggc ccaacagcag aaccccaaaa tlt**??^* gggagaagaa "80 
gtggaggggg attgtctttc aacgcagcgt gagtc^Sat 9 ^f 39 " 3 " gtg 9 aa atga 1440 

y yt gagtctaatg gttgaataaa acttactgat 1500 
gatcaaaaa 



1509 



<210> 379 

<211> 246 

<212> PRT 

<213> Homo sapiens 

<400> 379 

Met Asn Asp Trp Met Pro Ile Ala Lys Glu Tyr Asp Pro Leu Lys Ala 
1 5 10 15 

Gly Ser Ile Asp Gly Thr Asp Glu Asp Pro His Asp Arg Ala Val Trp 
20 25 30 

Arg Ala Met Leu Ala Arg Tyr Val Pro Asn Lys Gly Val Ile Gly Asp 
35 40 45 

Pro Leu Leu Thr Leu Phe Val Ala Arg Leu Asn Leu Gln Thr Lys Glu 
50 55 60 

Asp Lys Leu Lys Glu Val Phe Ser Arg Tyr Gly Asp Ile Arg Arg Leu 
65 70 75 80 

Arg Leu Val Arg Asp Leu Val Thr Gly Phe Ser Lys Gly Tyr Ala Phe 
85 90 95 

Ile Glu Tyr Lys Glu Glu Arg Ala Val Ile Lys Ala Tyr Arg Asp Ala 
100 105 110 

Asp Gly Leu Val Ile Asp Gln His Glu Ile Phe Val Asp Tyr Glu Leu 
115 120 125 

Glu Arg Thr Leu Lys Gly Trp Ile Pro Arg Arg Leu Gly Gly Gly Leu 
130 135 140 

Gly Gly Lys Lys Glu Ser Gly Gln Leu Arg Phe Gly Gly Arg Asp Arg 
145 150 155 160 

Pro Phe Arg Lys Pro Ile Asn Leu Pro Val Val Lys Asn Asp Leu Tyr 
165 170 175 

Arg Glu Gly Lys Arg Glu Arg Arg Glu Arg Ser Arg Ser Arg Glu Arg 
180 185 190 

His Trp Asp Ser Arg Thr Arg Asp Arg Asp His Asp Arg Gly Arg Glu 
195 200 205 

Lys Arg Trp Gln Glu Arg Glu Pro Thr Arg Val Trp Pro Asp Asn Asp 
210 215 220 

Trp Glu Arg Glu Arg Asp Phe Arg Asp Asp Arg Ile Lys Gly Arg Glu 
225 230 235 240 

Lys Lys Glu Arg Gly Lys 
245 



<210> 380 
<211> 967 
<212> DNA 
<213> Homo sapiens 



<400> 380 

ggcgaatctg ctgctaccaa tgtaaaggtc aggccgaggc cggcgcggag aatctgctgt 60 
cgcctgcagc tgctcgcctg tctccgtcgg aagggagccc aagctttgca gagaacatga 120 
acgattggat gcccatcgcc aaggagtatg atccactcaa agcgggcagc attgatggca 180 
ccgatgaaga cccacacgac cgcgcggtct ggagggcaat gctggcacga tatgtcccca 240 
acaaaggtgt cataggagat cccctcctca ccctgtttgt ggccagacta aacttgcaga 300 
ccaaggagga caaattaaag gaagtctttt cccgctatgg tgacatccgg cggcttcggc 360 
tggtcaggga cttggtcaca ggtttttcaa agggctacgc cttcatcgaa tacaaggagg 420 
agcgtgccgt gatcaaagct taccgagatg ctgatggcct ggttattgac cagcatgaga 480 
tatttgtgga ctacgagctg gaaaggactc tcaaagggtg gatccctcgg cgacttggag 540 
gcggtcttgg gggaaaaaag gagtctgggc aactgagatt tgggggacgg gaccggcctt 600 
ttcgaaaacc tattaacttg ccagttgtta aaaacgacct ctatagagag ggaaaacggg 660 
aaaggcggga gcgatctcga tcccgagaaa gacactggga ctcgaggaca agggatcgag 720 
accatgacag gggccgggag aagagatggc aagaaagaga gccgaccagg gtgtggcccg 780 
acaatgactg ggagagagag agggacttca gagatgacag gatcaagqqg aaomfiaana a . n 
aggaaagagg caagtagagg cccaacagca gaaccccaaa gtgaag??a2 aat aaa * ta t«S 
agtggagggg gattgtcttt caacgcagcg tgagtctaat U^Lltt 



967 



<210> 381 
<211> 226 
<212> PRT 
<213> Homo sapiens 



<400> 381 

Met Ser Glu Thr Ala Pro Ala Glu Thr Ala Thr Pro Ala Pro Val Glu 
1 5 10 15 

Lys Ser Pro Ala Lys Lys Lys Ala Thr Lys Lys Ala Ala Gly Ala Gly 
20 25 30 

Ala Ala Lys Arg Lys Ala Thr Gly Pro Pro Val Ser Glu Leu Ile Thr 
35 40 45 

Lys Ala Val Ala Ala Ser Lys Glu Arg Asn Gly Leu Ser Leu Ala Ala 
50 55 60 

Leu Lys Lys Ala Leu Ala Ala Gly Gly Tyr Asp Val Glu Lys Asn Asn 
65 70 75 80 

Ser Arg Ile Lys Leu Gly Leu Lys Ser Leu Val Ser Lys Gly Thr Leu 
85 90 95 

Val Gln Thr Lys Gly Thr Gly Ala Ser Gly Ser Phe Lys Leu Asn Lys 
100 105 110 

Lys Ala Ala Ser Gly Glu Ala Lys Pro Lys Ala Lys Lys Ala Gly Ala 
115 120 125 

Ala Lys Ala Lys Lys Pro Ala Gly Ala Thr Pro Lys Lys Ala Lys Lys 
130 135 140 

Ala Ala Gly Ala Lys Lys Ala Val Lys Lys Thr Pro Lys Lys Ala Lys 
145 150 155 160 

Lys Pro Ala Ala Ala Gly Val Lys Lys Val Ala Lys Ser Pro Lys Lys 
165 170 175 

Ala Lys Ala Ala Ala Lys Pro Lys Lys Ala Thr Lys Ser Pro Ala Lys 
180 185 190 

Pro Lys Ala Val Lys Pro Lys Ala Ala Lys Pro Lys Ala Ala Lys Pro 
195 200 205 

Lys Ala Ala Lys Pro Lys Ala Ala Lys Ala Lys Lys Ala Ala Ala Lys 
210 215 220 

Lys Lys 
225 



<210> 382 
<211> 790 
<212> DNA 
<213> Homo sapiens 



<400> 382 



aacctgctct ttagatttcg agcttattct cttctagcag tttcttocca ceai-ai-™, an 

ggca^taM C « CCC «* C gccggtg'gag 9 aaStccccg" SiSSS 0 

ggcaactaag aaggctgccg gcgccggcgc tgctaagcgc aaagcgacqg aocccrrL i«n 

ctcagagctg atcaccaagg ctgtggotgc ttctaaggag cgcaa^c ??^«arc 24 0 

111*1 TtlllttTcl SSSSc" CtaCgaC ^ s'gaaga^ ^cc^S llo 
tgc?tc?ggc tccrttal^ ^gtgagcaa gggcaccctg gtgcagacca agggcactgg 360 

si sii If =s =e ~ ~ 

-=s b=i i-ES SS» S5 355 1 
caaaaagaag taggaagctg gcgtgtgaaa accgcaacaa agccccaaag gctcttttca 780 
gagccaccca 790 

<210> 383 
<211> 202 
<212> PRT 

<213> Homo sapiens 



<400> 383 



Met Ala Thr Leu Ile Tyr Val Asp Lys Glu Asn Glu Glu Pro Gly Ile 
1 5 10 15 

Leu Val Ala Thr Lys Asp Gly Leu Lys Leu Gly Ser Gly Pro Ser Ile 
20 25 30 

Lys Ala Leu Asp Gly Arg Ser Gln Val Ser Ile Ser Cys Phe Gly Lys 
35 40 45 

Thr Phe Asp Ala Pro Thr Ser Leu Pro Lys Ala Thr Arg Lys Ala Leu 
50 55 60 

Gly Thr Val Asn Arg Ala Thr Glu Lys Ser Val Lys Thr Asn Gly Pro 
65 70 75 80 

Leu Lys Gln Lys Gln Pro Ser Phe Ser Ala Lys Lys Met Thr Glu Lys 
85 90 95 

Thr Val Lys Ala Lys Asn Ser Val Pro Ala Ser Asp Asp Gly Tyr Pro 
100 105 110 

Glu Ile Glu Lys Leu Phe Pro Phe Asn Pro Leu Gly Phe Glu Ser Phe 
115 120 125 

Asp Leu Pro Glu Glu His Gln Ile Ala His Leu Pro Leu Ser Glu Val 
130 135 140 

Pro Leu Met Ile Leu Asp Glu Glu Arg Glu Leu Glu Lys Leu Phe Gln 
145 150 155 160 

Leu Gly Pro Pro Ser Pro Leu Lys Met Pro Ser Pro Pro Trp Lys Ser 
165 170 175 

Asn Leu Leu Gln Ser Pro Leu Ser Ile Leu Leu Thr Leu Asp Val Glu 
180 185 190 

Leu Pro Pro Val Cys Cys Asp Ile Asp Ile 
195 200 















<210> 384 
<211> 609 
<212> DNA 
<213> Homo sapiens 



<400> 384 

atggctactc tgatctatgt tgataaggaa aacgaagaac caggcatcct tgtggctaca 60

aaggatgggc tgaagctggg gtctggacct tcaatcaaag ccttagatgg gagatctcaa 120

gtttcaatat catgttttgg caaaacattc gatgctccca catccttacc taaagctacc 180

agaaaggctt tgggaactgt caacagagct acagaaaagt cagtaaagac caatggaccc 240

ctcaaacaaa aacagccaag cttttctgcc aaaaagatga ctgagaagac tgttaaagca 300

aaaaactctg ttcctgcctc agatgatggc tatccagaaa tagaaaaatt atttcccttc 360

aatcctctag gcttcgagag ttttgacctg cctgaagagc accagattgc acatctcccc 420

ttgagtgaag tgcctctcat gatacttgat gaggagagag agcttgaaaa gctgtttcag 480

ctgggccccc cttcaccttt gaagatgccc tctccaccat ggaaatccaa tctgttgcag 540

tctcctttaa gcattctgtt gaccctggat gttgaattgc cacctgtttg ctgtgacata 600

gatatttaa 609



<210> 385 
<211> 322 
<212> PRT 

<213> Homo sapiens 



<400> 385 
Met Glu Gly Ile Ser Asn Phe Lys Thr Pro Ser Lys Leu Ser Glu Lys
1 5 10 15

Lys Lys Ser Val Leu Cys Ser Thr Pro Thr Ile Asn Ile Pro Ala Ser
20 25 30

Pro Phe Met Gln Lys Leu Gly Phe Gly Thr Gly Val Asn Val Tyr Leu
35 40 45

Met Lys Arg Ser Pro Arg Gly Leu Ser His Ser Pro Trp Ala Val Lys
50 55 60

Lys Ile Asn Pro Ile Cys Asn Asp His Tyr Arg Ser Val Tyr Gln Lys
65 70 75 80

Arg Leu Met Asp Glu Ala Lys Ile Leu Lys Ser Leu His His Pro Asn
85 90 95

Ile Val Gly Tyr Arg Ala Phe Thr Glu Ala Asn Asp Gly Ser Leu Cys
100 105 110

Leu Ala Met Glu Tyr Gly Gly Glu Lys Ser Leu Asn Asp Leu Ile Glu
115 120 125

Glu Arg Tyr Lys Ala Ser Gln Asp Pro Phe Pro Ala Ala Ile Ile Leu
130 135 140

Lys Val Ala Leu Asn Met Ala Arg Gly Leu Lys Tyr Leu His Gln Glu
145 150 155 160

Lys Lys Leu Leu His Gly Asp Ile Lys Ser Ser Asn Val Val Ile Lys
165 170 175

Gly Asp Phe Glu Thr Ile Lys Ile Cys Asp Val Gly Val Ser Leu Pro
180 185 190

Leu Asp Glu Asn Met Thr Val Thr Asp Pro Glu Ala Cys Tyr Ile Gly
195 200 205

Thr Glu Pro Trp Lys Pro Lys Glu Ala Val Glu Glu Asn Gly Val Ile
210 215 220

Thr Asp Lys Ala Asp Ile Phe Ala Phe Gly Leu Thr Leu Trp Glu Met
225 230 235 240

Met Thr Leu Ser Ile Pro His Ile Asn Leu Ser Asn Asp Asp Asp Asp
245 250 255

Glu Asp Lys Thr Phe Asp Glu Ser Asp Phe Asp Asp Glu Ala Tyr Tyr
260 265 270

Ala Ala Leu Gly Thr Arg Pro Pro Ile Asn Met Glu Glu Leu Asp Glu
275 280 285

Ser Tyr Gln Lys Val Ile Glu Leu Phe Ser Val Cys Thr Asn Glu Asp
290 295 300

Pro Lys Asp Arg Pro Ser Ala Ala His Ile Val Glu Ala Leu Glu Thr
305 310 315 320

Asp Val



<210> 386 
<211> 1899 
<212> DNA 

<213> Homo sapiens 



<400> 386 

agcgcgcgac tttttgaaag ccaggagggt tcgaattgca acggcagctg ccgggcgtat 60

gtgttggtgc tagaggcagc tgcagggtct cgctgggggc cgctcgggac caattttgaa 120

gaggtacttg gccacgactt attttcacct ccgacctttc cttccaggcg gtgagactct 180

ggactgagag tggctttcac aatggaaggg atcagtaatt tcaagacacc aagcaaatta 240

tcagaaaaaa agaaatctgt attatgttca actccaacta taaatatccc ggcctctccg 300

tttatgcaga agcttggctt tggtactggg gtaaatgtgt acctaatgaa aagatctcca 360

agaggtttgt ctcattctcc ttgggctgta aaaaagatta atcctatatg taatgatcat 420

tatcgaagtg tgtatcaaaa gagactaatg gatgaagcta agattttgaa aagccttcat 480

catccaaaca ttgttggtta tcgtgctttt actgaagcca atgatggcag tctgtgtctt 540

gctatggaat atggaggtga aaagtctcta aatgacttaa tagaagaacg atataaagcc 600

agccaagatc cttttccagc agccataatt ttaaaagttg ctttgaatat ggcaagaggg 660




ttaaagtatc tgcaccaaga aaagaaactg cttcatggag acataaagtc ttcaaatgtt 720 

gtaattaaag gcgattttga aacaattaaa atctgtgatg taggagtctc tctaccactg 780 

gatgaaaata tgactgtgac tgaccctgag gcttgttaca ttggcacaga gccatggaaa 840 

cccaaagaag ctgtggagga gaatggtgtt attactgaca aggcagacat atttgccttt 900 

ggccttactt tgtgggaaat gatgacttta tcgattccac acattaatct ttcaaatgat 960 

gatgatgatg aagataaaac ttttgatgaa agtgattttg atgatgaagc atactatgca 1020 

gcgttgggaa ctaggccacc tattaatatg gaagaactgg atgaatcata ccagaaagta 1080 

attgaactct tctctgtatg cactaatgaa gaccctaaag atcgtccttc tgctgcacac 1140 

attgttgaag ctctggaaac agatgtctag tgatcatctc agctgaagtg tggcttgcgt 1200 

aaataactgt ttattccaaa atatttacat agttactatc agtagttatt agactctaaa 1260 

attggcatat ttgaggacca tagtttcttg ttaacatatg gataactatt tctaatatga 1320 

aatatgctta tattggctat aagcacttgg aattgtactg ggttttctgt aaagttttag 1380 

aaactagcta cataagtact ttgatactgc tcatgctgac ttaaaacact agcagtaaaa 1440 

cgctgtaaac tgtaacatta aattgaatga ccattacttt tattaatgat ctttcttaaa 1500 

tattctatat tttaatggat ctactgacat tagcactttg tacagtacaa aataaagtct 1560 

acatttgttt aaaacactga accttttgct gatgtgttta tcaaatgata actggaagct 1620 

gaggagaata tgcctcaaaa agagtagctc cttggatact tcagactctg gttacagatt 1680 

gtcttgatct cttggatctc ctcagatctt tggtttttgc tttaatttat taaatgtatt 1740 

ttccatactg agtttaaaat ttattaattt gtaccttaag catttcccag ctgtgtaaaa 1800 

acaataaaac tcaaatagga tgataaagaa taaaggacac tttgggtacc agaaaaaaaa 1860 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 1899 

<210> 387 
<211> 202 
<212> PRT 

<213> Homo sapiens 



<400> 387 



Met Ala Thr Leu Ile Tyr Val Asp Lys Glu Asn Gly Glu Pro Gly Thr
1 5 10 15

Arg Val Val Ala Lys Asp Gly Leu Lys Leu Gly Ser Gly Pro Ser Ile
20 25 30

Lys Ala Leu Asp Gly Arg Ser Gln Val Ser Thr Pro Arg Phe Gly Lys
35 40 45

Thr Phe Asp Ala Pro Pro Ala Leu Pro Lys Ala Thr Arg Lys Ala Leu
50 55 60

Gly Thr Val Asn Arg Ala Thr Glu Lys Ser Val Lys Thr Lys Gly Pro
65 70 75 80

Leu Lys Gln Lys Gln Pro Ser Phe Ser Ala Lys Lys Met Thr Glu Lys
85 90 95

Thr Val Lys Ala Lys Ser Ser Val Pro Ala Ser Asp Asp Ala Tyr Pro
100 105 110

Glu Ile Glu Lys Phe Phe Pro Phe Asn Pro Leu Asp Phe Glu Ser Phe
115 120 125

Asp Leu Pro Glu Glu His Gln Ile Ala His Leu Pro Leu Ser Gly Val
130 135 140

Pro Leu Met Ile Leu Asp Glu Glu Arg Glu Leu Glu Lys Leu Phe Gln
145 150 155 160

Leu Gly Pro Pro Ser Pro Val Lys Met Pro Ser Pro Pro Trp Glu Ser
165 170 175

Asn Leu Leu Gln Ser Pro Ser Ser Ile Leu Ser Thr Leu Asp Val Glu
180 185 190

Leu Pro Pro Val Cys Cys Asp Ile Asp Ile
195 200















<210> 388 
<211> 728 
<212> DNA 
<213> Homo sapiens 
<400> 388

gcggcctcag atgaatgcgg ctgttaagac ctgcaataat ccagaatggc tactctgatc 60

tatgttgata aggaaaatgg agaaccaggc acccgtgtgg ttgctaagga tgggctgaag 120

ctggggtctg gaccttcaat caaagcctta gatgggagat ctcaagtttc aacaccacgt 180

tttggcaaaa cgttcgatgc cccaccagcc ttacctaaag ctactagaaa ggctttggga 240

actgtcaaca gagctacaga aaagtctgta aagaccaagg gacccctcaa acaaaaacag 300

ccaagctttt ctgccaaaaa gatgactgag aagactgtta aagcaaaaag ctctgttcct 360

gcctcagatg atgcctatcc agaaatagaa aaattctttc ccttcaatcc tctagacttt 420

gagagttttg acctgcctga agagcaccag attgcgcacc tccccttgag tggagtgcct 480

ctcatgatcc ttgacgagga gagagagctt gaaaagctgt ttcagctggg ccccccttca 540

cctgtgaaga tgccctctcc accatgggaa tccaatctgt tgcagtctcc ttcaagcatt 600

ctgtcgaccc tggatgttga attgccacct gtttgctgtg acatagatat ttaaatttct 660

tagtgcttca gagtttgtgt gtatttgtat taataaagca ttcttcaaca gaaaaaaaaa 720

aaaaaaaa 728

<210> 389
<211> 221
<212> PRT
<213> Homo sapiens



<400> 389 




























Met Ser Glu Thr Ala Pro Leu Ala Pro Thr Ile Pro Ala Pro Ala Glu
1 5 10 15

Lys Thr Pro Val Lys Lys Lys Ala Lys Lys Ala Gly Ala Thr Ala Gly
20 25 30

Lys Arg Lys Ala Ser Gly Pro Pro Val Ser Glu Leu Ile Thr Lys Ala
35 40 45

Val Ala Ala Ser Lys Glu Arg Ser Gly Val Ser Leu Ala Ala Leu Lys
50 55 60

Lys Ala Leu Ala Ala Ala Gly Tyr Asp Val Glu Lys Asn Asn Ser Arg
65 70 75 80

Ile Lys Leu Gly Leu Lys Ser Leu Val Ser Lys Gly Thr Leu Val Gln
85 90 95

Thr Lys Gly Thr Gly Ala Ser Gly Ser Phe Lys Leu Asn Lys Lys Ala
100 105 110

Ala Ser Gly Glu Gly Lys Pro Lys Ala Lys Lys Ala Gly Ala Ala Lys
115 120 125

Pro Arg Lys Pro Ala Gly Ala Ala Lys Lys Pro Lys Lys Val Ala Gly
130 135 140

Ala Ala Thr Pro Lys Lys Ser Ile Lys Lys Thr Pro Lys Lys Val Lys
145 150 155 160

Lys Pro Ala Thr Ala Ala Gly Thr Lys Lys Val Ala Lys Ser Ala Lys
165 170 175

Lys Val Lys Thr Pro Gln Pro Lys Lys Ala Ala Lys Ser Pro Ala Lys
180 185 190

Ala Lys Ala Pro Lys Pro Lys Ala Ala Lys Pro Lys Ser Gly Lys Pro
195 200 205

Lys Val Thr Lys Ala Lys Lys Ala Ala Pro Lys Lys Lys
210 215 220









<210> 390 
<211> 777 
<212> DNA 
<213> Homo sapiens 

<400> 390 

catgctgttc tgacagtttg agattactta ttgtcttttc tgggaagaca aaaacatgtc 60

ggagactgct ccacttgctc ctaccattcc tgcacccgca gaaaaaacac ctgtgaagaa 120

aaaggcgaag aaggcaggcg caactgctgg gaaacgcaaa gcatccggac ccccagtatc 180

tgagcttatc accaaggcag tggcagcttc taaggagcgc agcggcgttt ctctggccgc 240

gcttaagaaa gcgcttgcgg ctgctggcta cgatgtagaa aaaaacaaca gccgtatcaa 300

gcttggcctc aagagcttgg tgagcaaagg tactctggtg cagaccaaag gtaccggtgc 360

ttctggctcc ttcaaactca acaagaaagc ggcttccggg gaaggcaaac ccaaggccaa 420

aaaggctggc gcagccaagc ctaggaagcc tgctggggca gccaagaagc ccaagaaggt 480

ggctggcgcc gctaccccga agaaaagcat caaaaagact cctaagaagg taaagaagcc 540

agcaaccgct gctgggacca agaaagtggc caagagtgcg aaaaaggtga aaacacctca 600

gccaaaaaaa gctgccaaga gtccagctaa ggccaaagcc cctaagccca aggcggccaa 660

gcctaagtcg gggaagccga aggttacaaa ggcaaagaag gcagctccga agaaaaagtg 720

aaactggcgg gacgttcccc tttgaaaatt ttaaacggct cttttcagag ccaccca 777



<210> 391 
<211> 846 
<212> PRT 
<213> Homo sapiens 

<400> 391 

Met Ser Ser Ser His Phe Ala 

1 5 
Glu Met lie Arg Thr Lys lie 
20 

Glu Asn Arg His Lys Glu Tyr 
35 

Asp Val Asn lie Pro Thr Leu 

50 55 
Glu Thr Ser Gin Glu Leu Val 
65 70 
Ala Met Lys Thr lie Leu Gly 
85 

Tyr Lys Glu Glu Lys Gin Leu 
100 

Ala Lys Arg Gly lie Phe Lys 
115 

Cys Phe Leu Leu Ser Asn Gin 
130 135 
Ala lie Pro Ser Ser Val Arg 
145 150 
Met Glu Gin Thr Lys lie Asp 
165 

Pro Gly Pro Arg Gin Thr Ser 
180 

Lys Val Val Gin Pro Val Met 
195 

Ala Thr Gin Ala Ala Lys Gin 
210 215 
Ala Arg Lys Pro Val Thr Arg 
225 230 
Lys Val Pro Ser Lys Gly Arg 
245 

Asp Lys Gly lie Ser Cys Lys 
260 

Ser Gin Thr Asn Ala Thr Ser 
275 

Lys Met Glu Asn Leu Pro Glu 
290 295 
Asn Ser Phe Ala Pro Lys Asp 
305 310 
Lys Thr Tyr Gin Val Thr Pro 
325 

Leu Thr Pro Ser Tyr Thr Trp 
340 

Ser Gin Ala Thr Lys Glu lie 



Ser Arg His 
10 

Ala His Arg 
25 

Glu Arg Asn 
4 0 

Glu Gly Arg 

Pro Glu Lys 

Asp Gin Arg 
90 

Gin Lys Leu 

105 
Val Gly Arg 
120 

Asn Ala Val 

lie Thr Arg 

Asn Glu Ser 
170 

Glu Lys Lys 

185 
Pro Thr Ser 
200 

Val Pro Arg 

Ala Ala Asn 

Pro Ala Lys 
250 

Val Asp Ser 

265 
Gly Met Asn 
280 

lie Asn Thr 

Phe Met Phe 

Met Thr Pro 
330 

Thr Pro Leu 

345 
Leu Ala Gin 



Arg Lys Asp lie 
Lys Ser 
Arg His 



lie Leu 

60 
Thr Asn 
75 

Lys Gin 

Lys Glu 

Tyr Arg 

Lys Ala 
140 
Ser Lys 
155 

Asp Val 

Val Ser 

Leu Arg 

Thr Val 
220 
Glu Asn 
235 

Asn Val 

Glu Glu 

Pro Asp 

Ala Lys 
300 
Gin Pro 
315 

Arg Ser 
Lys Thr 
Lys Cys 



Leu Ser 
30 

Phe Gly 
45 

Val Glu 



Val Lys 

Met Leu 

Gin Arg 
110 
Pro Asp 
125 

Glu Pro 

Ala Lys 

Arg Ala 

Asp Lys 
190 
Met Thr 
205 

Ser Ser 

Glu Pro 

Glu Thr 

Asn Thr 
270 
Gly Val 
285 

lie Lys 

Leu Asp 

Ala Asn 

Glu Val 
350 
Lys Thr 



Ser Thr 
15 

Gin Lys 

Leu Lys 

Leu Asp 

Pro Arg 

80 
Gin Lys 
95 

Glu Lys 

Met Pro 

Lys Lys 

Asp Gin 
160 
lie Arg 
175 

Glu Lys 

Arg Ser 

Thr Thr 

Glu Gly 
240 
Lys Pro 
255 

Leu Asn 

Leu Ser 

Gly Lys 

Gly Leu 
320 
Ala Phe 
335 

Asp Glu 
Tyr Ser 
355

Thr Lys Thr lie 
370 

Pro Leu Thr Val 
385 

Thr Thr Lys Asn 

Glu Arg Asn Glu 
420 

Phe Arg Asn lie 
435 

Phe Glu Trp Asp 
450 

Asp Leu lie Arg 
465 

Arg Phe Lys Gin 

Gly He Lys Glu 
500 

Val Ser Phe Gin 
515 

Lys Leu Glu Glu 
530 

Met Asn Lys Asn 
545 

Lys Pro Lys Gin 

Ala Ala He Lys 
580 

Ala Glu Thr Ala 
595 

Phe Asp Ala Gly 
610 

Gly Leu Ser Val 
625 

Lys Ser Val Asn 

Pro Gin Gin Thr 
660 

Ser Glu His Val 
675 

Ser Ser He Glu 
690 

Glu Asn His Val 
705 

Ser Glu Arg Met 

He Asn Thr Asn 
740 

Glu Leu Asn Ser 
755 

Glu Lys Asn Thr 
770 

Lys He Ser Gin 
785 

Cys His Leu Leu 

Gin Leu Glu Arg 
820 

Gly Asn Leu lie 
835 



360 

Gin Gin Asp Ser 
375 

Trp His Glu Glu 
390 

Leu Asn Gly Leu 
405 

Gly Arg He Ala 

Leu Gin Ser Glu 
440 

Arg Lys Leu Glu 
455 

Thr Ala Val Gly 
4 70 

Phe Glu Gly Leu 
485 

Thr Thr Cys Thr 

He Glu Asp Val 
520 

Ser Gly Trp Gin 
535 

Val Phe Arg Lys 
550 

Asp Asp Ala Gly 
5 65 

Asn Ala Met Arg 

Val Ser Val He 
600 

Phe Phe Arg Val 
615 

Ser Ser Glu Gly 
630 

Lys Ala Val Ser 
645 

Thr Ser Pro Glu 

Lys Lys Thr Leu 
680 

Asp Ala Gin Cys 
695 

Val Asn Lys Thr 
710 

Ser Leu Pro Leu 
725 

Lys Lys Glu Gly 

Ser lie Thr Ser 
760 

Ala Ser Gin Ash 
775 

Ser Glu Leu Phe 
790 

Asp Ser Pro Gly 
805 

Arg His Gin Glu 

Thr Phe Ser Pro 
840 



Asn Lys Leu Pro 
380 

His Val Leu Asn 
395 

Pro lie Lys Glu 
410 

Gin Pro His His 
425 

Thr Glu Lys Leu 

Leu Asp lie Pro 
4 60 

Gin Thr Arg Leu 
475 

Val Asp Asp Cys 
4 90 

Asp Leu Asp Gly 
505 

lie His Lys Phe 

Val Asn Asn Asn 
540 

Lys Val Val Ser 
555 

Arg lie Ala Ala 
570 

Glu Arg lie Arg 
585 

Pro Lys Glu Val 

Glu Ser Pro Val 
620 

Pro Ser Gin Arg 
635 

Gin Ser Arg Asn 
650 

Asn Ala Gly Pro 
665 

Phe Leu Ser lie 

Pro Gly Leu Pro 
700 

Asp Leu Lys Val 
715 

Leu Ala Gly Gly 
730 

lie Ser Asp Val 
745 

Gin Asp Val Leu 

Ser lie Leu Glu 
780 

Asp Asn Lys Ser 
795 

Leu Asn Cys Ser 
810 

His Ala Arg His 
825 

Leu Gin Pro Gly 



365 

Cys Pro Leu Gly 

Lys Asn Glu Ala 
400 

Val Pro Ser Leu 
415 

Gly Val Pro Tyr 
4 30 

Thr Ser His Cys 
445 

Asp Asp Ala Lys 

Leu Met Lys Glu 
480 

Glu Tyr Lys Arg 
495 

Phe Trp Asp Met 
510 

Asn Asn Leu lie 
525 

Met Asn His Asn 

Gly lie Ala Ser 
560 

Arg Asn Arg Leu 
575 

Gin Glu Glu Cys 
590 

Asp Lys lie Val 
605 

Lys Leu Phe Ser 

Leu Gly Thr Pro 
640 

Glu Met Gly lie 
655 

Gin Asn Thr Lys 
670 

Pro Glu Ser Arg 
685 

Asp Leu lie Glu 

Asp Cys Leu Ser 
720 

Val Ala Asp Asp 
735 

Val Glu Gly Met 
750 

Met Ser Ser Pro 
765 

Glu Gly Glu Thr 

Leu Thr Thr Glu 
800 

Asn Pro Phe Thr 
815 

He Ser Phe Gly 

830 
Glu Phe 
845 
<210> 392
<211> 2979 
<212> DNA 

<213> Homo sapiens 



<400> 392 

agcaaaccaa 

t taaagcccc 

tgtgagggtt 

tgtttggttg 

tcgacacagg 

actgtctcag 

agatgtaaac 

agagcttgtt 

tcaacgaaaa 

gcagagagag 

ttgttttctt 

tt ctgtacgg 

cgagagtga t 

agacaaagag 

agctactcaa 

agtcacaaga 

tgccaaaaat 

aaatactttg 

aaaaatggaa 

acctaaggat 

gactcccaga 

agaagttgat 

taccaagaca 

ttggcatgaa 

tccaataaaa 

tggtgtgcca 

cttcgagtgg 

cacagcagtt 

ggttgatgat 

at tttgggat 

caaacttgag 

tgtctttagg 

aagaattgca 

gcaggaagaa 

gttcgatgct 

ctcttctgaa 

tcagagtaga 

tcagaatacg 

gagcagcata 

tgtaaataag 

tcttgctggt 

tgtggaagga 

tgaaaaaaat 

gtcagaacta 

tctaaactgc 

catttctttt 

ttaaaaataa 

ttgaaatttt 

ctgtgttcat 

tacattttca 

<210> 393 
<211> 450 



tcgcaagcct 

tccaatcagc 

cctgcttcgg 

aggtttcacg 

aaggatataa 

aaagaaaata 

attccaacct 

ccagaaaaga 

cagatgctcc 

aaagctaaac 

ttatcaaacc 

attacaaggt 

gttcgagcaa 

aaaaaagttg 

gcagcaaagc 

gctgctaatg 

gtagaaacaa 

aattcacaaa 

aact tacctg 

tttatgtttc 

agtgccaatg 

gagtctcaag 

atacagcaag 

gaacatgttt 

gaagtcccat 

tatttcagaa 

gacaggaaac 

ggtcaaacaa 

tgtgaatata 

atggttagtt 

gaatctgggt 

aaaaaagttg 

gcgagaaatc 

tgtgctgaaa 

ggatttttca 

ggcccttctc 

aatgagatgg 

aaaagtgaac 

gaagatgctc 

acagacttga 

ggagtagcag 

atggaactga 

acagcttcac 

tttgataata 

agtaatccat 

ggtggtaacc 

atccaaacat 

ggagaaaatg 

caaagtgtat 

acacagaata 



cgttgagtgg 

ggctcggtgc 

agtcggcggt 

ctagaaggtg 

gtactgaaat 

gacataagga 

tggaaggtag 

ccaatgttaa 

aaaaatacaa 

gaggaatatt 

agaatgctgt 

caaaggccaa 

tccgacctgg 

tgcagcctgt 

aggttcccag 

aaaacgaacc 

aacccgacaa 

ctaatgcaac 

agataaatac 

agccactgga 

cttttttgac 

caacaaaaga 

at tcaaataa 

taaataaaaa 

cacttgaaag 

atatcctcca 

ttgaattgga 

gactccttat 

aacgaggtat 

ttcagataga 

ggcaagtcaa 

tctcaggtat 

gcctagctgc 

cagcagtttc 

gagttgaaag 

aaagacttgg 

gcattccaca 

atgtgaagaa 

agtgtcctgg 

aggtggattg 

atgatattaa 

attcttcaat 

aaaatagcat 

aaagtctcac 

ttactcagct 

tgattacttt 

tttccttcat 

tatttgtgtt 

tttagatata 

aaaaatgtac 



aaggggtggg 

ggcaagtttg 

ggtcgtccag 

gctcaggatg 

gattagaact 

atacgaacga 

aattcttgtt 

gccaagggca 

agaagaaaag 

taaagtgggt 

gaaagctgag 

agaccaaatg 

tccaagacaa 

aatgcccacg 

aacagtctca 

agaaggaaag 

gggtatttct 

aagtggaatg 

tgcaaaaata 

tggtctgaag 

acccagttac 

aattt tggca 

attgccatgt 

tgaagctact 

aaatgaaggt 

gtcagaaact 

cattccagat 

gaaggaaagg 

aaaggagact 

agatgtaatc 

taataatatg 

agcaagtaaa 

cataaaaaat 

tgtgatacca 

tcctgttaaa 

aacacctaag 

acaaactaca 

gactttgttt 

attaccagat 

tttatccagt 

tactaacaaa 

tacatcacag 

cttagaagaa 

tactgaatgc 

ggagaggaga 

ttcacctcta 

attatcaatg 

cacttctata 

ctctttctca 

tgtgccttg 



atcttccccg 

aatttcgtgg 

accgagtgtt 

tcttcatcac 

aaaattgctc 

aatagacact 

gaattagatg 

atgaaaacta 

caacttcaaa 

cgttatagac 

ccaaaaaagg 

gagcagacta 

acttctgaaa 

tcgttgagaa 

tctaccacag 

gtgccaagta 

tgtaaa,gtcg 

aatccagatg 

aaagggaaga 

acctatcaag 

acctggactc 

caaaaatgta 

cctttgggtc 

actaaaaatt 

cgaattgctc 

gagaaattaa 

gatgctaaag 

tttaaacagt 

acctgtacag 

cacaaattca 

aatcataata 

ccaaaacagg 

gcaatgagag 

aaggaagttg 

ttattctcag 

tctgtcaaca 

tcaccagaaa 

ttgagtattc 

ttaattgaag 

gagagaatga 

aaagaaggaa 

gatgttttga 

ggggaaacta 

caccttcttg 

catcaagaac 

caaccaggag 

cttatatatt 

gcatataatg 

agggaagtgg 



gaagttttgg 

aggctcgggt 

ctttactttt 

attttgccag 

ataggaaatc 

ttggt t tgaa 

agacatctca 

t tctaggtga 

aattgaaaga 

ctgatatgcc 

ctattccatc 

agat tgataa 

agaaagtgtc 

tgactcgatc 

caagaaagcc 

aaggaagacc 

atagtgaaga 

gagtcttatc 

attccttcgc 

taacacctat 

ctttaaaaac 

aaacttactc 

ctctaactgt 

taaatggcct 

agccccacca 

cttcacattg 

atcttattcg 

ttgaaggact 

atctggatgg 

acaatctgat 

tgaacaaaaa 

atgatgctgg 

agagaattag 

ataaaatagt 

gactttctgt 

aagctgtatc 

atgccggtcc 

ctgaaagcag 

aaaaccatgt 

gtttgcctct 

tttcagatgt 

tgagtagccc 

aaatttctca 

attcaccagg 

atgccagaca 

aattttgaat 

ccttagacta 

ttttaatatt 

ggatattttg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

2979 
<212> PRT

<213> Homo sapiens 
<400> 393 

Met Asp Ala Arg Arg Val Pro Gln Lys Asp Leu Arg Val Lys Lys Asn
1 5 10 15

Leu Lys Lys Phe Arg Tyr Val Lys Leu Ile Ser Met Glu Thr Ser Ser
20 25 30

Ser Ser Asp Asp Ser Cys Asp Ser Phe Ala Ser Asp Asn Phe Ala Asn
35 40 45

Thr Lys Pro Lys Phe Arg Ser Asp Ile Ser Glu Glu Leu Ala Ser Val
50 55 60

Phe Tyr Glu Asp Ser Asp Asn Glu Ser Phe Cys Gly Phe Ser Glu Ser
65 70 75 80

Glu Val Gln Asp Val Leu Asp His Cys Gly Phe Leu Gln Lys Pro Arg
85 90 95

Pro Asp Val Thr Asn Glu Leu Ala Gly Ile Phe His Ala Asp Ser Asp
100 105 110

Asp Glu Ser Phe Cys Gly Phe Ser Glu Ser Glu Ile Gln Asp Gly Met
115 120 125

Arg Leu Gln Ser Val Arg Glu Gly Cys Arg Thr Arg Ser Gln Cys Arg
130 135 140

His Ser Gly Pro Leu Arg Val Ala Met Lys Phe Pro Ala Arg Ser Thr
145 150 155 160

Arg Gly Ala Thr Asn Lys Lys Ala Glu Ser Arg Gln Pro Ser Glu Asn
165 170 175

Ser Val Thr Asp Ser Asn Ser Asp Ser Glu Asp Glu Ser Gly Met Asn
180 185 190

Phe Leu Glu Lys Arg Ala Leu Asn Ile Lys Gln Asn Lys Ala Met Leu
195 200 205

Ala Lys Leu Met Ser Glu Leu Glu Ser Phe Pro Gly Ser Phe Arg Gly
210 215 220

Arg His Pro Leu Pro Gly Ser Asp Ser Gln Ser Arg Arg Pro Arg Arg
225 230 235 240

Arg Thr Phe Pro Gly Val Ala Ser Arg Arg Asn Pro Glu Arg Arg Ala
245 250 255

Arg Pro Leu Thr Arg Ser Arg Ser Arg Ile Leu Gly Ser Leu Asp Ala
260 265 270

Leu Pro Met Glu Glu Glu Glu Glu Glu Asp Lys Tyr Met Leu Val Arg
275 280 285

Lys Arg Lys Thr Val Asp Gly Tyr Met Asn Glu Asp Asp Leu Pro Arg
290 295 300

Ser Arg Arg Ser Arg Ser Ser Val Thr Leu Pro His Ile Ile Arg Pro
305 310 315 320

Val Glu Glu Ile Thr Glu Glu Glu Leu Glu Asn Val Cys Ser Asn Ser
325 330 335

Arg Glu Lys Ile Tyr Asn Arg Ser Leu Gly Ser Thr Cys His Gln Cys
340 345 350

Arg Gln Lys Thr Ile Asp Thr Lys Thr Asn Cys Arg Asn Pro Asp Cys
355 360 365

Trp Gly Val Arg Gly Gln Phe Cys Gly Pro Cys Leu Arg Asn Arg Tyr
370 375 380

Gly Glu Glu Val Arg Asp Ala Leu Leu Asp Pro Asn Trp His Cys Pro
385 390 395 400

Pro Cys Arg Gly Ile Cys Asn Cys Ser Phe Cys Arg Gln Arg Asp Gly
405 410 415

Arg Cys Ala Thr Gly Val Leu Val Tyr Leu Ala Lys Tyr His Gly Phe
420 425 430

Gly Asn Val His Ala Tyr Leu Lys Ser Leu Lys Gln Glu Phe Glu Met
435 440 445

Gln Ala
<210> 394

<211> 2824 

<212> DNA 

<213> Homo sapiens 



<400> 394 

cccgagcccc 

tgctcctcct 

ccgatctggg 

gtaaagaaga 

tcctctgatg 

ttcaggtcag 

tctttctgcg 

cagaaaccaa 

gatgaatcat 

gttcgggaag 

atgaagtttc 

ccctcagaga 

tttttggaga 

tctgaattag 

tcacaatcaa 

gaacggagag 

ctacccatgg 

gtggatggct 

acccttccgc 

tgcagcaatt 

cgtcagaaga 

ggccagttct 

ctggatccga 

cagcgagatg 

gggaatgtgc 

ctggaaaatt 

tttcactgaa 

caagttaatc 

actttgccct 

atttccaatg 

gaaagcatat 

acacaataat 

gtttacacaa 

aggcacaaaa 

tcaatcccat 

ctaatttctt 

gttggtgggc 

agtaagctga 

tttcagttca 

atattaaaaa 

ggttcaagtt 

acttttagca 

tgtcagtatt 

tgtttgtatg 

gcataattgg 

acaaaagttt 

tcttactgta 

ttct 



gcccctccgg 
gctgtgggac 
cacccgccac 
acttaaagaa 
acagttgtga 
atatcagtga 
gcttttcaga 
ggccagatgt 
tttgcggttt 
gctgtaggac 
cagcgcggag 
attctgtgac 
aaagggcttt 
aaagcttccc 
ggagaccgcg 
ctcgtcctct 
aggaggagga 
acatgaatga 
atataattcg 
ctcgagagaa 
ctattgatac 
gtggcccctg 
actggcattg 
gacggtgtgc 
atgcctactt 
tgctgcctgc 
acctgagtta 
ttagcagaca 
cctgcagttt 
ctcctctcca 
tttatttact 
agtattaact 
aaacgagtat 
gtcttaaaac 
gtattgcgct 
ctgcccgaag 
aatgtaatac 
tttgaatttt 
catgtaaggt 
ctagtctgtg 
tagattttaa 
gtttgttaac 
ccccctcctc 
tccaatttac 
accttgaatc 
attttaataa 
cttaattcta 



gcccgggtcg 
cgctgaccgc 
cagcatggac 
attcagatat 
cagctttgct 
agaactggca 
aagtgaggtg 
cactaacgaa 
ctcagagagt 
ccgcagccag 
taccagggga 
tgattccaac 
aaatataaag 
tggctcgttc 
aaggcgtaca 
taccaggtca 
ggaagaggat 
agatgacctg 
cccagtggaa 
gatatataac 
caaaacaaac 
ccttcgaaac 
cccgccttgt 
gactggggtc 
gaaaagcctg 
cttctacttc 
aaaatcttga 
tgtgtttctg 
cttctctgct 
accgcttagt 
tggtgttgaa 
aactagatct 
gatttagcat 
catgtggaaa 
tatgttacaa 
ggtaagtggt 
ttaattaaaa 
cagtataaaa 
attgcaaata 
gttctttgca 
gcacttttat 
ctgacatctc 
tttgcattaa 
ttgcatatgt 
gataagtgta 
taaaatgttt 
attttttcct 



gcgcgcccag 
gcggctgctc 
gctcgccgcg 
gtgaagttga 
tctgataatt 
agtgtttttt 
caagatgtat 
ctggccggta 
gagatacaag 
tgcaggcact 
gcaaccaaca 
tccgattcag 
caaaacaaag 
cgtggaagac 
ttcccgggtg 
aggtcccgga 
aagtacatgt 
cccagaagcc 
gaaattacag 
cgttcactgg 
tgcagaaacc 
cgttatggtg 
cgaggaatct 
cttgtgtatt 
aaacaggaat 
tcaaatcttt 
tgatcagcct 
gagcatcaca 
cccaaccccc 
ttctgaattt 
atagccctca 
attgaatttc 
tcatactagt 
aattaggtaa 
gttgttgtca 
gcgtccagct 
taatgatgga 
ctttagtata 
aattcttgga 
gtttcttgta 
aacaatgata 
tgccagtcta 
tcaaggtatt 
aaaccattgc 
aatacagctt 
gttctaactt 
catatttaaa 



cctgccagcc 
cgctctcccc 
tgccgcagaa 
tttccatgga 
ttgcaaacac 
atgaggactc 
tagaccattg 
tttttcatgc 
atggaatgag 
ctggacctct 
aaaaagcaga 
aagatgaaag 
caatgcttgc 
atcccctccc 
ttgcttccag 
tcctcgggtc 
tggtgagaaa 
gtcgctccag 
aggaggagtt 
gctctact tg 
cagactgctg 
aagaggtcag 
gcaactgcag 
tagccaaata 
ttgaaatgca 
cttgtaaaag 
gtttcataag 
gaaggtatat 
atctcatagc 
cttttaaatt 
taaaacctaa 
agagaagagc 
tgaaattttt 
ttattgcaga 
cagttgagac 
tacacaatca 
agagctatct 
attgtagttt 
caattttgta 
aatttataaa 
agtgcctttt 
gtttctgggc 
tggtagaggt 
tgtgccattc 
ttgatctgta 
gtctgctttt 
taaaaggcca 



gcgctgctgc 
gctccaagcg 
agatctcaga 
aacctcgtca 
gaaacctaaa 
tgataatgaa 
tggattttta 
cgactctgac 
gctgcagtca 
cagggtggcg 
gtcccgccag 
tggaatgaat 
aaaactcatg 
aggctccgac 
gagaaaccct 
ccttgacgct 
gaggaagacc 
atcatccgtg 
ggagaacgtc 
tcatcaatgc 
gggcgttcga 
ggatgctctg 
tttctgccgg 
tcatggcttt 
agcataatat 
tttccaattt 
aaactccaat 
tgctagttac 
atccccctct 
acagttttat 
gcacttggaa 
ct tctaactt 
aatagaatca 
ttgatgtctc 
ttaatttctc 
taattcaaag 
ggagattatg 
gcaaagttta 
tggaaacttg 
ccaggcacaa 
tggagatgta 
aggtttcctg 
ggaatctaag 
aatgtttgat 
atgcttttat 
ttaaaaataa 
tttccacctt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2824 



<210> 395 
<211> 142 
<212> PRT 






<213> Homo sapiens
<400> 395 

Met Gly Ala Pro Thr Leu Pro Pro Ala Trp Gln Pro Phe Leu Lys Asp
1 5 10 15

His Arg Ile Ser Thr Phe Lys Asn Trp Pro Phe Leu Glu Gly Cys Ala
20 25 30

Cys Thr Pro Glu Arg Met Ala Glu Ala Gly Phe Ile His Cys Pro Thr
35 40 45

Glu Asn Glu Pro Asp Leu Ala Gln Cys Phe Phe Cys Phe Lys Glu Leu
50 55 60

Glu Gly Trp Glu Pro Asp Asp Asp Pro Ile Glu Glu His Lys Lys His
65 70 75 80

Ser Ser Gly Cys Ala Phe Leu Ser Val Lys Lys Gln Phe Glu Glu Leu
85 90 95

Thr Leu Gly Glu Phe Leu Lys Leu Asp Arg Glu Arg Ala Lys Asn Lys
100 105 110

Ile Ala Lys Glu Thr Asn Asn Lys Lys Lys Glu Phe Glu Glu Thr Ala
115 120 125

Lys Lys Val Arg Arg Ala Ile Glu Gln Leu Ala Ala Met Asp
130 135 140



<210> 396 
<211> 1619 
<212> DNA 

<213> Homo sapiens 
<400> 396 

ccgccagatt tgaatcgcgg gacccgttgg cagaggtggc ggcggcggca tgggtgcccc 60
gacgttgccc cctgcctggc agccctttct caaggaccac cgcatctcta cattcaagaa 120
ctggcccttc ttggagggct gcgcctgcac cccggagcgg atggccgagg ctggcttcat 180
ccactgcccc actgagaacg agccagactt ggcccagtgt ttcttctgct tcaaggagct 240
ggaaggctgg gagccagatg acgaccccat agaggaacat aaaaagcatt cgtccggttg 300
cgctttcctt tctgtcaaga agcagtttga agaattaacc cttggtgaat ttttgaaact 360
ggacagagaa agagccaaga acaaaattgc aaaggaaacc aacaataaga agaaagaatt 420
tgaggaaact gcgaagaaag tgcgccgtgc catcgagcag ctggctgcca tggattgagg 480
cctctggccg gagctgcctg gtcccagagt ggctgcacca cttccagggt ttattccctg 540
gtgccaccag ccttcctgtg ggccccttag caatgtctta ggaaaggaga tcaacatttt 600
caaattagat gtttcaactg tgctcctgtt ttgtcttgaa agtggcacca gaggtgcttc 660
tgcctgtgca gcgggtgctg ctggtaacag tggctgcttc tctctctctc tctctttttt 720
gggggctcat ttttgctgtt ttgattcccg ggcttaccag gtgagaagtg agggaggaag 780
aaggcagtgt cccttttgct agagctgaca gctttgttcg cgtgggcaga gccttccaca 840
gtgaatgtgt ctggacctca tgttgttgag gctgtcacag tcctgagtgt ggacttggca 900
ggtgcctgtt gaatctgagc tgcaggttcc ttatctgtca cacctgtgcc tcctcagagg 960
acagtttttt tgttgttgtg tttttttgtt tttttttttt ggtagatgca tgacttgtgt 1020
gtgatgagag aatggagaca gagtccctgg ctcctctact gtttaacaac atggctttct 1080
tattttgttt gaattgttaa ttcacagaat agcacaaact acaattaaaa ctaagcacaa 1140
agccattcta agtcattggg gaaacggggt gaacttcagg tggatgagga gacagaatag 1200
agtgatagga agcgtctggc agatactcct tttgccactg ctgtgtgatt agacaggccc 1260
agtgagccgc ggggcacatg ctggccgctc ctccctcaga aaaaggcagt ggcctaaatc 1320
ctttttaaat gacttggctc gatgctgtgg gggactggct gggctgctgc aggccgtgtg 1380
tctgtcagcc caaccttcac atctgtcacg ttctccacac gggggagaga cgcagtccgc 1440
ccaggtcccc gctttctttg gaggcagcag ctcccgcagg gctgaagtct ggcgtaagat 1500
gatggatttg attcgccctc ctccctgtca tagagctgca gggtggattg ttacagcttc 1560
gctggaaacc tctggaggtc atctcggctg ttcctgagaa ataaaaagcc tgtcatttc 1619

<210> 397
<211> 1401
<212> PRT

<213> Homo sapiens
<400> 397
Met Ala Pro Gln Met Tyr Glu Phe His Leu Pro Leu Ser Pro Glu Glu 
1 5 10 15 

Leu Leu Lys Ser Gly Gly Val Asn Gln Tyr Val Val Gln Glu Val Leu 
20 25 30 

Ser Ile Lys His Leu Pro Pro Gln Leu Arg Ala Phe Gln Ala Ala Phe 
35 40 45 

Arg Ala Gln Gly Pro Leu Ala Met Leu Gln His Phe Asp Thr Ile Tyr 
50 55 60 

Ser Ile Leu His His Phe Arg Ser Ile Asp Pro Gly Leu Lys Glu Asp 
65 70 75 80 

Thr Leu Glu Phe Leu Ile Lys Val Val Ser Arg His Ser Gln Glu Leu 
85 90 95 

Pro Ala Ile Leu Asp Asp Thr Thr Leu Ser Gly Ser Asp Arg Asn Ala 
100 105 110 

His Leu Asn Ala Leu Lys Met Asn Cys Tyr Ala Leu Ile Arg Leu Leu 
115 120 125 

Glu Ser Phe Glu Thr Met Ala Ser Gln Thr Asn Leu Val Asp Leu Asp 
130 135 140 

Leu Gly Gly Lys Gly Lys Lys Ala Arg Thr Lys Ala Ala His Gly Phe 
145 150 155 160 

Asp Trp Glu Glu Glu Arg Gln Pro Ile Leu Gln Leu Leu Thr Gln Leu 
165 170 175 

Leu Gln Leu Asp Ile Arg His Leu Trp Asn His Ser Ile Ile Glu Glu 
180 185 190 

Glu Phe Val Ser Leu Val Thr Gly Cys Cys Tyr Arg Leu Leu Glu Asn 
195 200 205 

Pro Thr Ile Asn His Gln Lys Asn Arg Pro Thr Arg Glu Ala Ile Thr 
210 215 220 

His Leu Leu Gly Val Ala Leu Thr Arg Tyr Asn His Met Leu Ser Ala 
225 230 235 240 

Thr Val Lys Ile Ile Gln Met Leu Gln His Phe Glu His Leu Ala Pro 
245 250 255 

Val Leu Val Ala Ala Val Ser Leu Trp Ala Thr Asp Tyr Gly Met Lys 
260 265 270 

Ser Ile Val Gly Glu Ile Val Arg Glu Ile Gly Gln Lys Cys Pro Gln 
275 280 285 

Glu Leu Ser Arg Asp Pro Ser Gly Thr Lys Gly Phe Ala Ala Phe Leu 
290 295 300 

Thr Glu Leu Ala Glu Arg Val Pro Ala Ile Leu Met Ser Ser Met Cys 
305 310 315 320 

Ile Leu Leu Asp His Leu Asp Gly Glu Asn Tyr Met Met Arg Asn Ala 
325 330 335 

Val Leu Ala Ala Met Ala Glu Met Val Leu Gln Val Leu Ser Gly Asp 
340 345 350 

Gln Leu Glu Ala Ala Ala Arg Asp Thr Arg Asp Gln Phe Leu Asp Thr 
355 360 365 

Leu Gln Ala His Gly His Asp Val Asn Ser Phe Val Arg Ser Arg Val 
370 375 380 

Leu Gln Leu Phe Thr Arg Ile Val Gln Gln Lys Ala Leu Pro Leu Thr 
385 390 395 400 

Arg Phe Gln Ala Val Val Ala Leu Ala Val Gly Arg Leu Ala Asp Lys 
405 410 415 

Ser Val Leu Val Cys Lys Asn Ala Ile Gln Leu Leu Ala Ser Phe Leu 
420 425 430 

Ala Asn Asn Pro Phe Ser Cys Lys Leu Ser Asp Ala Asp Leu Ala Gly 
435 440 445 

Pro Leu Gln Lys Glu Thr Gln Lys Leu Gln Glu Met Arg Ala Gln Arg 
450 455 460 

Arg Thr Ala Ala Ala Ser Ala Val Leu Asp Pro Glu Glu Glu Trp Glu 
465 470 475 480 

Ala Met Leu Pro Glu Leu Lys Ser Thr Leu Gln Gln Leu Leu Gln Leu 
485 490 495 

Pro Gln Gly Glu Glu Glu Ile Pro Glu Gln Ile Ala Asn Thr Glu Thr 
500 505 510 

Thr Glu Asp Val Lys Gly Arg Ile Tyr Gln Leu Leu Ala Lys Ala Ser 
515 520 525 

Tyr Lys Lys Ala Ile Ile Leu Thr Arg Glu Ala Thr Gly His Phe Gln 
530 535 540 

Glu Ser Glu Pro Phe Ser His Ile Asp Pro Glu Glu Ser Glu Glu Thr 
545 550 555 560 

Arg Leu Leu Asn Ile Leu Gly Leu Ile Phe Lys Gly Pro Ala Ala Ser 
565 570 575 

Thr Gln Glu Lys Asn Pro Arg Glu Ser Thr Gly Asn Met Val Thr Gly 
580 585 590 

Gln Thr Val Cys Lys Asn Lys Pro Asn Met Ser Asp Pro Glu Glu Ser 
595 600 605 

Arg Gly Asn Asp Glu Leu Val Lys Gln Glu Met Leu Val Gln Tyr Leu 
610 615 620 

Gln Asp Ala Tyr Ser Phe Ser Arg Lys Ile Thr Glu Ala Ile Gly Ile 
625 630 635 640 

Ile Ser Lys Met Met Tyr Glu Asn Thr Thr Thr Val Val Gln Glu Val 
645 650 655 

Ile Glu Phe Phe Val Met Val Phe Gln Phe Gly Val Pro Gln Ala Leu 
660 665 670 

Phe Gly Val Arg Arg Met Leu Pro Leu Ile Trp Ser Lys Glu Pro Gly 
675 680 685 

Val Arg Glu Ala Val Leu Asn Ala Tyr Arg Gln Leu Tyr Leu Asn Pro 
690 695 700 

Lys Gly Asp Ser Ala Arg Ala Lys Ala Gln Ala Leu Ile Gln Asn Leu 
705 710 715 720 

Ser Leu Leu Leu Val Asp Ala Ser Val Gly Thr Ile Gln Cys Leu Glu 
725 730 735 

Glu Ile Leu Cys Glu Phe Val Gln Lys Asp Glu Leu Lys Pro Ala Val 
740 745 750 

Thr Gln Leu Leu Trp Glu Arg Ala Thr Glu Lys Val Ala Cys Cys Pro 
755 760 765 

Leu Glu Arg Cys Ser Ser Val Met Leu Leu Gly Met Met Ala Arg Gly 
770 775 780 

Lys Pro Glu Ile Val Gly Ser Asn Leu Asp Thr Leu Val Ser Ile Gly 
785 790 795 800 

Leu Asp Glu Lys Phe Pro Gln Asp Tyr Arg Leu Ala Gln Gln Val Cys 
805 810 815 

His Ala Ile Ala Asn Ile Ser Asp Arg Arg Lys Pro Ser Leu Gly Lys 
820 825 830 

Arg His Pro Pro Phe Arg Leu Pro Gln Glu His Arg Leu Phe Glu Arg 
835 840 845 

Leu Arg Glu Thr Val Thr Lys Gly Phe Val His Pro Asp Pro Leu Trp 
850 855 860 

Ile Pro Phe Lys Glu Val Ala Val Thr Leu Ile Tyr Gln Leu Ala Glu 
865 870 875 880 

Gly Pro Glu Val Ile Cys Ala Gln Ile Leu Gln Gly Cys Ala Lys Gln 
885 890 895 

Ala Leu Glu Lys Leu Glu Glu Lys Arg Thr Ser Gln Glu Asp Pro Lys 
900 905 910 

Glu Ser Pro Ala Met Leu Pro Thr Phe Leu Leu Met Asn Leu Leu Ser 
915 920 925 

Leu Ala Gly Asp Val Ala Leu Gln Gln Leu Val His Leu Glu Gln Ala 
930 935 940 

Val Ser Gly Glu Leu Cys Arg Arg Arg Val Leu Arg Glu Glu Gln Glu 
945 950 955 960 

His Lys Thr Lys Asp Pro Lys Glu Lys Asn Thr Ser Ser Glu Thr Thr 
965 970 975 

Met Glu Glu Glu Leu Gly Leu Val Gly Ala Thr Ala Asp Asp Thr Glu 
980 985 990 

Ala Glu Leu Ile Arg Gly Ile Cys Glu Met Glu Leu Leu Asp Gly Lys 
995 1000 1005 

Gln Thr Leu Ala Ala Phe Val Pro Leu Leu Leu Lys Val Cys Asn Asn 
1010 1015 1020 

Pro Gly Leu Tyr Ser Asn Pro Asp Leu Ser Ala Ala Ala Ser Leu Ala 
1025 1030 1035 1040 

Leu Gly Lys Phe Cys Met Ile Ser Ala Thr Phe Cys Asp Ser Gln Leu 
1045 1050 1055 

Arg Leu Leu Phe Thr Ile Leu Glu Lys Ser Pro Leu Pro Ile Val Arg 
1060 1065 1070 

Ser Asn Leu Met Val Ala Thr Gly Asp Leu Ala Ile Arg Phe Pro Asn 
1075 1080 1085 

Leu Val Asp Pro Trp Thr Pro His Leu Tyr Ala Arg Leu Arg Asp Pro 
1090 1095 1100 

Ala Gln Gln Val Arg Lys Thr Ala Gly Leu Val Met Thr His Leu Ile 
1105 1110 1115 1120 

Leu Lys Asp Met Val Lys Val Lys Gly Gln Val Ser Glu Met Ala Val 
1125 1130 1135 

Leu Leu Ile Asp Pro Glu Pro Gln Ile Ala Ala Leu Ala Lys Asn Phe 
1140 1145 1150 

Phe Asn Glu Leu Ser His Lys Gly Asn Ala Ile Tyr Asn Leu Leu Pro 
1155 1160 1165 

Asp Ile Ile Ser Arg Leu Ser Asp Pro Glu Leu Gly Val Glu Glu Glu 
1170 1175 1180 

Pro Phe His Thr Ile Met Lys Gln Leu Leu Ser Tyr Ile Thr Lys Asp 
1185 1190 1195 1200 

Lys Gln Thr Glu Ser Leu Val Glu Lys Leu Cys Gln Arg Phe Arg Thr 
1205 1210 1215 

Ser Leu Thr Glu Arg Gln Gln Arg Asp Leu Ala Tyr Cys Val Ser Gln 
1220 1225 1230 

Leu Pro Leu Thr Glu Arg Gly Leu Arg Lys Met Leu Asp Asn Phe Asp 
1235 1240 1245 

Cys Phe Gly Asp Lys Leu Ser Asp Glu Ser Ile Phe Ser Ala Phe Leu 
1250 1255 1260 

Ser Val Val Gly Lys Leu Arg Arg Gly Ala Lys Pro Glu Gly Lys Ala 
1265 1270 1275 1280 

Ile Ile Asp Glu Phe Glu Gln Lys Leu Arg Ala Cys His Thr Arg Gly 
1285 1290 1295 

Leu Asp Gly Ile Lys Glu Leu Glu Ile Gly Gln Ala Gly Ser Gln Arg 
1300 1305 1310 

Ala Pro Ser Ala Lys Lys Pro Ser Thr Gly Ser Arg Tyr Gln Pro Leu 
1315 1320 1325 

Ala Ser Thr Ala Ser Asp Asn Asp Phe Val Thr Pro Glu Pro Arg Arg 
1330 1335 1340 

Thr Thr Arg Arg His Pro Asn Thr Gln Gln Arg Ala Ser Lys Lys Lys 
1345 1350 1355 1360 

Pro Lys Val Val Phe Ser Ser Asp Glu Ser Ser Glu Glu Asp Leu Ser 
1365 1370 1375 

Ala Glu Met Thr Glu Asp Glu Thr Pro Lys Lys Thr Thr Pro Ile Leu 
1380 1385 1390 

Arg Ala Ser Ala Arg Arg His Arg Ser 
1395 1400 



<210> 398 
<211> 5547 
<212> DNA 
<213> Homo sapiens 
<400> 398 

aagttttgga gcgccggaca agctgaggtc cgcgactcgt cgctaagatt ccccaaaact 60 
gagatttcaa gaaaattcac cctttccgct ttctttggcg cccttcaacc gtgaaggaaa 120 
tgaaggttga gaacctggaa cccgcttcca aagaccaagc cctttctctg tgcctcacaa 180 
cagtcttgcg ctgtagctgt ttttattacc cctattttac tactggggag gagatttgag 240 
gtcctgaaag tgaagtgact tgctggacgt cacttagcag aggagatggc gaaacctgga 300 
ttagaggcga tgccttctaa ctcccagcgt ggacttgcct cctttcctgg gggtgactga 360 
atgcccagcc agggacgcga cgtctctggc cagcagaaat acggcctcct ccccgccgac 420 
tgggcaaagg gggaccttgc ggccaaggag ggattcgcag gcgggccggg ggtgggagcg 480 
ggggccggag ccggagcctt ggcccgcccc cgggcggcgc tgtgattggc cggccgctcc 540 
ggcgggcgtc gcgattggcc gtcctgcagc cgttgagatt tgaactcggt atttgtggct 600 
ttgcccgcgc gttgccagac tcagaggcgg ccctttgcct ctgcctgccg gggattggcc 660 
ggattctccg ccgacttgaa aactgccttg ctaattggtg cgtgttgtgc acgcgtgttt 720 
tttccttttc atttcagcct gactgccgga atcagagccg cgggtgagat ccccagccct 780 
gtgagcctgt aggagtagaa tggctcccca aatgtatgag ttccatctgc cattatcccc 840 
agaggagttg ttgaaaagtg gaggggtgaa tcagtatgtt gtgcaagagg tactgtccat 900 
caaacatctt ccaccacagc ttagagcttt tcaggctgcc tttcgagctc aggggcccct 960 
ggctatgctg cagcactttg atactatcta cagcattttg catcactttc gaagtataga 1020 
tcctggcctc aaagaagata ctctggaatt cctgataaaa gtggtatccc gccactccca 1080 
ggagcttcca gctatcctgg atgatacaac tttgagtgga tcagatagaa acgcccatct 1140 
aaatgccctc aaaatgaact gttatgctct gatacgtctc ctggaatcct ttgagaccat 1200 
ggccagccag acaaaccttg tggacctgga ccttggtggg aagggtaaga aagctcggac 1260 
caaggcagcc catggctttg actgggaaga agagaggcaa ccaattcttc agcttttaac 1320 
acagctactt cagttggaca tccgtcacct gtggaaccac tcaataattg aagaagaatt 1380 
tgtcagtttg gttactggct gttgctaccg ccttctggag aatcccacca ttaatcacca 1440 
gaagaaccgc cccactcggg aagccataac acacctgctt ggtgtagcct tgacccgtta 1500 
taaccatatg ctcagtgcta cagtgaagat catccagatg ctgcagcact ttgaacacct 1560 
ggcacctgta ctggttgcag ccgtgagtct atgggcaact gactatggaa tgaagagcat 1620 
agtgggagag attgtaagag agattggaca aaagtgtccc caagagctga gtcgagaccc 1680 
ttcagggaca aagggctttg cagcattcct gacagaacta gcagaacgtg tcccagctat 1740 
cctgatgtcc agcatgtgca ttttgctaga tcacctggat ggagaaaatt acatgatgcg 1800 
taatgctgtg ctggcagcca tggcggagat ggtgctgcag gttctcagtg gcgatcaact 1860 
ggaagcagca gcccgagaca ccagagacca gttcttggat actttacaag cccatggcca 1920 
tgatgtcaac tcctttgtgc ggagccgtgt tttgcagctc ttcacccgaa ttgtccagca 1980 
gaaggctctc cccctgacac gtttccaggc agtggtggct ttagctgtgg gacgtctggc 2040 
agacaagtca gtgctagtat gtaaaaatgc catccagctg ctggccagtt ttctagccaa 2100 
taatcctttc tcctgcaagc ttagtgatgc tgaccttgcc ggaccactgc agaaggagac 2160 
ccagaaatta caagagatga gggcccagag gcgaactgca gcagcttctg cagtgctgga 2220 
cccagaggag gagtgggaag ccatgctgcc agagttgaag tctaccctgc agcagcttct 2280 
acagcttccc cagggagagg aggagattcc tgagcaaatt gccaatacag agacaactga 2340 
agatgtgaaa ggacgcatct atcaactgct tgccaaagct agttacaaaa aggccatcat 2400 
tctcactcga gaagccacag gccacttcca ggagtccgaa cccttcagtc atatagaccc 2460 
agaggagtca gaggagacca ggctcttgaa tatattagga cttatcttca aaggcccagc 2520 
agcttccaca caagaaaaga atccccggga gtctacagga aacatggtca caggacagac 2580 
tgtctgtaaa aataaaccca atatgtcgga tcctgaggaa tccaggggaa atgatgaact 2640 
agtgaagcag gagatgctgg tacagtatct gcaggatgcc tacagcttct cccggaagat 2700 
tacagaggcc attggcatca tcagcaagat gatgtatgaa aacacaacta cagtggtgca 2760 
ggaggtgatt gaattctttg tgatggtctt ccaatttggg gtaccccagg ccctgtttgg 2820 
ggtgcgccgt atgctgcctc tcatctggtc taaggagcct ggtgtccggg aagccgtgct 2880 
taatgcctac cgccaactct acctcaaccc caaaggggac tctgccagag ccaaggccca 2940 
ggctttgatt cagaatctct ctctgctgct agtggatgcc tcggttggga ccattcagtg 3000 
tcttgaggaa attctctgtg agtttgtgca gaaggatgag ttgaaaccag cagtgaccca 3060 
gctgctgtgg gagcgggcca ccgagaaggt cgcctgctgt cctctggagc gctgttcctc 3120 
tgtcatgctt cttggcatga tggcacgagg aaagccagaa attgtgggaa gcaatttaga 3180 
cacactggtg agcatagggc tggatgagaa gtttccacag gactacaggc tggcccagca 3240 
ggtgtgccat gccattgcca acatctcgga caggagaaag ccttctctgg gcaaacgtca 3300 
cccccccttc cggctgcctc aggaacacag gttgtttgag cgactgcggg agacagtcac 3360 
aaaaggcttt gtccacccag acccactctg gatcccattc aaagaggtgg cagtgaccct 3420 
catttaccaa ctggcagagg gccccgaagt gatctgtgcc cagatattgc agggctgtgc 3480 









aaaacaggcc ctggagaagc tagaagagaa gagaaccagt caggaggacc cgaaggagtc 3540 
ccccgcaatg ctccccactt tcctgttgat gaacctgctg tccctggctg gggatgtggc 3600 
tctgcagcag ctggtccact tggagcaggc agtgagtgga gagctctgcc ggcgccgagt 3660 
tctccgggaa gaacaggagc acaagaccaa agatcccaag gagaagaata cgagctctga 3720 
gaccaccatg gaggaggagc tggggctggt tggggcaaca gcagatgaca cagaggcaga 3780 
actaatccgt ggcatctgcg agatggaact gttggatggc aaacagacac tggctgcctt 3840 
tgttccactc ttgcttaaag tctgtaacaa cccaggcctc tatagcaacc cagacctctc 3900 
tgcagctgct tcacttgccc ttggcaagtt ctgcatgatc agtgccactt tctgcgactc 3960 
ccagcttcgt cttctgttca ccatcctgga aaagtctcca cttcccattg tccggtctaa 4020 
cctcatggtt gccactgggg atctggccat ccgctttccc aatctggtgg acccctggac 4080 
tcctcatctg tatgctcgcc tccgggaccc tgctcagcaa gtgcggaaaa cagcggggct 4140 
ggtgatgacc cacctgatcc tcaaggacat ggtgaaggtg aaggggcagg tcagcgagat 4200 
ggcggtgctg ctcatcgacc ccgagcctca gattgctgcc ctggccaaga acttcttcaa 4260 
tgagctctcc cacaagggca acgcaatcta taatctcctt ccagatatca tcagccgcct 4320 
gtcagacccc gagctggggg tggaggaaga gcctttccac accatcatga aacagctcct 4380 
ctcctacatc accaaggaca agcagacaga gagcctggtg gaaaagctgt gtcagcggtt 4440 
ccgcacatcc ctaactgagc ggcagcagcg agacctggcc tactgtgtgt cacagctgcc 4500 
cctcacagag cgaggcctcc gtaagatgct tgacaatttt gactgttttg gagacaaact 4560 
gtcagatgag tccatcttca gtgctttttt gtcagttgta ggcaagctgc gacgtggggc 4620 
caagcctgag ggcaaggcta taatagatga atttgagcag aagcttcggg cctgtcatac 4680 
cagaggtttg gatggaatca aggagcttga gattggccaa gcaggtagcc agagagcgcc 4740 
atcagccaag aaaccatcca ctggttctag gtaccagcct ctggcttcta cagcctcaga 4800 
caatgacttt gtcacaccag agccccgccg tactacccgt cggcatccaa acacccagca 4860 
gcgagcttcc aaaaagaaac ccaaagttgt cttctcaagt gatgagtcca gtgaggaaga 4920 
tctttcagca gagatgacag aagacgagac acccaagaaa acaactccca ttctcagagc 4980 
atcggctcgc aggcacagat cctaggaagt ctgttcctgt cctccctgtg cagggtatcc 5040 
tgtagggtga cctggaattc gaattctgtt tcccttgtaa aatatttgtc tgtctctttt 5100 
ttttaaaaaa aaaaaaggcc gggcactgtg gctcacgcct gtaatcccag cactttgcga 5160 
taccaaggcg ggtggataac ctgaggtagg gagttcgaga ccagcctgac caacatggag 5220 
aaaccccatc tctactaaaa ataaaaaatt agccgggcgt attggcgtgc gcctgtaatc 5280 
ccagctactc aagaggctga ggcaggagaa tcgcctgaac ccagaggcgg aggttgtagt 5340 
gagccgaaat cacaccattg cactccagct tgggcaacaa tagcgaacct ccatctcaaa 5400 
ttaaaaaaaa aatgcctaca cgctctttaa aatgcaaggc tttctcttaa attagcctaa 5460 
ctgaactgcg ttgagctgct tcaactttgg aatatatgtt tgccaatctc cttgttttct 5520 
aatgaataaa tgtttttata tactttt 5547 

<210> 399 
<211> 213 
<212> PRT 

<213> Homo sapiens 



<400> 399 



Met Val Gln Ser Cys Ser Ala Tyr Gly Cys Lys Asn Arg Tyr Asp Lys 
1 5 10 15 

Asp Lys Pro Val Ser Phe His Lys Phe Pro Leu Thr Arg Pro Ser Leu 
20 25 30 

Cys Lys Glu Trp Glu Ala Ala Val Arg Arg Lys Asn Phe Lys Pro Thr 
35 40 45 

Lys Tyr Ser Ser Ile Cys Ser Glu His Phe Thr Pro Asp Cys Phe Lys 
50 55 60 

Arg Glu Cys Asn Asn Lys Leu Leu Lys Glu Asn Ala Val Pro Thr Ile 
65 70 75 80 

Phe Leu Cys Thr Glu Pro His Asp Lys Lys Glu Asp Leu Leu Glu Pro 
85 90 95 

Gln Glu Gln Leu Pro Pro Pro Pro Leu Pro Pro Pro Val Ser Gln Val 
100 105 110 

Asp Ala Ala Ile Gly Leu Leu Met Pro Pro Leu Gln Thr Pro Val Asn 
115 120 125 

Leu Ser Val Phe Cys Asp His Asn Tyr Thr Val Glu Asp Thr Met His 
130 135 140 

Gln Arg Lys Arg Ile His Gln Leu Glu Gln Gln Val Glu Lys Leu Arg 
145 150 155 160 

Lys Lys Leu Lys Thr Ala Gln Gln Arg Cys Arg Arg Gln Glu Arg Gln 
165 170 175 

Leu Glu Lys Leu Lys Glu Val Val His Phe Gln Lys Glu Lys Asp Asp 
180 185 190 

Val Ser Glu Arg Gly Tyr Val Ile Leu Pro Asn Asp Tyr Phe Glu Ile 
195 200 205 

Val Glu Val Pro Ala 
210 



<210> 400 
<211> 2167 
<212> DNA 

<213> Homo sapiens 
<400> 400 

gaagctgcct ccgccatctt ggagatggga gacgggcgat ggctgtggtc cttctgctaa 60 
tgcaaacaac aaaacgggca cactagtcac ccccgaggga ggccaccatc actgtaactg 120 
ttggccaaag ctacaaaaga agcgagggaa tccaaccgag cgcagcgaca ctgagaacag 180 
cttcccctgc cttctgcggc ggcagaagtg aagtgcctga ggaccggaag gatggtgcag 240 
tcctgctccg cctacggctg caagaaccgc tacgacaagg acaagcccgt ttctttccac 300 
aagtttcctc ttactcgacc cagtctttgt aaagaatggg aggcagctgt cagaagaaaa 360 
aactttaaac ccaccaagta tagcagtatt tgttcagagc actttactcc agactgcttt 420 
aagagagagt gcaacaacaa gttactgaaa gagaatgctg tgcccacaat atttctttgt 480 
actgagccac atgacaagaa agaagatctt ctggagccac aggaacagct tcccccacct 540 
cctttaccgc ctcctgtttc ccaggttgat gctgctattg gattactaat gccgcctctt 600 
cagacccctg ttaatctctc agttttctgt gaccacaact atactgtgga ggatacaatg 660 
caccagcgga aaaggattca tcagctagaa cagcaagttg aaaaactcag aaagaagctc 720 
aagaccgcac agcagcgatg cagaaggcaa gaacggcagc ttgaaaaatt aaaggaggtt 780 
gttcacttcc agaaagagaa agacgacgta tcagaaagag gttatgtgat tctaccaaat 840 
gactactttg aaatagttga agtaccagca taaaaaaatg aaatgtgtat tgatttctaa 900 
tggggcaata ccacatatcc tcctctagcc tgtaaaggag tttcatttaa aaaaataaca 960 
tttgattact tatataaaaa cagttcagaa tattttttta aaaaaaattc tatatatact 1020 
gtaaaattat aaattttttt gtttgtaatt tcaggttttt tacattttaa caaaatattt 1080 
taaaagttat aaactaacct cagacctcta atgtaagttg gtttcaagat tggggatttt 1140 
ggggtttttt tttttagtat ttatagaaat aatgtaaaaa taaaaagtaa agagaatgag 1200 
aacagtgtgg taaaagggag atttcagttt aaaacttaaa attagtactg ttttattgag 1260 
agaatttagt tatattttaa atcagaagta tgggtcagat catgggacat aacttcttag 1320 
aatatatata tacatatgta catattctca tatgtaaagt cacaaggttc atttatcttt 1380 
ctgaatcagt tatcaaagat aaattggcaa gtcagtactt aagaaaaaag atttgattat 1440 
J??^??" gaaaaaagtc attgcatatc tgatcaataa cttcagattc taagagtgga 1500 
tttttttttt tttacatggg ctcctatttt ttcccctact gtcttgcatt ataaaattag 1560 
aagtgtattt tcagtggaag aaacattttt caataaataa agtaaggcat tgtcatcaat 1620 
gaagtaatta aaactgggac ctgatctatg atacgctttt ttctttcatt acaccctagc 1680 
tgaaggacat ccagttcccc agctgtagtt atgtatctgc cttcaagtct ctgacaaatg 1740 
tgctgtgtta gtagagtttg atttgtatca tatgataatc ttgcacttga ctgagttggg 1800 
acaaggcttc acataaaaaa ttatttcttc acttttaaca caagttagaa attatatccc 1860 
atttagttaa atgcgtgatt tatattcaga acaacctact atgtagcgtt tattttactg 1920 
aatgtggaga tttaaacact gaggtttctg ttcaaactgt gagttctgtt ctttgtgaga 1980 
aattttacat atattggaag tgaaaatatg ttctgagtaa acaaatattg ctatgggagt 2040 
tatcttttta gatttagaat aactgttcca atgataatta ttacttttat atttcaaagt 2100 
acactaagat cgttgaagag caatagaacc tttaagacag tattaaaggt gtgaagcaat 2160 



ggcattc 2167 

<210> 401 
<211> 823 
<212> PRT 

<213> Homo sapiens 
<400> 401 

Met Gly Lys Lys Arg Thr Lys Gly Lys Thr Val Pro Ile Asp Asp Ser 
1 5 10 15 

Ser Glu Thr Leu Glu Pro Val Cys Arg His Ile Arg Lys Gly Leu Glu 
20 25 30 

Gln Gly Asn Leu Lys Lys Ala Leu Val Asn Val Glu Trp Asn Ile Cys 
35 40 45 

Gln Asp Cys Lys Thr Asp Asn Lys Val Lys Asp Lys Ala Glu Glu Glu 
50 55 60 

Thr Glu Glu Lys Pro Ser Val Trp Leu Cys Leu Lys Cys Gly His Gln 
65 70 75 80 

Gly Cys Gly Arg Asn Ser Gln Glu Gln His Ala Leu Lys His Tyr Leu 
85 90 95 

Thr Pro Arg Ser Glu Pro His Cys Leu Val Leu Ser Leu Asp Asn Trp 
100 105 110 

Ser Val Trp Cys Tyr Val Cys Asp Asn Glu Val Gln Tyr Cys Ser Ser 
115 120 125 

Asn Gln Leu Gly Gln Val Val Asp Tyr Val Arg Lys Gln Ala Ser Ile 
130 135 140 

Thr Thr Pro Lys Pro Ala Glu Lys Asp Asn Gly Asn Ile Glu Leu Glu 
145 150 155 160 

Asn Lys Lys Leu Glu Lys Glu Ser Lys Asn Glu Gln Glu Arg Glu Lys 
165 170 175 

Lys Glu Asn Met Ala Lys Glu Asn Pro Pro Met Asn Ser Pro Cys Gln 
180 185 190 

Ile Thr Val Lys Gly Leu Ser Asn Leu Gly Asn Thr Cys Phe Phe Asn 
195 200 205 

Ala Val Met Gln Asn Leu Ser Gln Thr Pro Val Leu Arg Glu Leu Leu 
210 215 220 

Lys Glu Val Lys Met Ser Gly Thr Ile Val Lys Ile Glu Pro Pro Asp 
225 230 235 240 

Leu Ala Leu Thr Glu Pro Leu Glu Ile Asn Leu Glu Pro Pro Gly Pro 
245 250 255 

Leu Thr Leu Ala Met Ser Gln Phe Leu Asn Glu Met Gln Glu Thr Lys 
260 265 270 

Lys Gly Val Val Thr Pro Lys Glu Leu Phe Ser Gln Val Cys Lys Lys 
275 280 285 

Ala Val Arg Phe Lys Gly Tyr Gln Gln Gln Asp Ser Gln Glu Leu Leu 
290 295 300 

Arg Tyr Leu Leu Asp Gly Met Arg Ala Glu Glu His Gln Arg Val Ser 
305 310 315 320 

Lys Gly Ile Leu Lys Ala Phe Gly Asn Ser Thr Glu Lys Leu Asp Glu 
325 330 335 

Glu Leu Lys Asn Lys Val Lys Asp Tyr Glu Lys Lys Lys Ser Met Pro 
340 345 350 

Ser Phe Val Asp Arg Ile Phe Gly Gly Glu Leu Thr Ser Met Ile Met 
355 360 365 

Cys Asp Gln Cys Arg Thr Val Ser Leu Val His Glu Ser Phe Leu Asp 
370 375 380 

Leu Ser Leu Pro Val Leu Asp Asp Gln Ser Gly Lys Lys Ser Val Asn 
385 390 395 400 

Asp Lys Asn Leu Lys Lys Thr Val Glu Asp Glu Asp Gln Asp Ser Glu 
405 410 415 

Glu Glu Lys Asp Asn Asp Ser Tyr Ile Lys Glu Arg Ser Asp Ile Pro 
420 425 430 

Ser Gly Thr Ser Lys His Leu Gln Lys Lys Ala Lys Lys Gln Ala Lys 
435 440 445 

Lys Gln Ala Lys Asn Gln Arg Arg Gln Gln Lys Ile Gln Gly Lys Val 
450 455 460 

Leu His Leu Asn Asp Ile Cys Thr Ile Asp His Pro Glu Asp Ser Glu 
465 470 475 480 

Tyr Glu Ala Glu Met Ser Leu Gln Gly Glu Val Asn Ile Lys Ser Asn 
485 490 495 

His Ile Ser Gln Glu Gly Val Met His Lys Glu Tyr Cys Val Asn Gln 
500 505 510 

Lys Asp Leu Asn Gly Gln Ala Lys Met Ile Glu Ser Val Thr Asp Asn 
515 520 525 

Gln Lys Ser Thr Glu Glu Val Asp Met Lys Asn Ile Asn Met Asp Asn 
530 535 540 

Asp Leu Glu Val Leu Thr Ser Ser Pro Thr Arg Asn Leu Asn Gly Ala 
545 550 555 560 

Tyr Leu Thr Glu Gly Ser Asn Gly Glu Val Asp Ile Ser Asn Gly Phe 
565 570 575 

Lys Asn Leu Asn Leu Asn Ala Ala Leu His Pro Asp Glu Ile Asn Ile 
580 585 590 

Glu Ile Leu Asn Asp Ser His Thr Pro Gly Thr Lys Val Tyr Glu Val 
595 600 605 

Val Asn Glu Asp Pro Glu Thr Ala Phe Cys Thr Leu Ala Asn Arg Glu 
610 615 620 

Val Phe Asn Thr Asp Glu Cys Ser Ile Gln His Cys Leu Tyr Gln Phe 
625 630 635 640 

Thr Arg Asn Glu Lys Leu Arg Asp Ala Asn Lys Leu Leu Cys Glu Val 
645 650 655 

Cys Thr Arg Arg Gln Cys Asn Gly Pro Lys Ala Asn Ile Lys Gly Glu 
660 665 670 

Arg Lys His Val Tyr Thr Asn Ala Lys Lys Gln Met Leu Ile Ser Leu 
675 680 685 

Ala Pro Pro Val Leu Thr Leu His Leu Lys Arg Phe Gln Gln Ala Gly 
690 695 700 

Phe Asn Leu Arg Lys Val Asn Lys His Ile Lys Phe Pro Glu Ile Leu 
705 710 715 720 

Asp Leu Ala Pro Phe Cys Thr Leu Lys Cys Lys Asn Val Ala Glu Glu 
725 730 735 

Asn Thr Arg Val Leu Tyr Ser Leu Tyr Gly Val Val Glu His Ser Gly 
740 745 750 

Thr Met Arg Ser Gly His Tyr Thr Ala Tyr Ala Lys Ala Arg Thr Ala 
755 760 765 

Asn Ser His Leu Ser Asn Leu Val Leu His Gly Asp Ile Pro Gln Asp 
770 775 780 

Phe Glu Met Glu Ser Lys Gly Gln Trp Phe His Ile Ser Asp Thr His 
785 790 795 800 

Val Gln Ala Val Pro Thr Thr Lys Val Leu Asn Ser Gln Ala Tyr Leu 
805 810 815 

Leu Phe Tyr Glu Arg Ile Leu 
820 



<210> 402 

<211> 2903 

<212> DNA 

<213> Homo sapiens 

<400> 402 

ctcaattcgt caccaggagg aagacggagc tggctgccca gcccaaaggc ccatgagggg 60 
atgcagttat gggctctgtc gccgtggatt gttattttgt gtcagtaagt aatccataaa 120 
gtgccaacat gggaaagaaa cggacaaagg gaaaaactgt tccaatcgat gattcctctg 180 
aaactttaga acctgtgtgc agacacatta gaaaaggatt ggaacaaggt aatttgaaaa 240 
aggctttagt gaatgtggaa tggaatatct gccaagactg taagactgac aataaagtga 300 
aagataaagc tgaagaagaa acagaagaaa agccttcagt ttggctgtgt cttaaatgtg 360 
gccatcaggg ctgtggcaga aattctcagg agcagcatgc cttgaagcac tatctgacgc 420 
caagatctga acctcactgt ctggttctta gtttggacaa ctggagtgta tggtgttacg 480 
tatgtgataa tgaggtccag tattgtagtt caaaccagtt gggtcaagtg gttgattatg 540 
tcagaaaaca agccagcatt acaactccaa agccagcaga gaaagataat ggaaatattg 600 
aacttgaaaa taaaaaatta gaaaaagaga gtaagaatga acaagagaga gaaaagaagg 660 
aaaacatggc taaagagaat cctcccatga attctccttg ccaaataacc gtgaaaggac 720 
tcagtaattt gggaaacaca tgtttcttca atgcagttat gcagaacttg tcacaaacac 780 
cagtgcttag agaactacta aaagaagtga aaatgtctgg aacaattgta aaaattgaac 840 
cacctgattt ggcattaaca gaaccattag aaataaacct tgagcctcca ggccctctta 900 
ctttagccat gagccagttt cttaatgaga tgcaagagac caaaaagggg gttgtgacac 960 
cgaaagaact cttttctcag gtctgtaaaa aagcagtgcg gtttaaaggc tatcagcagc 1020 
aagacagcca ggagctgctt cgctacttat tggatgggat gagagcagaa gaacaccaaa 1080 
gagtgagtaa aggaatactt aaagcatttg gtaattctac tgaaaagttg gatgaagaac 1140 
taaaaaataa agttaaagat tatgagaaga aaaaatcaat gccaagtttt gttgaccgca 1200 
tctttggtgg tgaactaact agtatgatca tgtgtgatca atgcagaact gtctccttgg 1260 
ttcatgaatc tttccttgat ttgtccctcc cagttttaga tgatcagagt ggtaagaaaa 1320 
gtgtaaatga taaaaatctg aaaaagacag tggaggatga agatcaagat agtgaggaag 1380 
aaaaagataa cgacagttac ataaaagaga gaagtgatat tccttctgga acaagtaagc 1440 
acttacagaa aaaagcaaag aaacaagcca aaaagcaagc caagaaccaa cgaagacaac 1500 
aaaaaattca aggaaaagtt cttcatttaa atgatatttg tactattgac catcctgaag 1560 
acagtgaata tgaagctgaa atgtcacttc aaggagaagt aaatattaaa tccaaccata 1620 
tttcacaaga gggtgttatg cataaagaat attgtgtcaa ccagaaagat ttgaatggcc 1680 
aagcaaaaat gatcgaaagt gtaactgaca atcaaaaatc cacagaggaa gtagatatga 1740 
aaaatatcaa catggataat gatctggagg ttttaacatc ttctcccact aggaatttaa 1800 
atggtgccta cctaacggaa gggagcaatg gagaagtgga catttccaat ggtttcaaaa 1860 
acctaaattt gaatgctgct cttcatcctg atgaaataaa tatagagatt ctgaatgata 1920 
gtcatactcc tggaacaaag gtgtatgagg ttgtaaatga agatccagaa actgctttct 1980 
gtactcttgc aaacagggaa gttttcaata ctgatgagtg ttcaatccaa cattgtttat 2040 
atcagttcac ccgtaatgag aaacttcgag atgcgaataa actgctttgt gaagtatgca 2100 
cacggagaca gtgtaatgga ccaaaggcaa atataaaagg tgaaaggaag catgtttaca 2160 
ccaatgccaa aaagcagatg ctaatttctc ttgctcctcc tgttcttact cttcatttaa 2220 
agagatttca gcaggctggt tttaacctac gcaaagttaa caaacacata aagtttccgg 2280 
aaatcttaga tttggctcct ttttgcaccc ttaaatgtaa gaatgttgca gaagaaaata 2340 
caagggtact ctattcctta tatggagttg ttgaacacag tggtactatg aggtcggggc 2400 
attacactgc ctatgccaag gcaagaaccg caaatagtca tctctctaat cttgttcttc 2460 
acggtgatat tccacaagat tttgaaatgg aatcaaaagg gcagtggttt cacatcagcg 2520 
acacacatgt gcaagctgtg cctacaacta aagtactaaa ctcacaagcg tacctcctat 2580 
tttatgagag aatactgtaa taatatcaaa agcacttttt ctggaaacac atttatggct 2640 
tttataatgg ctgaaataac gataaaaaaa gactaattaa aatcatgttc acttaacatt 2700 
aaatacatgc cagaagaaat catgtttatt taaatattga agggaaaaat acctaaaaat 2760 
gtacaaaggt tttatattgt catagtggtt tttattcctg ctttgtttct ggaaaggaaa 2820 
tcctgaatta cttaagtact ttgtgtttaa tatatctggg tgatggatca caacacatca 2880 
ataaactgac ttaccctaaa atc 2903 

<210> 403 
<211> 205 
<212> PRT 

<213> Homo sapiens 



<400> 403 



Met Ala Leu Gln Leu Ser Arg Glu Gln Gly Ile Thr Leu Arg Gly Ser 
1 5 10 15 

Ala Glu Ile Val Ala Glu Phe Phe Ser Phe Gly Ile Asn Ser Ile Leu 
20 25 30 

Tyr Gln Arg Gly Ile Tyr Pro Ser Glu Thr Phe Thr Arg Val Gln Lys 
35 40 45 

Tyr Gly Leu Thr Leu Leu Val Thr Thr Asp Leu Glu Leu Ile Lys Tyr 
50 55 60 

Leu Asn Asn Val Val Glu Gln Leu Lys Asp Trp Leu Tyr Lys Cys Ser 
65 70 75 80 

Val Gln Lys Leu Val Val Val Ile Ser Asn Ile Glu Ser Gly Glu Val 
85 90 95 

Leu Glu Arg Trp Gln Phe Asp Ile Glu Cys Asp Lys Thr Ala Lys Asp 
100 105 110 

Asp Ser Ala Pro Arg Glu Lys Ser Gln Lys Ala Ile Gln Asp Glu Ile 
115 120 125 

Arg Ser Val Ile Arg Gln Ile Thr Ala Thr Val Thr Phe Leu Pro Leu 
130 135 140 

Leu Glu Val Ser Cys Ser Phe Asp Leu Leu Ile Tyr Thr Asp Lys Asp 
145 150 155 160 

Leu Val Val Pro Glu Lys Trp Glu Glu Ser Gly Pro Gln Phe Ile Thr 
165 170 175 

Asn Ser Glu Glu Val Arg Leu Arg Ser Phe Thr Thr Thr Ile His Lys 
180 185 190 

Val Asn Ser Met Val Ala Tyr Lys Ile Pro Val Asn Asp 
195 200 205 



<210> 404 

<211> 1390 

<212> DNA 

<213> Homo sapiens 



<400> 404 

gggaagtgct gttggagccg ctgtggttgc tgtccgcgga gtggaagcgc gtgcttttgt 60 
ttgtgtccct ggccatggcg ctgcagctct cccgggagca gggaatcacc ctgcgcggga 120 
gcgccgaaat cgtggccgag ttcttctcat tcggcatcaa cagcatttta tatcagcgtg 180 
gcatatatcc atctgaaacc tttactcgag tgcagaaata cggactcacc ttgcttgtaa 240 
ctactgatct tgagctcata aaatacctaa ataatgtggt ggaacaactg aaagattggt 300 
tatacaagtg ttcagttcag aaactggttg tagttatctc aaatattgaa agtggtgagg 360 
tcctggaaag atggcagttt gatattgagt gtgacaagac tgcaaaagat gacagtgcac 420 
ccagagaaaa gtctcagaaa gctatccagg atgaaatccg ttcagtgatc agacagatca 480 
cagctacggt gacatttctg ccactgttgg aagtttcttg ttcatttgat ctgctgattt 540 
atacagacaa agatttggtt gtacctgaaa aatgggaaga gtcgggacca cagtttatta 600 
ccaattctga ggaagtccgc cttcgttcat ttactactac aatccacaaa gtaaatagca 660 
tggtggccta caaaattcct gtcaatgact gaggatgaca tgaggaaaat aatgtaattg 720 
taattttgaa atgtggtttt cctgaaatca ggtcatctat agttgatatg ttttatttca 780 
ttggttaatt tttacatgga gaaaaccaaa atgatactta ctgaactgtg tgtaattgtt 840 
cctttatttt tttggtacct atttgactta ccatggagtt aacatcatga atttattgca 900 
cattgttcaa aaggaaccag gaggtttttt tgtcaacatt gtgatgtata ttcctttgaa 960 
gatagtaact gtagatggaa aaacttgtgc tataaagcta gatgctttcc taaatcagat 1020 
gttttggtca agtagtttga ctcagtatag gtagggagat atttaagtat aaaatacaac 1080 
aaaggaagtc taaatattca gaatctttgt taaggtcctg aaagtaactc ataatctata 1140 
aacaatgaaa tattgctgta tagctccttt tgaccttcat ttcatgtata gttttcccta 1200 
ttgaatcagt ttccaattat ttgactttaa tttatgtaac ttgaacctat gaagcaatgg 1260 
atatttgtac tgtttaatgt tctgtgatac agaactctta aaaatgtttt ttcatgtgtt 1320 
ttataaaatc aagttttaag tgaaagtgag gaaataaagt taagtttgtt ttaaaaaaaa 1380 
aaaaaaaaaa 1390 

<210> 405 
<211> 464 
<212> PRT 
<213> Homo sapiens 

<400> 405 

Met Glu Thr Leu Ser Phe Pro Arg Tyr Asn Val Ala Glu Ile Val Ile 
1 5 10 15 

His Ile Arg Asn Lys Ile Leu Thr Gly Ala Asp Gly Lys Asn Leu Thr 
20 25 30 

Lys Asn Asp Leu Tyr Pro Asn Pro Lys Pro Glu Val Leu His Met Ile 
35 40 45 

Tyr Met Arg Ala Leu Gln Ile Val Tyr Gly Ile Arg Leu Glu His Phe 
50 55 60 

Tyr Met Met Pro Val Asn Ser Glu Val Met Tyr Pro His Leu Met Glu 
65 70 75 80 

Gly Phe Leu Pro Phe Ser Asn Leu Val Thr His Leu Asp Ser Phe Leu 

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Pro 


He 


Cys 


Arg 








100 


Lys 


Ala 


Lys 


Arg 






115 




His 


Phe 


xii- y 


m n 

Vj J. u 




130 






i y j. 


Lys 


Ser 


Ser 


1 4 S 

-L " — ' 








Glu 


Ala 


Leu 


Met 


Gin 


Glu 


Glu 


Phe 








180 


Ser 


Leu 


Asn 


Gin 






i ^? j 




Asn 


Ser 


Gin 


Lys 




£. J. \J 






Glu 


Leu 


V s 


Leu 


0 9 S 








Lys 


Thr 


Lys 


He 


Lys 


Met 


Lys 


Asp 








260 


Vdi. 


OlU 


Lys 


Tyr 






07 c. 






ulll 


Leu 






Z. -7 L/ 






Asp 


Asn 


Arg 


Glu 










Glu 


Asp 


Gin 


He 


Glu 


Glu 


Asn 


Ser 








340 


J.ct 


i in 


nid 








1 ^ s 




Tur 

i yj. 


Lys 


Ar g 


Thr 




O / U 






Gly 


Ala 


Val 


Tvr 










He 


Lys 


Leu 


Gly 


Leu 


Lys 


Ser 


Gin 








420 


Tyr 


His 


Asp 


Gly 






435 




Asp 


Glu 


Lys 


Thr 




450 







Val 


Asn 


Asp 


Phe 


Thr 


Ser 


Arg 


Phe 








120 


-tt-La 




/^.x y 








1 J J 




nld 


rio p 


Lys 


I it! L. 




1 c;n 

JL OU 






Lys 


Leu 


ol U 


Arg 


IDj 








Lys 


bJ- 11 


Leu 




Asp 


Phe 


His 


Gin 








200 


Lys 




Asn 


Tip 










Leu 


vax 


vax 


oer 




0 ^ n 






V d 1 


Asp 


Ser 


Pro 










1 nr 


Vdl 


b»xn 


Lys 


Glu 


lie 


Tyr 


Gly 








280 


vai 


bXn 


Leu 


Tyr 






O Q C 

z y 3 




Lys 


Leu 


/\x a 


<r ^ -y 

oer 




Ol A 
J J.U 






bill 


ber 


Asp 


blU 


JzD 








Fne 


Lys 


Arg 


x#eu 


Phe 


Lys 


He 


Asn 








360 


Vdl 


ire 


blU 


Asp 






J / o 




blU 


Arg 


vax 


i nr 




O Q A 






He 


Gin 


Gin 


Leu 


4 05 








Glu 


lie 


Phe 


Leu 


lie 


Glu 


Lys 


Ala 








440 


Ala 


Glu 


Leu 


Lys 






455 





90 



Glu 


Thr 


Ala 


Asp 


105 








Leu 


Ser Gly 


lie 


Thr 


Tyr Met 


Glu 








140 


Gin 


Gin 


Leu 


Asn 






155 




Leu 


Zi en 
nop 




Val 




170 






Asp 


Gly 


He 


Gin 


185 








Lys 


Thr 


He 


Val 


Ser 




Lys 


Thr 








220 


Leu 


Lys 




lie 






9 "3 R 

6 J J 




Glu 


Lys 


Leu 


Lys 




250 






Leu 


Lys 


Asn 


Ala 


265 








As p 




vdi 


Asp 


Gin 


Lys 


Lys 


lie 








300 


lie 


Leu 


Lys 


Glu 






315 




Jul 


Glu 


Leu 


Lys 




330 






Met 


lie 


Val 


Lys 


345 








Lys 


Lys 


His 


Glu 


Cys 


Asn 


Lys 


Val 








380 


Thr 


lie 


Asn 


Gin 






395 




Lys 


Asp Ala 


Ala 




410 






Asn 


Leu 


Lys 


Thr 


425 








Ala 


Glu 


Asp 


Ser 


Arg 


Lys 


Met 


Phe 








460 



95 

He Leu Cys Pro 
±10 

lie Asn Phe lie 
125 

Phe Leu Trp Gin 

Ala Ala His Gin 
160 



Pro 


Val 


Glu 


Glu 






175 




blU 


Leu 


bin 


bin 




1 on 






Leu 


bin 


bXU 


bi y 










Lys 


Arg 


Leu 


As n 




Pin 


oer 


Leu 








t /i n 
z 4 U 


Asn 


Tyr 


Lys 


Glu 






255 




Arg 


bin 


bill 


17-. 1 

val 




OTA 






Cys 


Leu 


Pro 


oer 


28£ 








bin 


TV 

ASp 


Leu 


ber 


oer 


Leu 


Asn 


Leu 










Lys 


lieu 


Lys 








335 




Lys 


Glu 


Lys 


Leu 




350 






Asp 


Val 


Lys 


Gin 


365 








Gin 


Glu 


Lys 


Arg 


Glu 


lie 


Gin 


Lys 








400 


Glu 


Arg 


Glu 


Lys 






415 




Ala 


Leu 


Glu 


Lys 




430 






Tyr 


Ala 


Lys 


He 


445 








Lys 


Met 


Ser 


Thr 



<210> 406 
<211> 1857 
<212> DNA 

<213> Homo sapiens 
<400> 406 

ggaatggggc gggacttcca gtaggaggcg gcaagtttga aaagtgatga cggttgacgt 60
ttgctgattt ttgactttgc ttgtagctgc tccccgaact cgccgtcttc ctgtcggcgg 120
ccggcactgt agattaacag gaaacttcca agatggaaac tttgtctttc cccagatata 180
atgtagctga gattgtgatt catattcgca ataagatctt aacaggagct gatggtaaaa 240
acctcaccaa gaatgatctt tatccaaatc caaagcctga agtcttgcac atgatctaca 300
tgagagcctt acaaatagta tatggaattc gactggaaca tttttacatg atgccagtga 360
actctgaagt catgtatcca catttaatgg aaggcttctt accattcagc aatttagtta 420
ctcatctgga ctcatttttg cctatctgcc gggtgaatga ctttgagact gctgatattc 480
tatgtccaaa agcaaaacgg acaagtcggt ttttaagtgg cattatcaac tttattcact 540
tcagagaagc atgccgtgaa acgtatatgg aatttctttg gcaatataaa tcctctgcgg 600
acaaaatgca acagttaaac gccgcacacc aggaggcatt aatgaaactg gagagacttg 660
attctgttcc agttgaagag caagaagagt tcaagcagct ttcagatggt attcaggagc 720
tacaacaatc actaaatcag gattttcatc aaaaaacgat agtgctgcaa gagggaaatt 780
cccaaaagaa gtcaaatatt tcagagaaaa ccaagcgttt gaatgaacta aaattgttgg 840
tggtttcttt gaaagaaata caagagagtt tgaaaacaaa aattgtggat tctccagaga 900
agttaaagaa ttataaagaa aaaatgaaag atacggtcca gaagcttaaa aatgccagac 960
aagaagtggt ggagaaatat gaaatctatg gagactcagt tgactgcctg ccttcatgtc 1020
agttggaagt gcagttatat caaaagaaaa tacaggacct ttcagataat agggaaaaat 1080
tagccagtat cttaaaggag agcctgaact tggaggacca aattgagagt gatgagtcag 1140
aactgaagaa attgaagact gaagaaaatt cgttcaaaag actgatgatt gtgaagaagg 1200
aaaaacttgc cacagcacaa ttcaaaataa ataagaagca tgaagatgtt aagcaataca 1260
aacgcacagt aattgaggat tgcaataaag ttcaagaaaa aagaggtgct gtctatgaac 1320
gagtaaccac aattaatcaa gaaatccaaa aaattaaact tggaattcaa caactaaaag 1380
atgctgctga aagggagaaa ctgaagtccc aggaaatatt tctaaacttg aaaactgctt 1440
tggagaaata ccacgacggt attgaaaagg cagcagagga ctcctatgct aagatagatg 1500
agaagacagc tgaactgaag aggaagatgt tcaaaatgtc aacctgatta acaaaattac 1560
atgtcttttt gtaaatggct tgccatcttt taattttcta tttagaaaga aaagttgaag 1620
cgaatggaag tatcagaagt accaaataat gttggcttca tcagttttta tacactctca 1680
taagtagtta ataagatgaa tttaatgtag gcttttatta atttataatt aaaataactt 1740
gtgcagctat tcatgtctct actctgcccc ttgttgtaaa tagtttgagt aaaacaaaac 1800
tagttacctt tgaaatatat atattttttt ctgttaaaaa aaaaaaaaaa aaaaaaa 1857

<210> 407
<211> 1050
<212> PRT

<213> Homo sapiens

<400> 407

Met Ala Ala Val Lys Lys Glu Gly Gly Ala Leu Ser Glu Ala Met Ser
1 5 10 15

Leu Glu Gly Asp Glu Trp Glu Leu Ser Lys Glu Asn Val Gln Pro Leu
20 25 30

Arg Gln Gly Arg Ile Met Ser Thr Leu Gln Gly Ala Leu Ala Gln Glu
35 40 45

Ser Ala Cys Asn Asn Thr Leu Gln Gln Gln Lys Arg Ala Phe Glu Tyr
50 55 60

Glu Ile Arg Phe Tyr Thr Gly Asn Asp Pro Leu Asp Val Trp Asp Arg
65 70 75 80

Tyr Ile Ser Trp Thr Glu Gln Asn Tyr Pro Gln Gly Gly Lys Glu Ser
85 90 95

Asn Met Ser Thr Leu Leu Glu Arg Ala Val Glu Ala Leu Gln Gly Glu
100 105 110

Lys Arg Tyr Tyr Ser Asp Pro Arg Phe Leu Asn Leu Trp Leu Lys Leu
115 120 125

Gly Arg Leu Cys Asn Glu Pro Leu Asp Met Tyr Ser Tyr Leu His Asn
130 135 140

Gln Gly Ile Gly Val Ser Leu Ala Gln Phe Tyr Ile Ser Trp Ala Glu
145 150 155 160

Glu Tyr Glu Ala Arg Glu Asn Phe Arg Lys Ala Asp Ala Ile Phe Gln
165 170 175

Glu Gly Ile Gln Gln Lys Ala Glu Pro Leu Glu Arg Leu Gln Ser Gln
180 185 190

His Arg Gln Phe Gln Ala Arg Val Ser Arg Gln Thr Leu Leu Ala Leu
195 200 205

Glu Lys Glu Glu Glu Glu Glu Val Phe Glu Ser Ser Val Pro Gln Arg
210 215 220

Ser Thr Leu Ala Glu Leu Lys Ser Lys Gly Lys Lys Thr Ala Arg Ala
225 230 235 240

Pro Ile Ile Arg Val Gly Gly Ala Leu Lys Ala Pro Ser Gln Asn Arg
245 250 255

Gly Leu Gln Asn Pro Phe Pro Gln Gln Met Gln Asn Asn Ser Arg Ile
260 265 270

Thr Val Phe Asp Glu Asn Ala Asp Glu Ala Ser Thr Ala Glu Leu Ser
275 280 285

Lys Pro Thr Val Gln Pro Trp Ile Ala Pro Pro Met Pro Arg Ala Lys
290 295 300

Glu Asn Glu Leu Gln Ala Gly Pro Trp Asn Thr Gly Arg Ser Leu Glu
305 310 315 320

His Arg Pro Arg Gly Asn Thr Ala Ser Leu Ile Ala Val Pro Ala Val
325 330 335

Leu Pro Ser Phe Thr Pro Tyr Val Glu Glu Thr Ala Gln Gln Pro Val
340 345 350

Met Thr Pro Cys Lys Ile Glu Pro Ser Ile Asn His Ile Leu Ser Thr
355 360 365

Arg Lys Pro Gly Lys Glu Glu Gly Asp Ser Leu Gln Arg Val Gln Ser
370 375 380

His Gln Gln Ala Ser Glu Glu Lys Lys Glu Lys Met Met Tyr Cys Lys
385 390 395 400

Glu Lys Ile Tyr Ala Gly Val Gly Glu Phe Ser Phe Glu Glu Ile Arg
405 410 415

Ala Glu Val Phe Arg Lys Lys Leu Lys Glu Gln Arg Glu Ala Glu Leu
420 425 430

Leu Thr Ser Ala Glu Lys Arg Ala Glu Met Gln Lys Gln Ile Glu Glu
435 440 445

Met Glu Lys Lys Leu Lys Glu Ile Gln Thr Thr Gln Gln Glu Arg Thr
450 455 460

Gly Asp Gln Gln Glu Glu Thr Met Pro Thr Lys Glu Thr Thr Lys Leu
465 470 475 480

Gln Ile Ala Ser Glu Ser Gln Lys Ile Pro Gly Met Thr Leu Ser Ser
485 490 495

Ser Val Cys Gln Val Asn Cys Cys Ala Arg Glu Thr Ser Leu Ala Glu
500 505 510

Asn Ile Trp Gln Glu Gln Pro His Ser Lys Gly Pro Ser Val Pro Phe
515 520 525

Ser Ile Phe Asp Glu Phe Leu Leu Ser Glu Lys Lys Asn Lys Ser Pro
530 535 540

Pro Ala Asp Pro Pro Arg Val Leu Ala Gln Arg Arg Pro Leu Ala Val
545 550 555 560

Leu Lys Thr Ser Glu Ser Ile Thr Ser Asn Glu Asp Val Ser Pro Asp
565 570 575

Val Cys Asp Glu Phe Thr Gly Ile Glu Pro Leu Ser Glu Asp Ala Ile
580 585 590

Ile Thr Gly Phe Arg Asn Val Thr Ile Cys Pro Asn Pro Glu Asp Thr
595 600 605

Cys Asp Phe Ala Arg Ala Ala Arg Phe Val Ser Thr Pro Phe His Glu
610 615 620

Ile Met Ser Leu Lys Asp Leu Pro Ser Asp Pro Glu Arg Leu Leu Pro
625 630 635 640

Glu Glu Asp Leu Asp Val Lys Thr Ser Glu Asp Gln Gln Thr Ala Cys
645 650 655

Gly Thr Ile Tyr Ser Gln Thr Leu Ser Ile Lys Lys Leu Ser Pro Ile
660 665 670

Ile Glu Asp Ser Arg Glu Ala Thr His Ser Ser Gly Phe Ser Gly Ser
675 680 685

Ser Ala Ser Val Ala Ser Thr Ser Ser Ile Lys Cys Leu Gln Ile Pro
690 695 700

Glu Lys Leu Glu Leu Thr Asn Glu Thr Ser Glu Asn Pro Thr Gln Ser
705 710 715 720

Pro Trp Cys Ser Gln Tyr Arg Arg Gln Leu Leu Lys Ser Leu Pro Glu
725 730 735

Leu Ser Ala Ser Ala Glu Leu Cys Ile Glu Asp Arg Pro Met Pro Lys
740 745 750

Leu Glu Ile Glu Lys Glu Ile Glu Leu Gly Asn Glu Asp Tyr Cys Ile
755 760 765

Lys Arg Glu Tyr Leu Ile Cys Glu Asp Tyr Lys Leu Phe Trp Val Ala
770 775 780

Pro Arg Asn Ser Ala Glu Leu Thr Val Ile Lys Val Ser Ser Gln Pro
785 790 795 800

Val Pro Trp Asp Phe Tyr Ile Asn Leu Lys Leu Lys Glu Arg Leu Asn
805 810 815

Glu Asp Phe Asp His Phe Cys Ser Cys Tyr Gln Tyr Gln Asp Gly Cys
820 825 830

Ile Val Trp His Gln Tyr Ile Asn Cys Phe Thr Leu Gln Asp Leu Leu
835 840 845

Gln His Ser Glu Tyr Ile Thr His Glu Ile Thr Val Leu Ile Ile Tyr
850 855 860

Asn Leu Leu Thr Ile Val Glu Met Leu His Lys Ala Glu Ile Val His
865 870 875 880

Gly Asp Leu Ser Pro Arg Cys Leu Ile Leu Arg Asn Arg Ile His Asp
885 890 895

Pro Tyr Asp Cys Asn Lys Asn Asn Gln Ala Leu Lys Ile Val Asp Phe
900 905 910

Ser Tyr Ser Val Asp Leu Arg Val Gln Leu Asp Val Phe Thr Leu Ser
915 920 925

Gly Phe Arg Thr Val Gln Ile Leu Glu Gly Gln Lys Ile Leu Ala Asn
930 935 940

Cys Ser Ser Pro Tyr Gln Val Asp Leu Phe Gly Ile Ala Asp Leu Ala
945 950 955 960

His Leu Leu Leu Phe Lys Glu His Leu Gln Val Phe Trp Asp Gly Ser
965 970 975

Phe Trp Lys Leu Ser Gln Asn Ile Ser Glu Leu Lys Asp Gly Glu Leu
980 985 990

Trp Asn Lys Phe Phe Val Arg Ile Leu Asn Ala Asn Asp Glu Ala Thr
995 1000 1005

Val Ser Val Leu Gly Glu Leu Ala Ala Glu Met Asn Gly Val Phe Asn
1010 1015 1020

Thr Thr Phe Gln Ser His Leu Asn Lys Ala Leu Trp Lys Val Gly Lys
1025 1030 1035 1040

Leu Thr Ser Pro Gly Ala Leu Leu Phe Gln
1045 1050



<210> 408 

<211> 3702 

<212> DNA 

<213> Homo sapiens 

<400> 408 

gtttgttagg gagtcgtgtg cgtgccttgg tcgcttctgt agctccgagg gcaggttgcg 60
gaagaaagcc caggcggtct gtggcccaga ggaaaggcct gcagcaggac gaggacctga 120
gccaggaatg caggatggcg gcggtgaaga aggaaggggg tgctctgagt gaagccatgt 180
ccctggaggg agatgaatgg gaactgagta aagaaaatgt acaaccttta aggcaagggc 240
ggatcatgtc cacgcttcag ggagcactgg cacaagaatc tgcctgtaac aatactcttc 300
agcagcagaa acgggcattt gaatatgaaa ttcgatttta cactggaaat gaccctctgg 360
atgtttggga taggtatatc agctggacag agcagaacta tcctcaaggt gggaaggaga 420
gtaatatgtc aacgttatta gaaagagctg tagaagcact acaaggagaa aaacgatatt 480
atagtgatcc tcgatttctc aatctctggc ttaaattagg gcgtttatgc aatgagcctt 540
tggatatgta cagttacttg cacaaccaag ggattggtgt ttcacttgct cagttctata 600
tctcatgggc agaagaatat gaagctagag aaaactttag gaaagcagat gcgatatttc 660
aggaagggat tcaacagaag gctgaaccac tagaaagact acagtcccag caccgacaat 720
tccaagctcg agtgtctcgg caaactctgt tggcacttga gaaagaagaa gaggaggaag 780 
tttttgagtc ttctgtacca caacgaagca cactagctga actaaagagc aaagggaaaa 840 
agacagcaag agctccaatc atccgtgtag gaggtgctct caaggctcca agccagaaca 900 
gaggactcca aaatccattt cctcaacaga tgcaaaataa tagtagaatt actgtttttg 960 
atgaaaatgc tgatgaggct tctacagcag agttgtctaa gcctacagtc cagccatgga 1020 
tagcaccccc catgcccagg gccaaagaga atgagctgca agcaggccct tggaacacag 1080 
gcaggtcctt ggaacacagg cctcgtggca atacagcttc actgatagct gtacccgctg 1140 
tgcttcccag tttcactcca tatgtggaag agactgcaca acagccagtt atgacaccat 1200 
gtaaaattga acctagtata aaccacatcc taagcaccag aaagcctgga aaggaagaag 1260 
gagattctct acaaagggtt cagagccatc agcaagcgtc tgaggagaag aaagagaaga 1320 
tgatgtattg taaggagaag atttatgcag gagtagggga attctccttt gaagaaattc 1380 
gggctgaagt tttccggaag aaattaaaag agcaaaggga agccgagcta ttgaccagtg 1440 
cagagaagag agcagaaatg cagaaacaga ttgaagagat ggagaagaag ctaaaagaaa 1500 
tccaaactac tcagcaagaa agaacaggtg atcagcaaga agagacgatg cctacaaagg 1560 
agacaactaa actgcaaatt gcttccgagt ctcagaaaat accaggaatg actctatcca 1620 
gttctgtttg tcaagtaaac tgttgtgcca gagaaacttc acttgcggag aacatttggc 1680 
aggaacaacc tcattctaaa ggtcccagtg tacctttctc catttttgat gagtttcttc 1740 
tttcagaaaa gaagaataaa agtcctcctg cagatccccc acgagtttta gctcaacgaa 1800 
gaccccttgc agttctcaaa acctcagaaa gcatcacctc aaatgaagat gtgtctccag 1860 
atgtttgtga tgaatttaca ggaattgaac ccttgagcga ggatgccatt atcacaggct 1920 
tcagaaatgt aacaatttgt cctaacccag aagacacttg tgactttgcc agagcagctc 1980 
gttttgtatc cactcctttt catgagataa tgtccttgaa ggatctccct tctgatcctg 2040 
agagactgtt accggaagaa gatctagatg taaagacctc tgaggaccag cagacagctt 2100 
gtggcactat ctacagtcag actctcagca tcaagaagct gagcccaatt attgaagaca 2160 
gtcgtgaagc cacacactcc tctggcttct ctggttcttc tgcctcggtt gcaagcacct 2220 
cctccatcaa atgtcttcaa attcctgaga aactagaact tactaatgag acttcagaaa 2280 
accctactca gtcaccatgg tgttcacagt atcgcagaca gctactgaag tccctaccag 2340 
agttaagtgc ctctgcagag ttgtgtatag aagacagacc aatgcctaag ttggaaattg 2400 
agaaggaaat tgaattaggt aatgaggatt actgcattaa acgagaatac ctaatatgtg 2460 
aagattacaa gttattctgg gtggcgccaa gaaactctgc agaattaaca gtaataaagg 2520 
tatcttctca acctgtccca tgggactttt atatcaacct caagttaaag gaacgtttaa 2580 
atgaagattt tgatcatttt tgcagctgtt atcaatatca agatggctgt attgtttggc 2640 
accaatatat aaactgcttc acccttcagg atcttctcca acacagtgaa tatattaccc 2700 
atgaaataac agtgttgatt atttataacc ttttgacaat agtggagatg ctacacaaag 2760 
cagaaatagt ccatggtgac ttgagtccaa ggtgtctgat tctcagaaac agaatccacg 2820 
atccctatga ttgtaacaag aacaatcaag ctttgaagat agtggacttt tcctacagtg 2880 
ttgaccttag ggtgcagctg gatgttttta ccctcagcgg ctttcggact gtacagatcc 2940 
tggaaggaca aaagatcctg gctaactgtt cttctcccta ccaggtagac ctgtttggta 3000 
tagcagattt agcacattta ctattgttca aggaacacct acaggtcttc tgggatgggt 3060 
ccttctggaa acttagccaa aatatttctg agctaaaaga tggtgaattg tggaataaat 3120 
tctttgtgcg gattctgaat gccaatgatg aggccacagt gtctgttctt ggggagcttg 3180 
cagcagaaat gaatggggtt tttgacacta cattccaaag tcacctgaac aaagccttat 3240 
ggaaggtagg gaagttaact agtcctgggg ctttgctctt tcagtgagct aggcaatcaa 3300 
gtctcacaga ttgctgcctc agagcaatgg ttgtattgtg gaacactgaa actgtatgtg 3360 
ctgtaattta atttaggaca catttagatg cactaccatt gctgttctac tttttggtac 3420 
aggtatattt tgacgtcact gatatttttt atacagtgat atacttactc atggccttgt 3480 
ctaacttttg tgaagaacta ttttattcta aacagactca ttacaaatgg ttaccttgtt 3540 
atttaaccca tttgtctcta cttttccctg tacttttccc atttgtaatt tgtaaaatgt 3600 
tctcttatga tcaccatgta ttttgtaaat aataaaatag tatctgttaa aaaaaaaaaa 3660 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3702 

<210> 409 
<211> 164 
<212> PRT 

<213> Homo sapiens 
<400> 409 

Met Pro Cys Ser Glu Glu Thr Pro Ala Ile Ser Pro Ser Lys Arg Ala

1 5 10 15

Arg Pro Ala Glu Val Gly Gly Met Gln Leu Arg Phe Ala Arg Leu Ser
20 25
Glu His Ala T hr Ala Pro Thr ^ g Gly Ser ^ ^ ^ ^ ^ ^ 

Asp Leu T yr Ser Ala T yr Asp Tyr Thr Ile Pro pro ^ ^ ^ ^ 

V.1 Val L ys Th r Asp lie Gin lie Ala Leu Pro II Gly Cys Tyr ^ 

" 9 V " ^ His «>• XL Asp Val 

Gly Ala cay Val Xle Asp Glu Asp jg Irg Cl y Asn Val Giy ? ml Val 

" 115 L - f£ V - L - L - Gl y Asp Arg 



He Ma Gin Leu He C ys jj„ ^ Ile phe Tyr prQ J* ^ ^ ^ 



135 



-I Gin Ala Leu Asp Asp Thr Glu Arg Gly Ser Gly ° Qly ^ ^ ^ 
Thr Gly Lys Asn 155 160 



<210> 410 
<211> 1816 
<212> DNA 

<213> Homo sapiens 
<400> 410 



g c ™ gg c 9 c 9 t c gc c g s a tr^oToit L g a a :c gagaca r gccattt cacc — a 6 ° 

cgccacggcc cccacccggg ct c q ^ ttt 9 ccc ^ c tctccgagca 120 

tgattacaca ataccaccta 1 ^ Q 9 f g ° tacgacctgt acagtgccta 180 

cccttct ggg tgttatggaa gagtggc'cc a^caac* aC " a " ttc Wagcgct 240 
t g at g ta gga gctcjgtgtca taLL.l ^cggtcaggc tt ggc t g caa aacactttat 300 
taattttggc aaa a g ttgaagtcaa 393993 aat «"Wtg ttgtactgtt 360 

cgaacggatt ttttatcSag a ! a a 9 ^" 93 C 3 aa ttgcac agctcatttg 420 
ttca ggaggt tttggttcca ctggaaag 93 ttaaa"?^ " ggatgaCa ~gaaa gggg 480 
aagtcatacc tttttcttaa aaaaaa*^ ttaaaattta tgccaagaac agaaaacaag 540 
tgcacttctg taaactta" agc'tta'ct t^aaaata "tggtgtt? 600 

tatgatcaag gaaaagatcg ttaaaaaaaa ™aaagta ctgcattttt tacttttttt 660 
caaaaagaaa ctttg?tt« ccgcaattoa laa.?*? 9 ?* gtttttctt t gtgtttggat 720 
ctgatgtaaa cagtgtc^tc ttaaaaSa ff 5 taaa tctgct ttgtggtgac 780 

gcctgtattt aactcatatg Select" aaclt,?^ attaca 9 att aaaaaaaaaa ,840 
tcttaagcaa tatttttta? tcagtaaaca a 3 "tgctttaa ttgctttaaa 900 

aagctgaact aaaataaaaa tgaaaagg^g " «-Mtacaa aatcttgeat 960 

tcttcactag tctaaaaact tcttJttaa? tttlt ? 99 tattcctt ^ tcttcccttc 1020 
aagaatcctc agtttatt« Sacta'a "tgtgatga gggtgagaaa 1080 

gggtagaggt atgtttgtga aaaacatota tgataaatcc tctattgact 1140 

ccttgaattt c.?ctc££ ggLaaSgt Sa ^f^" "Wtttgttc 1200 

gaagtagcaa taggctgtaa ?caagaaaat at2 ? £ tacga 9" ttccctcagt 1260 
taatacttca gccaccaggt ttttctgtct ta W****9 ataaatgaaa 1320 

atgggactga ttctgtggct taccttgS aacatctt^ agCagCattt cattgeagat 1380 
ttcctttctt tactatgttt ctcagatJcc ttto I'" 93«3ttttg ctagtgtgct 1440 
ttttgtccat cagattctgt gagacaccag « a ?ca^? """^ gtcacttagg 1500 
tggagtgctt ctggaactat c'agee gaccScag J^X'*" gttataCaca "60 

agtgttcttt tctggtgatt ctccaggcca t^aataccc f,"" 9 caca 5gcaag 1620 
gtggctcaca tttcattagt gagecatgaa atoaact^o ^gcaatgtaa ttgtccctct 1680 

tgcataccag gttgggctat aaaatatt'c S«rtcH? llTrTS* 9 1740 
ctaaaaaaaa aaaaaa L 9«gccaat aaattttaaa tgttttcctg 1800 

1816 

<210> 411 
<211> 1388 
<212> PRT

<213> Homo sapiens 
<400> 411 

Met Ala Pro Gly Cys Lys Thr Glu Leu Arg Ser Val Thr Asn Gly Gln
1 5 10 15

Ser Asn Gln Pro Ser Asn Glu Gly Asp Ala Ile Lys Val Phe Val Arg
20 25 30

Ile Arg Pro Pro Ala Glu Arg Ser Gly Ser Ala Asp Gly Glu Gln Asn
35 40 45

Leu Cys Leu Ser Val Leu Ser Ser Thr Ser Leu Arg Leu His Ser Asn
50 55 60

Pro Glu Pro Lys Thr Phe Thr Phe Asp His Val Ala Asp Val Asp Thr
65 70 75 80

Thr Gln Glu Ser Val Phe Ala Thr Val Ala Lys Ser Ile Val Glu Ser
85 90 95

Cys Met Ser Gly Tyr Asn Gly Thr Ile Phe Ala Tyr Gly Gln Thr Gly
100 105 110

Ser Gly Lys Thr Phe Thr Met Met Gly Pro Ser Glu Ser Asp Asn Phe
115 120 125

Ser His Asn Leu Arg Gly Val Ile Pro Arg Ser Phe Glu Tyr Leu Phe
130 135 140

Ser Leu Ile Asp Arg Glu Lys Glu Lys Ala Gly Ala Gly Lys Ser Phe
145 150 155 160

Leu Cys Lys Cys Ser Phe Ile Glu Ile Tyr Asn Glu Gln Ile Tyr Asp
165 170 175

Leu Leu Asp Ser Ala Ser Ala Gly Leu Tyr Leu Arg Glu His Ile Lys
180 185 190

Lys Gly Val Phe Val Val Gly Ala Val Glu Gln Val Val Thr Ser Ala
195 200 205

Ala Glu Ala Tyr Gln Val Leu Ser Gly Gly Trp Arg Asn Arg Arg Val
210 215 220

Ala Ser Thr Ser Met Asn Arg Glu Ser Ser Arg Ser His Ala Val Phe
225 230 235 240

Thr Ile Thr Ile Glu Ser Met Glu Lys Ser Asn Glu Ile Val Asn Ile
245 250 255

Arg Thr Ser Leu Leu Asn Leu Val Asp Leu Ala Gly Ser Glu Arg Gln
260 265 270

Lys Asp Thr His Ala Glu Gly Met Arg Leu Lys Glu Ala Gly Asn Ile
275 280 285

Asn Arg Ser Leu Ser Cys Leu Gly Gln Val Ile Thr Ala Leu Val Asp
290 295 300

Val Gly Asn Gly Lys Gln Arg His Val Cys Tyr Arg Asp Ser Lys Leu
305 310 315 320

Thr Phe Leu Leu Arg Asp Ser Leu Gly Gly Asn Ala Lys Thr Ala Ile
325 330 335

Ile Ala Asn Val His Pro Gly Ser Arg Cys Phe Gly Glu Thr Leu Ser
340 345 350

Thr Leu Asn Phe Ala Gln Arg Ala Lys Leu Ile Lys Asn Lys Ala Val
355 360 365

Val Asn Glu Asp Thr Gln Gly Asn Val Ser Gln Leu Gln Ala Glu Val
370 375 380

Lys Arg Leu Lys Glu Gln Leu Ala Glu Leu Ala Ser Gly Gln Thr Pro
385 390 395 400

Pro Glu Ser Phe Leu Thr Arg Asp Lys Lys Lys Thr Asn Tyr Met Glu
405 410 415

Tyr Phe Gln Glu Ala Met Leu Phe Phe Lys Lys Ser Glu Gln Glu Lys
420 425 430

Lys Ser Leu Ile Glu Lys Val Thr Gln Leu Glu Asp Leu Thr Leu Lys
435 440 445

Lys Glu Lys Phe Ile Gln Ser Asn Lys Met Ile Val Lys Phe Arg Glu
J.P M» tl . „. at5 Lys Leu His Lys <60 ^ ^ ^ 

»» u. Pro «. ci. <=l„ asp arg Leu leu LI Glu i.„ flrg fls „ J" 
11. a» T hr jjj arg Glu G1 „ n « »° His pro fltg «s 

Tyr AU „« „„ flsn H1 . s „ flrg Glu Hu M. ^ ^ 

Leu Lee Glu Pro val Lys ^ al , 01n Glu „ et flsp ™ 01o 
«. Lys ^ Glu Ly3 „. Ph . ser „„ "0 ^ ^ 

*»P L y s fl „ G1 „ ^ GJy ph( , s „ iys »5 ^ ^ ^ ^ 

~u «» Jjj Ihr Glu Lys ^ »0 ^ u Lm 575 

Thr 0 1U Ljj *s„ ta Ser i y , „ Glu Iyt a , phe HI G ^ ^ 

Thr Jrj Ly , arg Gln L9U G;u ^ Mu 505 ^ ^ ^ 

Lys «. Aso z.e u asn j |u G1 „ as „ Leu Glu «» ^ c ^ 



Lys »r g Gln Glu „ s ;- G1 „ leu ftsn Lys 635 ^ Ma t ^ «j. 
Lys II« U. jjj Thr Pro Thr lys T 5» Gln Lm hu Sk 655 



V.I Pro £ l6u ser Pro G1 „ Gly s ^ Sm 570 ^ ^ 

Sin jjj s„ Ser Ile u. Jjp as „ asp 1U leu as „ "J pro pro 
Pro «!■ H.t As„ Glu a. ». Phe Glu al , n . ™ Gio Lm 
Thr V,! Gln „„ ^ „ ser al , L . u G1 „ flap Giu 7 f 0 

£l " 5S »S -P - «~ =lu His 

His ser ,jr „„ „. t Gln Glu ph . ^ p . r ^ ^ 750 ^ ^ 

THr , y j G1 „ Gln Glu Glu le „ s „ Gln flsn 755 ^ ^ ^ 

«n Leu Gin Glu Thr ^ T „ r Lys p|)e 780 ^ ^ ^ ^ 

His aSp U. Ar, V.J V,i „ is Ser au HI Lys Glu Leu s „ 

V.X L,s to jjj Tyr s „ s „ ppe "J flsn Giu l ^ 8X5 ^ 

»»n lys jjj s „ G1 „ flrg „ ls Mn g ^ 830 a ^ 

Leu jjj u. =1„ a ,„ Glu ^ Leu leu Gl0 s „ lys HI l ^ Gin 

865 S " ^ SS " ^ «*■ =I» a. -t Lys !£ Ga<J Iu asp „„ 
Leu Ser Srg fls „ jg „„ p „ e Ly , yj ^ ^ ^ ^ 880 

Ser asp L . u a; „ a „ leu „ G1 „ JO Giu Ma Gio 885 



As„ a ,„ ffi L . u s „ Leu G1 „ pp? Glu Glu Gju „. sm 



Lys Glu He Leu Lys Val Leu Glu Ala v*i n " „ 925 

930 o^s ^ Gln Glu Lys 

94 0 
Thr Cys Glu Gln Gln Met Ala Lys Val Gln Lys Leu Glu Z*H -J
950 955 960

Glu Ser Leu Leu Ala Thr Glu Lys Val Ile Ser Ser Leu Glu Lys
965 970 975

Arg Asp Ser Lys Lys Val Val Ala Asp Leu Met Asn Gln Tip Gln
980 985 990

Glu Leu Arg Thr Ser Val Cys Glu Lys Thr Glu Thr Ile Asp Thr Leu
995 1000 1005

Lys Gln Glu Leu Lys Asp Ile Asn Cys Lys Tyr Asn Ser Ala Leu Val
1010 1015 1020

Asp Arg Glu Glu Ser Arg Val Leu Ile Lys Lys Gln Glu Val Asp Ile
1025 1030 1035 1040

Leu Asp Leu Lys Glu Thr Leu Arg Leu Arg Ile Leu Ser Glu Asp Ile
1045 1050 1055

Glu Arg Asp Met Leu Cys Glu Asp Leu Ala His Ala Thr Glu Gln Leu
1060 1065 1070

Asn Met Leu Thr Glu Ala Ser Lys Lys His Ser Gly Leu Leu Gln Ser
1075 1080 1085

Ala Gln Glu Glu Leu Thr Lys Lys Glu Ala Leu Ile Gln Glu Leu Gln
1090 1095 1100

His Lys Leu Asn Gln Lys Lys Glu Glu Val Glu Gln Lys Lys Asn Glu
1105 1110 1115 1120

Tyr Asn Phe Lys Met Arg Gln Leu Glu His Val Met Asp Ser Ala Ala
1125 1130 1135

Glu Asp Pro Gln Ser Pro Lys Thr Pro Pro His Phe Gln Thr His Leu
1140 1145 1150

Ala Lys Leu Leu Glu Thr Gln Glu Gln Glu Ile Glu Asp Gly Arg Ala
1155 1160 1165

Ser Lys Thr Ser Leu Glu His Leu Val Thr Lys Leu Asn Glu Asp Arg
1170 1175 1180

Glu Val Lys Asn Ala Glu Ile Leu Arg Met Lys Glu Gln Leu Arg Glu
1185 1190 1195 1200

Met Glu Asn Leu Arg Leu Glu Ser Gln Gln Leu Ile Glu Lys Asn Trp
1205 1210 1215

Leu Leu Gln Gly Gln Leu Asp Asp Ile Lys Arg Gln Lys Glu Asn Ser
1220 1225 1230

Asp Gln Asn His Pro Asp Asn Gln Gln Leu Lys Asn Glu Gln Glu Glu
1235 1240 1245

Ser Ile Lys Glu Arg Leu Ala Lys Ser Lys Ile Val Glu Glu Met Leu
1250 1255 1260

Lys Met Lys Ala Asp Leu Glu Glu Val Gln Ser Ala Leu Tyr Asn Lys
1265 1270 1275 1280

Glu Met Glu Cys Leu Arg Met Thr Asp Glu Val Glu Arg Thr Gln Thr
1285 1290 1295

Leu Glu Ser Lys Ala Phe Gln Glu Lys Glu Gln Leu Arg Ser Lys Leu
1300 1305 1310

Glu Glu Met Tyr Glu Glu Arg Glu Arg Thr Ser Gln Glu Met Glu Met
1315 1320 1325

Leu Arg Lys Gln Val Glu Cys Leu Ala Glu Glu Asn Gly Lys Leu Val
1330 1335 1340

Gly His Gln Asn Leu His Gln Lys Ile Gln Tyr Val Val Arg Leu Lys
1345 1350 1355 1360

Lys Glu Asn Val Arg Leu Ala Glu Glu Thr Glu Lys Leu Arg Ala Glu
1365 1370 1375

Asn Val Phe Leu Lys Glu Lys Lys Arg Ser Glu Ser
1380 1385



<210> 412 
<211> 4775 
<212> DNA 
<213> Homo sapiens

<400> 412 

cagtcgcgcg cggtgcagtc gggaggtgga ggcaccggct gcattatttt r-o™ - 
gggtgagggc gctatggcac ccggctqcaa aactaaan, ™~ gtttt c 99gatcgag 60 
gtctaaccaa ccaag^aatg aaggtgatgc a f 9 ^ 3 Caaat S9 tca 120 

tgcagaaaga tctqqqtcaa p t l„" 9 catcaaagtt tttgtgcgaa ttcgtcctcc 180 

cacga 9 gtc?c cggSg^c? S ££££ JKSSS TT^ ^ 

agatgtggat accactcagg aatctgtatt tocaactata 9 9 atcat 9 tt 9 c 300 

ttgcatgagc ggttataatg gtaccatc^t tacaS^ 9 gCtaaaa 9 ca ttgtggagtc 360 
atttactatg a'gggaccat c^ctga Sttt^ "KSSS Ca '**"* ac 420 
cccacgaagt tttgaatatt tatttterM- I cataacctga gaggagtaat 480 

tggaaagagt ttcctttgta agjgttcctt o a't f aaaa ^^ aggctggagc 540 

tctactggac tctgcatcgg ctaSa S ,tll tacaacgagc agatatatga 600 

tgttgttggt gcggtggagc aaSoa^a^ f^* 9 ?^ "tatcaaga agggagtctt 660 
tggaggat'gg aggaa'agac gtgjgg^c aacatca'tq 9a3gCCtatC ^tgctgtc 720 
tcatgccgtc tttacaatta latitats aacatcaat 9 aacagagaat cgtctaggtc 780 
acggacctcc ctactcLcc tggtgga? t a^Stct agtaatgaga "gtgaatat 840 
tgcagaaggg atgagattga aaa aa agcaggatct gaaaggcaaa aagataccca 900 

ccaagtga?? acagc c t tcqacatooa ItJ ^atcattga gctgcctggg 960 

agactccaaa cttacct Stacoqqa tttrZ??*** CagagaCatg "tgctacag 10 20 
aattgcaaat gttcatcctg a 1 ttcccttgga ggtaatgcca aaacagccat 1080 
tgctcaaaga gccaagctga aaalattT aCCCtatcaa cacttaactt 1140 

tgtgagccag ctccaagc^g aagtg^gag gctcaa^aa ttlT^" ° CCaaggaaa 120 ° 
aggacagaca ccaccagaaa gcttcctqac a la L caactggcgg agcttgcttc 1260 
gtatttccag gaagcaatgt ?attcS?« L a agaagacta actatatgga 1320 

agaaaaagtt acccaattag " caggaaaaga agtctctgat 1380 

taaaatgatt gtgaaattcc qaaaaaatr* ° C ^ Caaaaa 9 gaaaaattta ttcaatctaa 1440 
atcccgggga g g ?tttctgc goatc^t?" " ggaaaagc tCCaCaag * a »<>0 

gattcaaact ctgcgagaac aaatagagca o^tocaf "I t3aggaatga 15 <* 

aaatcattcc ctcagggagg agaatagaag actgagatta ^aqaqccta f tgCtatgga 1620 
tcaagaaatg gatgcccaga ccattocaaa ^t^I ttagagcctg tgaaaagagc 1680 

catggagaaa agtcjacaaaa a" at^cacct S^cS" 9 3aataagtgg 1740 

tttgtttgca aacactqaoa aat-i™* 99 . ! \ CCt aaagctcaga aagagccatg 1800 
taattcaaag caaga aaaac^"? Caaattca 9 a cagagctgaa 1860 

ggaatcagag cttcaqtctt to™ a 9 aactta ct aggaaaaggc agctagaatt 1920 

aacaaaa c tgcaagcggc aaqaaqtttc g ! aCC " aat ^tgaaaacc ttttggaagc 1980 
taagattata aLcS ccaact?^ aaaattca ^ ctgaaacact 2040 

aagccctgaa atgggaagct ttggrtctct ataL tcccgaccag taccaaaatt 2100 
tgatatatta aaigagccag ttcctSqa 1 9 aatt ^agca tattagataa 2160 
tgaagagctt agaacagtgc agqaacaaat ^aagcttttg aggccatttc 2220 

agagcataaa aacctaaagc tSqcaqca ZaltZlr caa gccaaac tggatgaaga 2280 
aatgcaggag cttttctcat cagaaaqaat tllltn ct ^ aaca tc attctaccca 2340 

ctcacagttg aatgtcctjq aaaaqcfort * gattggacc aaacagcagg aagagcttct 2400 
gaaaagjgaj gtacatgacc tgc^gta'gt oTttlT.Tt ToT^** atgaCttt " ^460 
agtgaaattg gaatataqtt cattcaaa!, Ztll g<=tgacaagg agctttcttc 2520 

tgaaagacac atgcatg?ac ag^tcaatt a "t^ 9 aaagaattca acaaactttc 2580 
gcttgagagc aaagcct' tfcaggaSc c?atqacaac 3ggttagaaa «<*aaaagct 2640 
tgagattgac caactttcaa gaaacScca a c "acaagaaa taatgaaatt 2700 

atctgatctg aataatttga tggaqct^ct 1 ' ^^agaaaatg aaactctgaa 2760 
atcattacag tttgaagaa'g aSgaaaa cTall'Tal qaaatc^" 3taaCaaatt 2820 
ggctgtacgt caggagaaac agaaagagac ggccaagjg? qaocaacaf ? agttCttga 2880 
acagaaacta gaagagaqct tacttarfar = aagtgt gagcagcaga tggcaaaagt 2940 

tagagattct gataagaaag tjgtalct'ga cc^tgaa? ToT^ tggaaaagtc 
atcggtctgt gagaaaacag aaactataga cacccjgaaa caaoL t" agCtaagaac 3060 
ttgcaaatac aactctgctt tggttgacaq ao a a ^aagaactga aggacataaa 3120 
ggaagtggat attctggatc tg' agagtgttga tcaagaagca 3180 

agagagggat atgctctgtg aggacctgjc tca^qcc^ 9 agaatacttt ctgaggacat 3240 
agaggcctca aaaaaacac cggggctgc? L!' c ^gcagctga acatgctcac 3300 
ggaagccctg attcaggaac t^g' L tgaccaa ? aa 3360 

gaagaagaat gaatataac, t caaaatgag US SgS >™ 
tgaggatccc cagagtccta agacaccacc tcactttcaa acacatttgg caaaactcct 3540
ggaaacacaa gaacaagaga tagaagatgg aagagcctct aagacttctt tggaacacct 3600
tgtaacaaag ctaaatgaag acagagaagt caaaaatgct gaaatcctca gaatgaagga 3660
gcagttgcgt gaaatggaaa acctacgcct ggaaagtcag cagttaatag agaaaaactg 3720
gctcctgcaa ggtcagctgg atgatattaa aagacaaaag gaaaacagtg atcagaatca 3780
tccagataat caacagctga agaatgaaca agaagaaagt atcaaagaaa gacttgcaaa 3840
aagtaaaata gttgaagaaa tgctgaaaat gaaagcagac ctagaagaag tccaaagtgc 3900
cctttacaac aaagagatgg aatgccttag aatgactgat gaagtcgaac gaacccaaac 3960
tttggagtct aaagcattcc aggaaaaaga acaactgaga tcaaagctgg aagaaatgta 4020
tgaagaaaga gagagaacat cccaggagat ggaaatgtta aggaagcagg tggagtgtct 4080
tgctgaggaa aatggaaagt tggtaggtca ccaaaatttg catcagaaga ttcagtacgt 4140
agtgcgacta aagaaggaaa atgtcaggct tgctgaggag acagaaaagt tgcgtgccga 4200
aaatgtattt ttaaaagaaa agaaaagaag tgaatcttga ggattccggt cagctaccta 4260
ggcatcacct tgtttgaaga tgtttcttct cttttacaag taagacctac tcctggccac 4320
ttaggagagc tgaatttatg gaccttaatt attaaatgtt tataaggtgg tggtaaccac 4380
ctcaagtttc tgatgaacat tctgcatcca tatacaccct gtgacagtca gcagtctgct 4440
attaagtggc ctacttcaag gctttgaatc aacttaaggg aaaacctttt gtctttgtaa 4500
aaataaaagc ctgtagctaa ggtttacagt ggacattagc cagatcattt tcttcttaga 4560
ttatgccata atctcctttg attcttatgg aagttctaac aatatatggt ggttccaaca 4620
cctgcagtga gtttaatgac tgacttagta gcaggtacaa gaagcaaact tgttaatata 4680
gattattttt gtattcttac tttaggtatt ttacttgagc attttccatg actgtaaata 4740
aagccatttt ttaagataaa aaaaaaaaaa aaaaa 4775



<210> 413 
<211> 1056 
<212> PRT 

<213> Homo sapiens 



<400> 413 

Met Ala Ser Gln Pro Asn Ser Ser Ala Lys Lys Lys Glu Glu Lys Gly

1 5 10 15

Lys Asn Ile Gln Val Val Val Arg Cys Arg Pro Phe Asn Leu Ala Glu

20 25 30

Arg Lys Ala Ser Ala His Ser Ile Val Glu Cys Asp Pro Val Arg Lys

35 40 45

Glu Val Ser Val Arg Thr Gly Gly Leu Ala Asp Lys Ser Ser Arg Lys

50 55 60

Thr Tyr Thr Phe Asp Met Val Phe Gly Ala Ser Thr Lys Gln Ile Asp

65 70 75 80

Val Tyr Arg Ser Val Val Cys Pro Ile Leu Asp Glu Val Ile Met Gly

85 90 95

Tyr Asn Cys Thr Ile Phe Ala Tyr Gly Gln Thr Gly Thr Gly Lys Thr

100 105 110

Phe Thr Met Glu Gly Glu Arg Ser Pro Asn Glu Glu Tyr Thr Trp Glu

115 120 125

Glu Asp Pro Leu Ala Gly Ile Ile Pro Arg Thr Leu His Gln Ile Phe

130 135 140

Glu Lys Leu Thr Asp Asn Gly Thr Glu Phe Ser Val Lys Val Ser Leu

145 150 155 160

Leu Glu Ile Tyr Asn Glu Glu Leu Phe Asp Leu Leu Asn Pro Ser Ser

165 170 175

Asp Val Ser Glu Arg Leu Gln Met Phe Asp Asp Pro Arg Asn Lys Arg

180 185 190

Gly Val Ile Ile Lys Gly Leu Glu Glu Ile Thr Val His Asn Lys Asp

195 200 205

Glu Val Tyr Gln Ile Leu Glu Lys Gly Ala Ala Lys Arg Thr Thr Ala

210 215 220

Ala Thr Leu Met Asn Ala Tyr Ser Ser Arg Ser His Ser Val Phe Ser

225 230 235 240

Val Thr Ile His Met Lys Glu Thr Thr Ile Asp Gly Glu Glu Leu Val

245 250 255

Lys Ile Gly Lys Leu Asn Leu Val Asp Leu Ala Gly Ser Glu Asn Ile

260 265 270

Gly Arg Ser Gly Ala Val Asp Lys Arg Ala Arg Glu Ala Gly Asn Ile

275 280 285

Asn Gln Ser Leu Leu Thr Leu Gly Arg Val Ile Thr Ala Leu Val Glu

290 295 300

Arg Thr Pro His Val Pro Tyr Arg Glu Ser Lys Leu Thr Arg Ile Leu
305 310 315 320

Gln Asp Ser Leu Gly Gly Arg Thr Arg Thr Ser Ile Ile Ala Thr Ile

325 330 335

Ser Pro Ala Ser Leu Asn Leu Glu Glu Thr Leu Ser Thr Leu Glu Tyr

340 345 350

Ala His Arg Ala Lys Asn Ile Leu Asn Lys Pro Glu Val Asn Gln Lys

355 360 365

Leu Thr Lys Lys Ala Leu Ile Lys Glu Tyr Thr Glu Glu Ile Glu Arg

370 375 380

Leu Lys Arg Asp Leu Ala Ala Ala Arg Glu Lys Asn Gly Val Tyr Ile
385 390 395 400

Ser Glu Glu Asn Phe Arg Val Met Ser Gly Lys Leu Thr Val Gln Glu

405 410 415

Glu Gln Ile Val Glu Leu Ile Glu Lys Ile Gly Ala Val Glu Glu Glu
420 425 430

Leu Asn Arg Val Thr Glu Leu Phe Met Asp Asn Lys Asn Glu Leu Asp

435 440 445

Gln Cys Lys Ser Asp Leu Gln Asn Lys Thr Gln Glu Leu Glu Thr Thr

450 455 460

Gln Lys His Leu Gln Glu Thr Lys Leu Gln Leu Val Lys Glu Glu Tyr
465 470 475 480

Ile Thr Ser Ala Leu Glu Ser Thr Glu Glu Lys Leu His Asp Ala Ala

485 490 495

Ser Lys Leu Leu Asn Thr Val Glu Glu Thr Thr Lys Asp Val Ser Gly

500 505 510

Leu His Ser Lys Leu Asp Arg Lys Lys Ala Val Asp Gln His Asn Ala

515 520 525

Glu Ala Gln Asp Ile Phe Gly Lys Asn Leu Asn Ser Leu Phe Asn Asn

530 535 540

Met Glu Glu Leu Ile Lys Asp Gly Ser Ser Lys Gln Lys Ala Met Leu
545 550 555 560

Glu Val His Lys Thr Leu Phe Gly Asn Leu Leu Ser Ser Ser Val Ser

565 570 575

Ala Leu Asp Thr Ile Thr Thr Val Ala Leu Gly Ser Leu Thr Ser Ile

580 585 590

Pro Glu Asn Val Ser Thr His Val Ser Gln Ile Phe Asn Met Ile Leu

595 600 605

Lys Glu Gln Ser Leu Ala Ala Glu Ser Lys Thr Val Leu Gln Glu Leu

610 615 620

Ile Asn Val Leu Lys Thr Asp Leu Leu Ser Ser Leu Glu Met Ile Leu
625 630 635 640

Ser Pro Thr Val Val Ser Ile Leu Lys Ile Asn Ser Gln Leu Lys His

645 650 655

Ile Phe Lys Thr Ser Leu Thr Val Ala Asp Lys Ile Glu Asp Gln Lys

660 665 670

Lys Glu Leu Asp Gly Phe Leu Ser Ile Leu Cys Asn Asn Leu His Glu

675 680 685

Leu Gln Glu Asn Thr Ile Cys Ser Leu Val Glu Ser Gln Lys Gln Cys

690 695 700

Gly Asn Leu Thr Glu Asp Leu Lys Thr Ile Lys Gln Thr His Ser Gln
705 710 715 720

Glu Leu Cys Lys Leu Met Asn Leu Trp Thr Glu Arg Phe Cys Ala Leu

725 730 735

Glu Glu Lys Cys Glu Asn Ile Gln Lys Pro Leu Ser Ser Val Gln Glu
Val
Cys


Ser
Ile

Val Pro Phe Phe Gln His Lys Lys Ser His Gly Lys Asp Lys Glu Asn

1010 1015 1020

Arg Gly Ile Asn Thr Leu Glu Arg Ser Lys Val Glu Glu Thr Thr Glu

1025 1030 1035 1040

His Leu Val Thr Lys Ser Arg Leu Pro Leu Arg Ala Gln Ile Asn Leu

1045 1050 1055



<210> 414 
<211> 4908 
<212> DNA 

<213> Homo sapiens 
<400> 414 

acctgcgtgc agtcggtcct ccaggccacg cagcgcccga gagtaccagg gagactccgg 60
cccctgtcgg cgccaagccc ctccgcccct cacagcgccc aggtccgcgg ccgggccttg 120
attttttggc ggggaccgtc atggcgtcgc agccaaattc gtctgcgaag aagaaagagg 180
agaaggggaa gaacatccag gtggtggtga gatgcagacc atttaatttg gcagagcgga 240
aagctagcgc ccattcaata gtagaatgtg atcctgtacg aaaagaagtt agtgtacgaa 300
ctggaggatt ggctgacaag agctcaagga aaacatacac ttttgatatg gtgtttggag 360
catctactaa acagattgat gtttaccgaa gtgttgtttg tccaattctg gatgaagtta 420
ttatgggcta taattgcact atctttgcgt atggccaaac tggcactgga aaaactttta 480
caatggaagg tgaaaggtca cctaatgaag agtatacctg ggaagaggat cccttggctg 540
gtataattcc acgtaccctt catcaaattt ttgagaaact tactgataat ggtactgaat 600
tttcagtcaa agtgtctctg ttggagatct ataatgaaga gctttttgat cttcttaatc 660
catcatctga tgtttctgag agactacaga tgtttgatga tccccgtaac aagagaggag 720
tgataattaa aggtttagaa gaaattacag tacacaacaa ggatgaagtc tatcaaattt 780
tagaaaaggg ggcagcaaaa aggacaactg cagctactct gatgaatgca tactctagtc 840
^' C "f C agttttctct gttacaatac atatgaaaga aactacgatt gatggagaag 900
t C ^ ^ aatcggaaag ttgaacttgg ttgatcttgc aggaagtgaa aacattggcc 960 
gttctggagc tgttgataag agagctcggg aagctggaaa tataaatcaa tccctgttga 1020 
ctaaacta^ 9 ggtCa " act gcccttgtag aaagaacacc tcatgttcct tatcgagaat 1080 
caacaStt. ^agaatcctc "ggattctc ttggagggcg tacaagaaca tctataattg 1140 
tcctgcatct ctcaatcttg aggaaactct gagtacattg gaatatgctc 1200 
gaacatattg aataagcctg aagtgaatca gaaactcacc aaaaaagctc 1260 
lalllZ^ gt f ac ^ a g gagatagaac gtttaaaacg agatcttgct gcagcccgtg 1320 
S !2S!5 a " tct 9 aa ^a attttagagt catgagtgga aaattaactg 1380 
ttcaagaaga gcagattgta gaattgattg aaaaaattgg tgctgttgag gaggagctqa 1440 
tZlT'tll agag " gttt atggataata aaaatgaact tgaccagtg? LatSgacc 1500 
llrttntlt aacacaa <? aa cttgaaacca ctcaaaaaca tttgcaagaa actaaattac 1560 
aacttgttaa agaagaatat atcacatcag ctttggaaag tactgaggag aaacttcatg 1620 

at?ccaaact 9 aaa^T" aaCaCagttg aa gaaactac aaaagatgta tctggtctcc 1680 
attccaaact ggatcgtaag aaggcagttg accaacacaa tgcagaagct caggatattt 1740 
ttggcaaaaa cctgaatagt ctgtttaata atatggaaga attaattaag gatggcag" 18oS 
a?ot?tctac g9 " atgCta ^^tacata agaccttatt tggtaatctg ctgJctt ca 1860 
aalato^ attaga * acc a ttactacag tagcacttgg atctctcaca tctattccag 1920 
aaaatgtgtc tactcatgtt tctcagattt ttaatatgat actaaaagaa caatcattag 1980 
TtlZlSaal f aCa99Mt tgattaatgt actcaaga'ct gatcttct^ 2040 

?aaaacatat ttllt t l CCCCaact ? tggtgtctat actgaaaatc aatagtcaac 2100 
aactaaa^n lilt t ? , tcatt 9 aca 9 tggccgataa gatagaagat caaaaaaagg 2160 
tttottoall alll t 9t atact 9 fc 9 ta acaatctaca tgaactacaa gaaaatacca 2220 
tttqttcctt ggttgagtca caaaagcaat gtggaaacct aactgaagac ctgaagacaa 2280 
taaagcagac ccattcccag gaactttgca agttaatgaa tctttggaca gagagattct 2340 
gtgctttgga ggaaaagtgt gaaaatatac agaaaccact tagtagtgtc caggaaaata 2400 
tacagcagaa atctaaggat atagtcaaca aaatgacttt tcacagtcaa aaStttgtg 2460 
t^oaaaaatc ! g * CttCtca ca ^ aa ctca gaaattttaa ccaagaaggt acaaaat g 2520 
ttgaagaatc tgtgaaacac tctgataaac tcaatggcaa cctggaaaaa atatctcaag 2580 
ta?ct?cctt f gatgtgaa tctctgaaca caagaacagt ttatttttct gaacagtggg 2640 
tatcttcctt aaatgaaagg gaacaggaac ttcacaactt attggaggtt qtaaqccaat 2700 
gttgtgaggc ttcaagttca gacatcactg agaaatcaga tggacgtaag gcagltcatg 2760^ 

aa^tcta^ a'ctt'T" CttgatCaga ^actattga tgaagataaa^ ?tgLagcac 2820 
aaaatctaga acttaatgaa accataaaaa ttggtttgac taagcttaat tgctttctgq 2880 

"ccatca^c TatTl"^ gtaCgaCacc acagaggaaa agttat'S 

acccatcaac actggtaaga actgaaccac gtgaacatct ccttgatcag ctgaaaagga 3000 
aacagcctga gctgttaatg atgctaaact gttcagaaaa caacaaagaa gagacaaSc 3060 
cggatgtgga tgtagaagag gcagttctgg ggcagtatac tgaagaacct t ag ca g fill 
ataaaaaatc "JT^ 9tgg.ttgtt catcaattgg cggggttcca ttttLcagc 3^80 
ataaaaaatc acatggaaaa gacaaagaaa acagaggcat taacacactg gagaggtcta 3240 
Totllllll a a t?ca a ct 9 r CaC ^ tggtta "--^gcag attacctct^ %l q ll c %l 3300 
1*111 attcactt 09 gggttggcaa ttttattttt aaagaaaact taaaaataaa 3360 

acctgaaacc ccagaacttg agccttgtgt atagatttta aaagaatata tatatcagcc 3420 
gggcgcggtg gctcatgcct gtaatcccag cactttggga ggctgaggcg ggtggattgc 3480 
ttgagcccag gagtttgaga ccagcctggc caacgtggca aaacctcgtc tctg?taaaa HVo 
a aatcac 9 ?a tlll^"^ CaCtCCtgta atcccagcta ctggggaggc tgaggcacga 3600 
alrlanZlJ aaCCCa 99 aa gcggggttgc agtgagccaa aggtacacca ctacactcca 3660 
a C a 9 3 C cagagcaaga ctcggtctca aaaacaaaat ttaaaaaaga tataaggcag 3720 
t!t ! It?* ?T " tgatatCt acccattttt ctgtcatccc tatag?? C ac 3780 
Saaaa^aa t^???,^ atttgggatt tgcaatgtaa atacgtattt ctagttttca 3840 
tataaagtag ttcttttata acaaatgaaa agtatttttc ttgtatatta ttaagtaatg 3900 

ctotcctt'a a r t9taCt ^ \ tCtCagctt gagcttaaca taggtaaata tcacc.aca? 3960 
? aaaggaccat ctcatgtttt ttttcttgct atgacttgtg tattttcttg 4020 

catcctccct agacttccct atttcgcttt ctcctcggct cactttctcc ctttttattt 4080 
ttcaccaaac catttgtaga gctacaaaac ctatcctttc ttattttcag tagtcagaat 4140 
aaatctaacc *? HT? CaCCttttta ^tggttattt ctaaaatcac tgLaacaat 4200 
aaqcatttfa CCCtCCttta agtatttaaa acttgttgcc ccaaatgtga 4260 

c a c Ca ^a a ^^"^r 9 aggCC ^ aact "ttcaccct gacagagttc acaaaaagcc 4320 
cactttagag tatacattgc tattatggga gaccacccag acatctgact aatgqctctq 4380 
tgccacactc caagacctgt gccttttaga gaagctcaca atgattLag ga^gtttg 9 44^0 
aacttccaat tatgtctata atttatattc ttttgtttac atgatgaaac ?ttt?g"g? 4500 
tgcttgtttg tatataatac aatgtgtaca tgtatctttt tctcgattca aatcttaacc 4560

cttaggactc tggtattttt gatctggcaa ccatatttct ggaagttgag atgtttcagc 4620 

ttgaagaacc aaaacagaag gaatatgtac aaagaataaa ttttctgctc acgatgagtt 4680 

tagtgtgtaa agtttagaga catctgactt tgatagctaa attaaaccaa accctattga 4740 

agaattgaat atatgctact tcaagaaact aaattgatct cgtagaatta tcttaataaa 4800 

ataatggcta taatttctct gcaaaatcag atgtcagcat aagcgatgga taatacctaa 4860 

taaactgccc tcagtaaatc catggttaat aaatgtggtt tctacatt 4908 

<210> 415 
<211> 398 
<212> PRT 
<213> Homo sapiens 



<400> 415 
Met Ala Leu Leu Arg Arg Pro Thr Val Ser Ser Asp Leu Glu Asn Ile

1 5 10 15

Asp Thr Gly Val Asn Ser Lys Val Lys Ser His Val Thr Ile Arg Arg

20 25 30

Thr Val Leu Glu Glu Ile Gly Asn Arg Val Thr Thr Arg Ala Ala Gln

35 40 45

Val Ala Lys Lys Ala Gln Asn Thr Lys Val Pro Val Gln Pro Thr Lys

50 55 60

Thr Thr Asn Val Asn Lys Gln Leu Lys Pro Thr Ala Ser Val Lys Pro

65 70 75 80

Val Gln Met Glu Lys Leu Ala Pro Lys Gly Pro Ser Pro Thr Pro Glu

85 90 95

Asp Val Ser Met Lys Glu Glu Asn Leu Cys Gln Ala Phe Ser Asp Ala

100 105 110

Leu Leu Cys Lys Ile Glu Asp Ile Asp Asn Glu Asp Trp Glu Asn Pro

115 120 125

Gln Leu Cys Ser Asp Tyr Val Lys Asp Ile Tyr Gln Tyr Leu Arg Gln

130 135 140

Leu Glu Val Leu Gln Ser Ile Asn Pro His Phe Leu Asp Gly Arg Asp

145 150 155 160

Ile Asn Gly Arg Met Arg Ala Ile Leu Val Asp Trp Leu Val Gln Val

165 170 175

His Ser Lys Phe Arg Leu Leu Gln Glu Thr Leu Tyr Met Cys Val Gly

180 185 190

Ile Met Asp Arg Phe Leu Gln Val Gln Pro Val Ser Arg Lys Lys Leu

195 200 205

Gln Leu Val Gly Ile Thr Ala Leu Leu Leu Ala Ser Lys Tyr Glu Glu

210 215 220

Met Phe Ser Pro Asn Ile Glu Asp Phe Val Tyr Ile Thr Asp Asn Ala

225 230 235 240

Tyr Thr Ser Ser Gln Ile Arg Glu Met Glu Thr Leu Ile Leu Lys Glu

245 250 255

Leu Lys Phe Glu Leu Gly Arg Pro Leu Pro Leu His Phe Leu Arg Arg

260 265 270

Ala Ser Lys Ala Gly Glu Val Asp Val Glu Gln His Thr Leu Ala Lys

275 280 285

Tyr Leu Met Glu Leu Thr Leu Ile Asp Tyr Asp Met Val His Tyr His

290 295 300

Pro Ser Lys Val Ala Ala Ala Ala Ser Cys Leu Ser Gln Lys Val Leu

305 310 315 320

Gly Gln Gly Lys Trp Asn Leu Lys Gln Gln Tyr Tyr Thr Gly Tyr Thr

325 330 335

Glu Asn Glu Val Leu Glu Val Met Gln His Met Ala Lys Asn Val Val

340 345 350

Lys Val Asn Glu Asn Leu Thr Lys Phe Ile Ala Ile Lys Asn Lys Tyr

355 360 365

Ala Ser Ser Lys Leu Leu Lys Ile Ser Met Ile Pro Gln Leu Asn Ser

370 375 380

Lys Ala Val Lys Asp Leu Ala Ser Pro Leu Ile Gly Arg Ser

385 390 395



<210> 416 
<211> 1530 
<212> DNA 

<213> Homo sapiens 



<400> 416 

rtlZZnnnn* Caaggctaca gcgtcgaaga tccccagcgc tgcgggctcg gagagcagtc 60 
ctaacggcgc ctcgtacgct agtgtcctcc cttttcagtc cgcgtccctc cctgggccgg 120 
iTJ^nn "? C °" CCC c 9tccctcat ggcgctgctc cgacgcccga cggtgtccag 180 
tgatttggag aatattgaca caggagttaa ttctaaagtt aagagtcatg tgactattag 240 
gcgaactgtt ttagaagaaa ttggaaatag agttacaacc agagcagcac aagtagctaa 300 
gaaagctcag aacaccaaag ttccagttca acccaccaaa acaacaaatg tcaacaaaca 360 
actgaaacct actgcttctg tcaaaccagt acagatggaa aagttggctc caaagggtcc 420 
^tgaggatg tctccatgaa ggaagagaat ctctgccaag ctttttctga 480 
It^llnr tgCaaaatc 9 aggacattga taacgaagat tgggagaacc ctcagctctg 540 
tllcTctTj 9tt ^ gga l a tCtatcagta tctcaggcag ctggaggttt tgcagtccat 600 
t ttcttagatg gaagagatat aaatggacgc atgcgtgcca tcctagtgga 660 
Hllt^ 9 t a caagtccact ccaagtttag gcttctgcag gagactctgt acatgtgcgt 720 
talTaltt,? ^tcgatttt tacaggttca gccagtttcc cggaagaagc ttcaattagt 780 
lllltttrtt gctctgctct tggcttccaa gtatgaggag atgttttetc caaatattga 840 
S tacatcacag acaatgctta taccagttcc caaatccgag aaatggaaac 900 

tctaattttg aaagaattga aatttgagtt gggtcgaccc ttgccactac acttcttaag 960 
nlfnfZ t a ^gccgggg aggttgatgt tgaacagcac actttagcca agtatttgat 1020 
ggagctgact ctcatcgact atgatatggt gcattatcat ccttctaagg tagcagcagc 1080 
tlTaTc-lllt " 9tCtCa 9 a a ggttctagg acaaggaaaa tggaacttaa agcagcagta 1140 
ttacacagga tacacagaga atgaagtatt ggaagtcatg cagcacatgg ccaagaatgt 1200 
lllllllltn aatg ^ aaact taa ctaaatt catcgccatc aagaataagt atgcaagcag 1260 
caaactcctg aagatcagca tgatccctca gctgaactca aaagccgtca aagaccttgc 1320 
accctttfro tr aggaagg l CCtaggct 9 c cgtgggccct ggggatgtgt gcttcattgt 1380 
tatlllrl t " attggttt Wctcttg attttgtaca tagtcctctg gtctatctca 1440 
t^ttctt^i tctcagacca gttttctaaa catatattga ggaaaaataa agcgattggt 1500 
ttttcttaag gtaaaaaaaa aaaaaaaaaa 1530 



<210> 417 
<211> 543 
<212> PRT 

<213> Homo sapiens 
<400> 417 

Met Ser Arg Glu Ser Asp Val Glu Ala Gln Gln Ser His Gly Ser Ser

1 5 10 15

Ala Cys Ser Gln Pro His Gly Ser Val Thr Gln Ser Gln Gly Ser Ser

20 25 30

Ser Gln Ser Gln Gly Ile Ser Ser Ser Ser Thr Ser Thr Met Pro Asn

35 40 45

Ser Ser Gln Ser Ser His Ser Ser Ser Gly Thr Leu Ser Ser Leu Glu

50 55 60

Thr Val Ser Thr Gln Glu Leu Tyr Ser Ile Pro Glu Asp Gln Glu Pro

65 70 75 80

Glu Asp Gln Glu Pro Glu Glu Pro Thr Pro Ala Pro Trp Ala Arg Leu

85 90 95

Trp Ala Leu Gln Asp Gly Phe Ala Asn Leu Glu Cys Val Asn Asp Asn

100 105 110

Tyr Trp Phe Gly Arg Asp Lys Ser Cys Glu Tyr Cys Phe Asp Glu Pro

115 120 125

Leu Leu Lys Arg Thr Asp Lys Tyr Arg Thr Tyr Ser Lys Lys His Phe

130 135 140

Arg Ile Phe Arg Glu Val Gly Pro Lys Asn Ser Tyr Ile Ala Tyr Ile

145 150 155 160

Glu Asp His Ser Gly Asn Gly Thr Phe Val Asn Thr Glu Leu Val Gly

165 170 175

Lys Gly Lys Arg Arg Pro Leu Asn Asn Asn Ser Glu Ile Ala Leu Ser

180 185 190

Leu Ser Arg Asn Lys Val Phe Val Phe Phe Asp Leu Thr Val Asp Asp

195 200 205

Gln Ser Val Tyr Pro Lys Ala Leu Arg Asp Glu Tyr Ile Met Ser Lys

210 215 220

Thr Leu Gly Ser Gly Ala Cys Gly Glu Val Lys Leu Ala Phe Glu Arg

225 230 235 240

Lys Thr Cys Lys Lys Val Ala Ile Lys Ile Ile Ser Lys Arg Lys Phe

245 250 255

Ala Ile Gly Ser Ala Arg Glu Ala Asp Pro Ala Leu Asn Val Glu Thr

260 265 270

Glu Ile Glu Ile Leu Lys Lys Leu Asn His Pro Cys Ile Ile Lys Ile

275 280 285

Lys Asn Phe Phe Asp Ala Glu Asp Tyr Tyr Ile Val Leu Glu Leu Met

290 295 300

Glu Gly Gly Glu Leu Phe Asp Lys Val Val Gly Asn Lys Arg Leu Lys

305 310 315 320

Glu Ala Thr Cys Lys Leu Tyr Phe Tyr Gln Met Leu Leu Ala Val Gln

325 330 335

Tyr Leu His Glu Asn Gly Ile Ile His Arg Asp Leu Lys Pro Glu Asn

340 345 350

Val Leu Leu Ser Ser Gln Glu Glu Asp Cys Leu Ile Lys Ile Thr Asp

355 360 365

Phe Gly His Ser Lys Ile Leu Gly Glu Thr Ser Leu Met Arg Thr Leu

370 375 380

Cys Gly Thr Pro Thr Tyr Leu Ala Pro Glu Val Leu Val Ser Val Gly

385 390 395 400

Thr Ala Gly Tyr Asn Arg Ala Val Asp Cys Trp Ser Leu Gly Val Ile

405 410 415

Leu Phe Ile Cys Leu Ser Gly Tyr Pro Pro Phe Ser Glu His Arg Thr

420 425 430

Gln Val Ser Leu Lys Asp Gln Ile Thr Ser Gly Lys Tyr Asn Phe Ile

435 440 445

Pro Glu Val Trp Ala Glu Val Ser Glu Lys Ala Leu Asp Leu Val Lys

450 455 460

Lys Leu Leu Val Val Asp Pro Lys Ala Arg Phe Thr Thr Glu Glu Ala

465 470 475 480

Leu Arg His Pro Trp Leu Gln Asp Glu Asp Met Lys Arg Lys Phe Gln

485 490 495

Asp Leu Leu Ser Glu Glu Asn Glu Ser Thr Ala Leu Pro Gln Val Leu

500 505 510

Ala Gln Pro Ser Thr Ser Arg Lys Arg Pro Arg Glu Gly Glu Ala Glu

515 520 525

Gly Ala Glu Thr Thr Lys Arg Pro Ala Val Cys Ala Ala Val Leu

530 535 540



<210> 418 

<211> 2547 

<212> DNA 

<213> Homo sapiens 

<400> 418 

cttacaaggt acagtcctct gctcaggggg gccaggaggg tcttataggc atcattcacc 60 
agggtcgaat gcttctctga gaagtccttt tcagtctgag acctctggct gaagaaatct 120 
gggtggacaa gacgctgcag ttgctggtac ctgtgctgga gcttcgctgt atcaactctg 180
aaggaacggt tgcagtccat aaggctgaag tagtctcgag tggggtcagg tgcctgcagc 240
gctcggcact gtgggcagaa gaacctgtcc tcccgcccgg ggccccatgg gccgccgcag 300
ttccaacagc ggggataatt gcttcccgcc tgcgacgcag catcgcagct tagcggtctc 360
cttctgggaa cccctgtcgg ccaaaacccc cacacccgga gcaaagcccc ggctctcccc 420
cgccacatct ggccggcggc ctatctagcc gtggtcactc gtggggaaaa gcaaagagag 480
cgtctaacca gactaatgtt gctgattggc tggggagtcg agggggcggg atcacccgag 540
gggaacccgg gttctaagtt ccgctctccc ttctaaacta caactcccag gaggcattga 600
ggcggcgcct gacggccaca tctgctgctc ctcattggtc cggcggcagg ggagggggtt 660
ttgattggct gagggtggag tttgtatctg caggtttagc gccactctgc tggctgaggc 720
tgcggagagt gtgcggctcc aggtgggctc acgcggtcgt gatgtctcgg gagtcggatg 780
ttgaggctca gcagtctcat ggcagcagtg cctgttcaca gccccatggc agcgttaccc 840
agtcccaagg ctcctcctca cagtcccagg gcatatccag ctcctctacc agcacgatgc 900
caaactccag ccagtcctct cactccagct ctgggacact gagctcctta gagacagtgt 960
ccactcagga actctattct attcctgagg accaagaacc tgaggaccaa gaacctgagg 1020
agcctacccc tgccccctgg gctcgattat gggcccttca ggatggattt gccaatcttg 1080
aatgtgtgaa tgacaactac tggtttggga gggacaaaag ctgtgaatat tgctttgatg 1140
aaccactgct gaaaagaaca gataaatacc gaacatacag caagaaacac tttcggattt 1200
tcagggaagt gggtcctaaa aactcttaca ttgcatacat agaagatcac agtggcaatg 1260
gaacctttgt aaatacagag cttgtaggga aaggaaaacg ccgtcctttg aataacaatt 1320
ctgaaattgc actgtcacta agcagaaata aagtttttgt cttttttgat ctgactgtag 1380
atgatcagtc agtttatcct aaggcattaa gagatgaata catcatgtca aaaactcttg 1440
gaagtggtgc ctgtggagag gtaaagctgg ctttcgagag gaaaacatgt aagaaagtag 1500
ccataaagat catcagcaaa aggaagtttg ctattggttc agcaagagag gcagacccag 1560
ctctcaatgt tgaaacagaa atagaaattt tgaaaaagct aaatcatcct tgcatcatca 1620
agattaaaaa cttttttgat gcagaagatt attatattgt tttggaattg atggaagggg 1680
gagagctgtt tgacaaagtg gtggggaata aacgcctgaa agaagctacc tgcaagctct 1740
atttttacca gatgctcttg gctgtgcagt accttcatga aaacggtatt atacaccgtg 1800
acttaaagcc agagaatgtt ttactgtcat ctcaagaaga ggactgtctt ataaagatta 1860
ctgattttgg gcactccaag attttgggag agacctctct catgagaacc ttatgtggaa 1920
cccccaccta cttggcgcct gaagttcttg tttctgttgg gactgctggg tataaccgtg 1980
ctgtggactg ctggagttta ggagttattc tttttatctg ccttagtggg tatccacctt 2040
tctctgagca taggactcaa gtgtcactga aggatcagat caccagtgga aaatacaact 2100
tcattcctga agtctgggca gaagtctcag agaaagctct ggaccttgtc aagaagttgt 2160
tggtagtgga tccaaaggca cgttttacga cagaagaagc cttaagacac ccgtggcttc 2220
aggatgaaga catgaagaga aagtttcaag atcttctgtc tgaggaaaat gaatccacag 2280
ctctacccca ggttctagcc cagccttcta ctagtcgaaa gcggccccgt gaaggggaag 2340
ccgagggtgc cgagaccaca aagcgcccag ctgtgtgtgc tgctgtgttg tgaactccgt 2400
ggtttgaaca cgaaagaaat gtaccttctt tcactctgtc atctttcttt tctttgagtc 2460
tgttttttta tagtttgtat tttaattatg ggaataattg ctttttcaca gtcactgatg 2520
tacaattaaa aacctgatgg aacctgg 2547



<210> 419 
<211> 297 
<212> PRT 

<213> Homo sapiens 



<400> 419 
Met Glu Asp Tyr Thr Lys Ile Glu Lys Ile Gly Glu Gly Thr Tyr Gly

1 5 10 15

Val Val Tyr Lys Gly Arg His Lys Thr Thr Gly Gln Val Val Ala Met

20 25 30

Lys Lys Ile Arg Leu Glu Ser Glu Glu Glu Gly Val Pro Ser Thr Ala

35 40 45

Ile Arg Glu Ile Ser Leu Leu Lys Glu Leu Arg His Pro Asn Ile Val

50 55 60

Ser Leu Gln Asp Val Leu Met Gln Asp Ser Arg Leu Tyr Leu Ile Phe

65 70 75 80

Glu Phe Leu Ser Met Asp Leu Lys Lys Tyr Leu Asp Ser Ile Pro Pro

85 90 95

Gly Gln Tyr Met Asp Ser Ser Leu Val Lys Ser Tyr Leu Tyr Gln Ile

100 105 110

Leu Gln Gly Ile Val Phe Cys His Ser Arg Arg Val Leu His Arg Asp

115 120 125

Leu Lys Pro Gln Asn Leu Leu Ile Asp Asp Lys Gly Thr Ile Lys Leu

130 135 140

Ala Asp Phe Gly Leu Ala Arg Ala Phe Gly Ile Pro Ile Arg Val Tyr

145 150 155 160

Thr His Glu Val Val Thr Leu Trp Tyr Arg Ser Pro Glu Val Leu Leu

165 170 175

Gly Ser Ala Arg Tyr Ser Thr Pro Val Asp Ile Trp Ser Ile Gly Thr

180 185 190

Ile Phe Ala Glu Leu Ala Thr Lys Lys Pro Leu Phe His Gly Asp Ser

195 200 205

Glu Ile Asp Gln Leu Phe Arg Ile Phe Arg Ala Leu Gly Thr Pro Asn

210 215 220

Asn Glu Val Trp Pro Glu Val Glu Ser Leu Gln Asp Tyr Lys Asn Thr

225 230 235 240

Phe Pro Lys Trp Lys Pro Gly Ser Leu Ala Ser His Val Lys Asn Leu

245 250 255

Asp Glu Asn Gly Leu Asp Leu Leu Ser Lys Met Leu Ile Tyr Asp Pro

260 265 270

Ala Lys Arg Ile Ser Gly Lys Met Ala Leu Asn His Pro Tyr Phe Asn

275 280 285

Asp Leu Asp Asn Gln Ile Lys Lys Met

290 295

















<210> 420 

<211> 1235 

<212> DNA 

<213> Homo sapiens 



<400> 420 

gggggggggg ggcacttggc ttcaaagctg gctcttggaa attgagcgga gagcgacgcg 60
gttgttgtag ctgccgctgc ggccgccgcg gaataataag ccgggatcta ccatacccat 120
tgactaacta tggaagatta taccaaaata gagaaaattg gagaaggtac ctatggagtt 180
gtgtataagg gtagacacaa aactacaggt caagtggtag ccatgaaaaa aatcagacta 240
gaaagtgaag aggaaggggt tcctagtact gcaattcggg aaatttctct attaaaggaa 300
cttcgtcatc caaatatagt cagtcttcag gatgtgctta tgcaggattc caggttatat 360
ctcatctttg agtttctttc catggatctg aagaaatact tggattctat ccctcctggt 420
cagtacatgg attcttcact tgttaagagt tatttatacc aaatcctaca ggggattgtg 480
ttttgtcact ctagaagagt tcttcacaga gacttaaaac ctcaaaatct cttgattgat 540
gacaaaggaa caattaaact ggctgatttt ggccttgcca gagcttttgg aatacctatc 600
agagtatata cacatgaggt agtaacactc tggtacagat ctccagaagt attgctgggg 660
tcagctcgtt actcaactcc agttgacatt tggagtatag gcaccatatt tgctgaacta 720
gcaactaaga aaccactttt ccatggggat tcagaaattg atcaactctt caggattttc 780
agagctttgg gcactcccaa taatgaagtg tggccagaag tggaatcttt acaggactat 840
aagaatacat ttcccaaatg gaaaccagga agcctagcat cccatgtcaa aaacttggat 900
gaaaatggct tggatttgct ctcgaaaatg ttaatctatg atccagccaa acgaatttct 960
ggcaaaatgg cactgaatca tccatatttt aatgatttgg acaatcagat taagaagatg 1020
tagctttctg acaaaaagtt tccatatgtt atgtcaacag atagttgtgt ttttattgtt 1080
aactcttgtc tatttttgtc ttatatatat ttctttgtta tcaaacttca gctgtacttc 1140
gtcttctaat ttcaaaaata taacttaaaa atgtaaatat tctatatgaa tttaaatata 1200
attctgtaaa tgtgaaaaaa aaaaaaaaaa aaaaa 1235



<210> 421 
<211> 240 
<212> PRT 
<213> Homo sapiens 



<400> 421 
Met Glu Asp Tyr Thr Lys Ile Glu Lys Ile Gly Glu Gly Thr Tyr Gly

1 5 10 15

Val Val Tyr Lys Gly Arg His Lys Thr Thr Gly Gln Val Val Ala Met

20 25 30

Lys Lys Ile Arg Leu Glu Ser Glu Glu Glu Gly Val Pro Ser Thr Ala

35 40 45

Ile Arg Glu Ile Ser Leu Leu Lys Glu Leu Arg His Pro Asn Ile Val

50 55 60

Ser Leu Gln Asp Val Leu Met Gln Asp Ser Arg Leu Tyr Leu Ile Phe

65 70 75 80

Glu Phe Leu Ser Met Asp Leu Lys Lys Tyr Leu Asp Ser Ile Pro Pro

85 90 95

Gly Gln Tyr Met Asp Ser Ser Leu Val Lys Val Val Thr Leu Trp Tyr

100 105 110

Arg Ser Pro Glu Val Leu Leu Gly Ser Ala Arg Tyr Ser Thr Pro Val

115 120 125

Asp Ile Trp Ser Ile Gly Thr Ile Phe Ala Glu Leu Ala Thr Lys Lys

130 135 140

Pro Leu Phe His Gly Asp Ser Glu Ile Asp Gln Leu Phe Arg Ile Phe

145 150 155 160

Arg Ala Leu Gly Thr Pro Asn Asn Glu Val Trp Pro Glu Val Glu Ser

165 170 175

Leu Gln Asp Tyr Lys Asn Thr Phe Pro Lys Trp Lys Pro Gly Ser Leu

180 185 190

Ala Ser His Val Lys Asn Leu Asp Glu Asn Gly Leu Asp Leu Leu Ser

195 200 205

Lys Met Leu Ile Tyr Asp Pro Ala Lys Arg Ile Ser Gly Lys Met Ala

210 215 220

Leu Asn His Pro Tyr Phe Asn Asp Leu Asp Asn Gln Ile Lys Lys Met

225 230 235 240



<210> 422 
<211> 948 
<212> DNA 

<213> Homo sapiens 
<400> 422 

ccattgacta actatggaag attataccaa aatagagaaa attggagaag gtacctatgg 60
agttgtgtat aagggtagac acaaaactac aggtcaagtg gtagccatga aaaaaatcag 120
actagaaagt gaagaggaag gggttcctag tactgcaatt cgggaaattt ctctattaaa 180
ggaacttcgt catccaaata tagtcagtct tcaggatgtg cttatgcagg attccaggtt 240
atatctcatc tttgagtttc tttccatgga tctgaagaaa tacttggatt ctatccctcc 300
tggtcagtac atggattctt cacttgttaa ggtagtaaca ctctggtaca gatctccaga 360
agtattgctg gggtcagctc gttactcaac tccagttgac atttggagta taggcaccat 420
atttgctgaa ctagcaacta agaaaccact tttccatggg gattcagaaa ttgatcaact 480
cttcaggatt ttcagagctt tgggcactcc caataatgaa gtgtggccag aagtggaatc 540
tttacaggac tataagaata catttcccaa atggaaacca ggaagcctag catcccatgt 600
caaaaacttg gatgaaaatg gcttggattt gctctcgaaa atgttaatct atgatccagc 660
caaacgaatt tctggcaaaa tggcactgaa tcatccatat tttaatgatt tggacaatca 720
gattaagaag atgtagcttt ctgacaaaaa gtttccatat gttatgtcaa cagatagttg 780
tgtttttatt gttaactctt gtctattttt gtcttatata tatttctttg ttatcaaact 840
tcagctgtac ttcgtcttct aatttcaaaa atataactta aaaatgtaaa tattctatat 900
gaatttaaat ataattctgt aaatgtgaaa aaaaaaaaaa aaaaaaaa 948

<210> 423
<211> 433
<212> PRT

<213> Homo sapiens
<400> 423

Met Ala Leu Arg Val Thr Arg Asn Ser Lys Ile Asn Ala Glu Asn Lys
1 5 10 15

Ala Lys Ile Asn Met Ala Gly Ala Lys Arg Val Pro Thr Ala Pro Ala
20 25 30

Ala Thr Ser Lys Pro Gly Leu Arg Pro Arg Thr Ala Leu Gly Asp Ile
35 40 45

Gly Asn Lys Val Ser Glu Gln Leu Gln Ala Lys Met Pro Met Lys Lys
50 55 60

Glu Ala Lys Pro Ser Ala Thr Gly Lys Val Ile Asp Lys Lys Leu Pro
65 70 75 80

Lys Pro Leu Glu Lys Val Pro Met Leu Val Pro Val Pro Val Ser Glu
85 90 95

Pro Val Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Val Lys Glu
100 105 110

Glu Lys Leu Ser Pro Glu Pro Ile Leu Val Asp Thr Ala Ser Pro Ser
115 120 125

Pro Met Glu Thr Ser Gly Cys Ala Pro Ala Glu Glu Asp Leu Cys Gln
130 135 140

Ala Phe Ser Asp Val Ile Leu Ala Val Asn Asp Val Asp Ala Glu Asp
145 150 155 160

Gly Ala Asp Pro Asn Leu Cys Ser Glu Tyr Val Lys Asp Ile Tyr Ala
165 170 175

Tyr Leu Arg Gln Leu Glu Glu Glu Gln Ala Val Arg Pro Lys Tyr Leu
180 185 190

Leu Gly Arg Glu Val Thr Gly Asn Met Arg Ala Ile Leu Ile Asp Trp
195 200 205

Leu Val Gln Val Gln Met Lys Phe Arg Leu Leu Gln Glu Thr Met Tyr
210 215 220

Met Thr Val Ser Ile Ile Asp Arg Phe Met Gln Asn Asn Cys Val Pro
225 230 235 240

Lys Lys Met Leu Gln Leu Val Gly Val Thr Ala Met Phe Ile Ala Ser
245 250 255

Lys Tyr Glu Glu Met Tyr Pro Pro Glu Ile Gly Asp Phe Ala Phe Val
260 265 270

Thr Asp Asn Thr Tyr Thr Lys His Gln Ile Arg Gln Met Glu Met Lys
275 280 285

Ile Leu Arg Ala Leu Asn Phe Gly Leu Gly Arg Pro Leu Pro Leu His
290 295 300

Phe Leu Arg Arg Ala Ser Lys Ile Gly Glu Val Asp Val Glu Gln His
305 310 315 320

Thr Leu Ala Lys Tyr Leu Met Glu Leu Thr Met Leu Asp Tyr Asp Met
325 330 335

Val His Phe Pro Pro Ser Gln Ile Ala Ala Gly Ala Phe Cys Leu Ala
340 345 350

Leu Lys Ile Leu Asp Asn Gly Glu Trp Thr Pro Thr Leu Gln His Tyr
355 360 365

Leu Ser Tyr Thr Glu Glu Ser Leu Leu Pro Val Met Gln His Leu Ala
370 375 380

Lys Asn Val Val Met Val Asn Gln Gly Leu Thr Lys His Met Thr Val
385 390 395 400

Lys Asn Lys Tyr Ala Thr Ser Lys His Ala Lys Ile Ser Thr Leu Pro
405 410 415

Gln Leu Asn Ser Ala Leu Val Gln Asp Leu Ala Lys Ala Val Ala Lys
420 425 430

Val



<210> 424
<211> 2101
<212> DNA
<213> Homo sapiens
<400> 424

acgaacaggc caataaggag ggagcagtgc ggggtttaaa tctgaggcta ggctggctct 60 
tctcggcgtg ctgcggcgga acggctgttg gtttctgctg ggtgtaggtc cttggctggt 120 
cgggcctccg gtgttctgct tctccccgct gagctgctgc ctggtgaaga ggaagccatg 180 
gcgctccgag tcaccaggaa ctcgaaaatt aatgctgaaa ataaggcgaa gatcaacatg 240 
gcaggcgcaa agcgcgttcc tacggcccct gctgcaacct ccaagcccgg actgaggcca 300 
agaacagctc ttggggacat tggtaacaaa gtcagtgaac aactgcaggc caaaatgcct 360 
atgaagaagg aagcaaaacc ttcagctact ggaaaagtca ttgataaaaa actaccaaaa 420 
cctcttgaaa aggtacctat gctggtgcca gtgccagtgt ctgagccagt gccagagcca 480 
gaacctgagc cagaacctga gcctgttaaa gaagaaaaac tttcgcctga gcctattttg 540 
gttgatactg cctctccaag cccaatggaa acatctggat gtgcccctgc agaagaagac 600 
ctgtgtcagg ctttctctga tgtaattctt gcagtaaatg atgtggatgc agaagatgga 660 
gctgatccaa acctttgtag tgaatatgtg aaagatattt atgcttatct gagacaactt 720 
gaggaagagc aagcagtcag accaaaatac ctactgggtc gggaagtcac tggaaacatg 780 
agagccatcc taattgactg gctagtacag gttcaaatga aattcaggtt gttgcaggag 840 
accatgtaca tgactgtctc cattattgat cggttcatgc agaataattg tgtgcccaag 900 
aagatgctgc agctggttgg tgtcactgcc atgtttattg caagcaaata tgaagaaatg 960 
taccctccag aaattggtga ctttgctttt gtgactgaca acacttatac taagcaccaa 1020 
atcagacaga tggaaatgaa gattctaaga gctttaaact ttggtctggg tcggcctcta 1080 
cctttgcact tccttcggag agcatctaag attggagagg ttgatgtcga gcaacatact 1140 
ttggccaaat acctgatgga actaactatg ttggactatg acatggtgca ctttcctcct 1200 
tctcaaattg cagcaggagc tttttgctta gcactgaaaa ttctggataa tggtgaatgg 1260 
acaccaactc tacaacatta cctgtcatat actgaagaat ctcttcttcc agttatgcag 1320
cacctggcta agaatgtagt catggtaaat caaggactta caaagcacat gactgtcaag 1380 
aacaagtatg ccacatcgaa gcatgctaag atcagcactc taccacagct gaattctgca 1440 
ctagttcaag atttagccaa ggctgtggca aaggtgtaac ttgtaaactt gagttggagt 1500 
actatattta caaataaaat tggcaccatg tgccatctgt acatattact gttgcattta 1560 




tgaaaatttt acttccatag gacatactgc atgtaagcca agtcatggag aatctgctgc 1860 
atagctctat tttaaagtaa aagtctacca ccgaatccct agtccccctg ttttctgttt 1920 
cttcttgtga ttgctgccat aattctaagt tatttacttt taccactatt taagttatca 1980 
actttagcta gtatcttcaa actttcactt tgaaaaatga gaattttata ttctaagcca 2040 
gttttcattt tggttttgtg ttttggttaa taaaacaata ctcaaataca aaaaaaaaaa 2100 
a 2101



<210> 425 
<211> 665 
<212> PRT 

<213> Homo sapiens 



<400> 425 
Met Ala Ala Gly Gly Ser Thr Gln Gln Arg Arg Arg Glu Met Ala Ala
1 5 10 15

Ala Ser Ala Ala Ala Ile Ser Gly Ala Gly Arg Cys Arg Leu Ser Lys
20 25 30

Ile Gly Ala Thr Arg Arg Pro Pro Pro Ala Arg Val Arg Val Ala Val
35 40 45

Arg Leu Arg Pro Phe Val Asp Gly Thr Ala Gly Ala Ser Asp Pro Pro
50 55 60

Cys Val Arg Gly Met Asp Ser Cys Ser Leu Glu Ile Ala Asn Trp Arg
65 70 75 80

Asn His Gln Glu Thr Leu Lys Tyr Gln Phe Asp Ala Phe Tyr Gly Glu
85 90 95

Arg Ser Thr Gln Gln Asp Ile Tyr Ala Gly Ser Val Gln Pro Ile Leu
100 105 110

Arg His Leu Leu Glu Gly Gln Asn Ala Ser Val Leu Ala Tyr Gly Pro
115 120 125

Thr Gly Ala Gly Lys Thr His Thr Met Leu Gly Ser Pro Glu Gln Pro
130 135 140

Gly Val Ile Pro Arg Ala Leu Met Asp Leu Leu Gln Leu Thr Arg Glu
145 150 155 160

Glu Gly Ala Glu Gly Arg Pro Trp Ala Leu Ser Val Thr Met Ser Tyr
165 170 175

Leu Glu Ile Tyr Gln Glu Lys Val Leu Asp Leu Leu Asp Pro Ala Ser
180 185 190

Gly Asp Leu Val Ile Arg Glu Asp Cys Arg Gly Asn Ile Leu Ile Pro
195 200 205

Gly Leu Ser Gln Lys Pro Ile Ser Ser Phe Ala Asp Phe Glu Arg His
210 215 220

Phe Leu Pro Ala Ser Arg Asn Arg Thr Val Gly Ala Thr Arg Leu Asn
225 230 235 240

Gln Arg Ser Ser Arg Ser His Ala Val Leu Leu Val Lys Val Asp Gln
245 250 255

Arg Glu Arg Leu Ala Pro Phe Arg Gln Arg Glu Gly Lys Leu Tyr Leu
260 265 270

Ile Asp Leu Ala Gly Ser Glu Asp Asn Arg Arg Thr Gly Asn Lys Gly
275 280 285

Leu Arg Leu Lys Glu Ser Gly Ala Ile Asn Thr Ser Leu Phe Val Leu
290 295 300

Gly Lys Val Val Asp Ala Leu Asn Gln Gly Leu Pro Arg Val Pro Tyr
305 310 315 320

Arg Asp Ser Lys Leu Thr Arg Leu Leu Gln Asp Ser Leu Gly Gly Ser
325 330 335

Ala His Ser Ile Leu Ile Ala Asn Ile Ala Pro Glu Arg Arg Phe Tyr
340 345 350

Leu Asp Thr Val Ser Ala Leu Asn Phe Ala Ala Arg Ser Lys Glu Val
355 360 365

Ile Asn Arg Pro Phe Thr Asn Glu Ser Leu Gln Pro His Ala Leu Gly
370 375 380

Pro Val Lys Leu Ser Gln Lys Glu Leu Leu Gly Pro Pro Glu Ala Lys
385 390 395 400

Arg Ala Arg Gly Pro Glu Glu Glu Glu Ile Gly Ser Pro Glu Pro Met
405 410 415

Ala Ala Pro Ala Ser Ala Ser Gln Lys Leu Ser Pro Leu Gln Lys Leu
420 425 430

Ser Ser Met Asp Pro Ala Met Leu Glu Arg Leu Leu Ser Leu Asp Arg
435 440 445

Leu Leu Ala Ser Gln Gly Ser Gln Gly Ala Pro Leu Leu Ser Thr Pro
450 455 460

Lys Arg Glu Arg Met Val Leu Met Lys Thr Val Glu Glu Lys Asp Leu
465 470 475 480

Glu Ile Glu Arg Leu Lys Thr Lys Gln Lys Glu Leu Glu Ala Lys Met
485 490 495

Leu Ala Gln Lys Ala Glu Glu Lys Glu Asn His Cys Pro Thr Met Leu
500 505 510

Arg Pro Leu Ser His Arg Thr Val Thr Gly Ala Lys Pro Leu Lys Lys
515 520 525

Ala Val Val Met Pro Leu Gln Leu Ile Gln Glu Gln Ala Ala Ser Pro
530 535 540

Asn Ala Glu Ile His Ile Leu Lys Asn Lys Gly Arg Lys Arg Lys Leu
545 550 555 560

Glu Ser Leu Asp Ala Leu Glu Pro Glu Glu Lys Ala Glu Asp Cys Trp
565 570 575

Glu Leu Gln Ile Ser Pro Glu Leu Leu Ala His Gly Arg Gln Lys Ile
580 585 590

Leu Asp Leu Leu Asn Glu Gly Ser Ala Arg Asp Leu Arg Ser Leu Gln
595 600 605

Arg Ile Gly Pro Lys Lys Ala Gln Leu Ile Val Gly Trp Arg Glu Leu
610 615 620

His Gly Pro Phe Ser Gln Val Glu Asp Leu Glu Arg Val Glu Gly Ile
625 630 635 640

Thr Gly Lys Gln Met Glu Ser Phe Leu Lys Ala Asn Ile Leu Gly Leu
645 650 655

Ala Ala Gly Gln Arg Cys Gly Ala Ser
660 665



<210> 426 

<211> 2097 

<212> DNA 

<213> Homo sapiens 



<400> 426 

gggcgggccc aaggagggag tggaatggcc gcgggcggct cgacgcagca gaggcgacgc 60
gagatggcgg cagcttcagc ggcggcgatc tcaggagctg gtcgctgtcg gctaagcaag 120
attggagcta ctcgtcgtcc acctccagct cgcgtaaggg tggctgtgcg actgcggcca 180
tttgtggatg gaacagcggg agcaagtgat cccccctgtg tgcggggcat ggacagctgc 240
tctctagaga ttgctaactg gaggaaccac caggagactc tcaaatacca gtttgatgcc 300
ttctatgggg agaggagtac tcagcaggac atctatgcag gttcagtgca gcccatccta 360
aggcacttgc tggaagggca gaatgccagt gtgcttgcct atggacccac aggagctggg 420
aagacgcaca caatgctggg cagcccagag caacctgggg tgatcccgcg ggctctcatg 480
gacctcctgc agctcacaag ggaggagggt gccgagggcc ggccatgggc cctttctgtc 540
accatgtctt acctagagat ctaccaggag aaggtattag acctcctgga ccctgcttcg 600
ggagacctgg taatccgaga agactgccgg gggaatatcc tgattccggg tctctcccag 660
aagcccatca gtagctttgc tgattttgag cggcacttcc tgccagccag tcgaaatcgg 720
actgtaggag ccacccggct caaccagcgc tcctcccgca gtcatgctgt gctcctggtc 780
aaggtggacc agcgggaacg tttggcccca tttcgccagc gagagggaaa actctacctg 840
attgacttgg ctgggtcaga ggacaaccgg cgcacaggca acaagggcct tcggctaaaa 900
gagagtggag ccatcaacac ctccctgttt gtcctgggca aagtggtaga tgcgctgaat 960
cagggcctcc ctcgtgtacc ttatcgggac agcaagctca ctcgcctatt gcaggactct 1020
ctgggtggct cagcccacag tatccttatt gccaacattg cccctgagag acgcttctac 1080
ctagacacag tctccgcact caactttgct gccaggtcca aggaggtgat caatcggcct 1140
tttaccaatg agagcctgca gcctcatgcc ttgggacctg ttaagctgtc tcagaaagaa 1200
ttgcttggtc caccagaggc aaagagagcc cgaggccctg aggaagagga gattgggagc 1260
cctgagccca tggcagctcc agcctctgcc tcccagaaac tcagccccct acagaagcta 1320
agcagcatgg acccggccat gctggagcgc ctcctcagct tggaccgtct gcttgcctcc 1380
caggggagcc agggggcccc tctgttgagt accccaaagc gagagcggat ggtgctaatg 1440
aagacagtag aagagaagga cctagagatt gagaggctta agacgaagca aaaagaactg 1500
gaggccaaga tgttggccca gaaggctgag gaaaaggaga accattgtcc cacaatgctc 1560
cggccccttt cacatcgcac agtcacaggg gcaaagcccc tgaaaaaggc tgtggtgatg 1620
cccctacagc taattcagga gcaggcagca tccccaaatg ccgagatcca catcctgaag 1680
aataaaggcc ggaagagaaa gctggagtcc ctggatgccc tagagcctga ggagaaggct 1740
gaggactgct gggagctaca gatcagcccg gagctactgg ctcatgggcg ccaaaaaata 1800
ctggatctgc tgaacgaagg ctcagcccga gatctccgca gtcttcagcg cattggcccg 1860
aagaaggccc agctaatcgt gggctggcgg gagctccacg gccccttcag ccaggtggag 1920
gacctggaac gcgtggaggg cataacgggg aaacagatgg agtccttcct gaaggcaaac 1980
atcctgggtc tcgccgccgg ccagcgctgt ggcgcctcct gaccgtcgtc tcctcactcc 2040
gccttttcaa atttttgtat aaccccgtgt tgtgtaaata cagtttttgc tccggtg 2097



<210> 427 
<211> 209 
<212> PRT 
<213> Homo sapiens

<400> 427 

Met Asn Pro Ser Met Lys Gln Lys Gln Glu Glu Ile Lys Glu Asn Ile
1 5 10 15

Lys Asn Ser Ser Val Pro Arg Arg Thr Leu Lys Met Ile Gln Pro Ser
20 25 30

Ala Ser Gly Ser Leu Val Gly Arg Glu Asn Glu Leu Ser Ala Gly Leu
35 40 45

Ser Lys Arg Lys His Arg Asn Asp His Leu Thr Ser Thr Thr Ser Ser
50 55 60

Pro Gly Val Ile Val Pro Glu Ser Ser Glu Asn Lys Asn Leu Gly Gly
65 70 75 80

Val Thr Gln Glu Ser Phe Asp Leu Met Ile Lys Glu Asn Pro Ser Ser
85 90 95

Gln Tyr Trp Lys Glu Val Ala Glu Lys Arg Arg Lys Ala Leu Tyr Glu
100 105 110

Ala Leu Lys Glu Asn Glu Lys Leu His Lys Glu Ile Glu Gln Lys Asp
115 120 125

Asn Glu Ile Ala Arg Leu Lys Lys Glu Asn Lys Glu Leu Ala Glu Val
130 135 140

Ala Glu His Val Gln Tyr Met Ala Glu Leu Ile Glu Arg Leu Asn Gly
145 150 155 160

Glu Pro Leu Asp Asn Phe Glu Ser Leu Asp Asn Gln Glu Phe Asp Ser
165 170 175

Glu Glu Glu Thr Val Glu Asp Ser Leu Val Glu Asp Ser Glu Ile Gly
180 185 190

Thr Cys Ala Glu Gly Thr Val Ser Ser Ser Thr Asp Ala Lys Pro Cys
195 200 205

Ile

<210> 428
<211> 1224
<212> DNA
<213> Homo sapiens
<400> 428

gttcggagcg ggcgagcgga gttagcaggg ctttactgca gagcgcgccg ggcactccag 60
cgaccgtggg gatcagcgta ggtgagctgt ggccttttgc gaggtgctgc agccatagct 120
acgtgcgttc gctacgagga ttgagcgtct ccacccagta agtgggcaag aggcggcagg 180
aagtgggtac gcaggggcgc aaggcgcaca gcctctagac gactcgcttt ccctccggcc 240
aacctctgaa gccgcgtcct actttgacag ctgcagggcc gcggcctggt cttctgtgct 300
tcaccatcta cataatgaat cccagtatga agcagaaaca agaagaaatc aaagagaata 360
taaagaatag ttctgtccca agaagaactc tgaagatgat tcagccttct gcatctggat 420
ctcttgttgg aagagaaaat gagctgtccg caggcttgtc caaaaggaaa catcggaatg 480
accacttaac atctacaact tccagccctg gggttattgt cccagaatct agtgaaaata 540
aaaatcttgg aggagtcacc caggagtcat ttgatcttat gattaaagaa aatccatcct 600
ctcagtattg gaaggaagtg gcagaaaaac ggagaaaggc gctgtatgaa gcacttaagg 660
aaaatgagaa acttcataaa gaaattgaac aaaaggacaa tgaaattgcc cgcctgaaaa 720
aggagaataa agaactggca gaagtagcag aacatgtaca gtatatggca gagctaatag 780
agagactgaa tggtgaacct ctggataatt ttgaatcact ggataatcag gaatttgatt 840
ctgaagaaga aactgttgag gattctctag tggaagactc agaaattggc acgtgtgctg 900
aaggaactgt atcttcctct acggatgcaa agccatgtat atgaaatgca ttaatatttg 960
actgttgaga attttactgc cgaagtttac ctccactagt tctttgtagc agagtacata 1020
actacataat gccaactctg gaatcaaatt tccttgtttg aatcctggga ccctattgca 1080
ttaaagtaca aatactatgt atttttaatc tatgatggtt tatgtgaata ggattttctc 1140
agttgtcagc catgacttat gtttattact aaataaactt caaactcctg tggaaaaaaa 1200
aaaaaaaaaa aaaaaaaaaa aaaa 1224

<210> 429
<211> 79
<212> PRT
<213> Homo sapiens
<400> 429



Met Ala His Lys Gln Ile Tyr Tyr Ser Asp Lys Tyr Phe Asp Glu His
1 5 10 15

Tyr Glu Tyr Arg His Val Met Leu Pro Arg Glu Leu Ser Lys Gln Val
20 25 30

Pro Lys Thr His Leu Met Ser Glu Glu Glu Trp Arg Arg Leu Gly Val
35 40 45

Gln Gln Ser Leu Gly Trp Val His Tyr Met Ile His Glu Pro Glu Pro
50 55 60

His Ile Leu Leu Phe Arg Arg Pro Leu Pro Lys Asp Gln Gln Lys
65 70 75



<210> 430 
<211> 627 
<212> DNA 
<213> Homo sapiens 



<400> 430 

agtctccggc gagttgttgc ctgggctgga cgtggttttg tctgctgcgc ccgctcttcg 60
cgctctcgtt tcattttctg cagcgcgcca cgaggatggc ccacaagcag atctactact 120
cggacaagta cttcgacgaa cactacgagt accggcatgt tatgttaccc agagaacttt 180
ccaaacaagt acctaaaact catctgatgt ctgaagagga gtggaggaga cttggtgtcc 240
aacagagtct aggctgggtt cattacatga ttcatgagcc agaaccacat attcttctct 300
ttagacgacc tcttccaaaa gatcaacaaa aatgaagttt atctggggat cgtcaaatct 360
ttttcaaatt taatgtatat gtgtatataa ggtagtattc agtgaatact tgagaaatgt 420
acaaatcttt catccatacc tgtgcatgag ctgtattctt cacagcaaca gagctcagtt 480
aaatgcaact gcaagtaggt tactgtaaga tgtttaagat aaaagttctt ccagtcagtt 540
tttctcttaa gtgcctgttt gagtttactg aaacagttta cttttgttca ataaagtttg 600
tatgttgcat ttaaaaaaaa aaaaaaa 627



<210> 431 
<211> 620 
<212> PRT 

<213> Homo sapiens 



<400> 431 
Met Arg Arg Ser Glu Val Leu Ala Glu Glu Ser Ile Val Cys Leu Gln
1 5 10 15

Lys Ala Leu Asn His Leu Arg Glu Ile Trp Glu Leu Ile Gly Ile Pro
20 25 30

Glu Asp Gln Arg Leu Gln Arg Thr Glu Val Val Lys Lys His Ile Lys
35 40 45

Glu Leu Leu Asp Met Met Ile Ala Glu Glu Glu Ser Leu Lys Glu Arg
50 55 60

Leu Ile Lys Ser Ile Ser Val Cys Gln Lys Glu Leu Asn Thr Leu Cys
65 70 75 80

Ser Glu Leu His Val Glu Pro Phe Gln Glu Glu Gly Glu Thr Thr Ile
85 90 95

Leu Gln Leu Glu Lys Asp Leu Arg Thr Gln Val Glu Leu Met Arg Lys
100 105 110

Gln Lys Lys Glu Arg Lys Gln Glu Leu Lys Leu Leu Gln Glu Gln Asp
115 120 125

Gln Glu Leu Cys Glu Ile Leu Cys Met Pro His Tyr Asp Ile Asp Ser
130 135 140

Ala Ser Val Pro Ser Leu Glu Glu Leu Asn Gln Phe Arg Gln His Val
145 150 155 160

Thr Thr Leu Arg Glu Thr Lys Ala Ser Arg Arg Glu Glu Phe Val Ser
165 170 175

Ile Lys Arg Gln Ile Ile Leu Cys Met Glu Glu Leu Asp His Thr Pro
180 185 190

Asp Thr Ser Phe Glu Arg Asp Val Val Cys Glu Asp Glu Asp Ala Phe
195 200 205

Cys Leu Ser Leu Glu Asn Ile Ala Thr Leu Gln Lys Leu Leu Arg Gln
210 215 220

Leu Glu Met Gln Lys Ser Gln Asn Glu Ala Val Cys Glu Gly Leu Arg
225 230 235 240

Thr Gln Ile Arg Glu Leu Trp Asp Arg Leu Gln Ile Pro Glu Glu Glu
245 250 255

Arg Glu Ala Val Ala Thr Ile Met Ser Gly Ser Lys Ala Lys Val Arg
260 265 270

Lys Ala Leu Gln Leu Glu Val Asp Arg Leu Glu Glu Leu Lys Met Gln
275 280 285

Asn Met Lys Lys Val Ile Glu Ala Ile Arg Val Glu Leu Val Gln Tyr
290 295 300

Trp Asp Gln Cys Phe Tyr Ser Gln Glu Gln Arg Gln Ala Phe Ala Pro
305 310 315 320

Phe Cys Ala Glu Asp Tyr Thr Glu Ser Leu Leu Gln Leu His Asp Ala
325 330 335

Glu Ile Val Arg Leu Lys Asn Tyr Tyr Glu Val His Lys Glu Leu Phe
340 345 350

Glu Gly Val Gln Lys Trp Glu Glu Thr Trp Arg Leu Phe Leu Glu Phe
355 360 365

Glu Arg Lys Ala Ser Asp Pro Asn Arg Phe Thr Asn Arg Gly Gly Asn
370 375 380

Leu Leu Lys Glu Glu Lys Gln Arg Ala Lys Leu Gln Lys Met Leu Pro
385 390 395 400

Lys Leu Glu Glu Glu Leu Lys Ala Arg Ile Glu Leu Trp Glu Gln Glu
405 410 415

His Ser Lys Ala Phe Met Val Asn Gly Gln Lys Phe Met Glu Tyr Val
420 425 430

Ala Glu Gln Trp Glu Met His Arg Leu Glu Lys Glu Arg Ala Lys Gln
435 440 445

Glu Arg Gln Leu Lys Asn Lys Lys Gln Thr Glu Thr Glu Met Leu Tyr
450 455 460

Gly Ser Ala Pro Arg Thr Pro Ser Lys Arg Arg Gly Leu Ala Pro Asn
465 470 475 480

Thr Pro Gly Lys Ala Arg Lys Leu Asn Thr Thr Thr Met Ser Asn Ala
485 490 495

Thr Ala Asn Ser Ser Ile Arg Pro Ile Phe Gly Gly Thr Val Tyr His
500 505 510

Ser Pro Val Ser Arg Leu Pro Pro Ser Gly Ser Lys Pro Val Ala Ala
515 520 525

Ser Thr Cys Ser Gly Lys Lys Thr Pro Arg Thr Gly Arg His Gly Ala
530 535 540

Asn Lys Glu Asn Leu Glu Leu Asn Gly Ser Ile Leu Ser Gly Gly Tyr
545 550 555 560

Pro Gly Ser Ala Pro Leu Gln Arg Asn Phe Ser Ile Asn Ser Val Ala
565 570 575

Ser Thr Tyr Ser Glu Phe Ala Lys Asp Pro Ser Leu Ser Asp Ser Ser
580 585 590

Thr Val Gly Leu Gln Arg Glu Leu Ser Lys Ala Ser Lys Ser Asp Ala
595 600 605

Thr Ser Gly Ile Leu Asn Ser Thr Asn Ile Gln Ser
610 615 620

<210> 432
<211> 3044
<212> DNA
<213> Homo sapiens

<400> 432
S'"!f 999t ^ttgttgct ctcggggccg tgtggagtag gtctggacct ggactcacgg 60
SSLTo c 9tccgccat gaggagaagt gaggtgctgg cggaggagtc catagtatgt 120 
' ^ 9 ccctaaatca ccttcgggaa atatgggagc taattgggat tccagaggac 180 
attaclalt aaagaaCtga ggtggtaaag aagcatatca aggaactcct ggatatgatg 240 
aaoctaaao 9 aggaaagCCt S^aggaaaga ctcatcaaaa gcatatccgt ctgtcagaaa 300 
acScttcc tT,l 9 ^ Cgagttacat gttgagccat ttcaggaaga aggagagacg 360 
S aactagaaaa agatttgcgc acccaagtgg aattgatgcg aaaacagaaa 420 
ctttalltn. aaCaggaaCt gaagctactt caagagcaag atcaagaact gtgcgaaatt 480 
^"^ atgC cccacta tga tattgacagt gcctcagtgc ccagcttaga agagctgaac 540 
" 9 " C3g9C aacat 9tgac aactttgagg gaaacaaagg cttctaggcg tgaggagttt 600 
laaTtalT, agag ! Cagat -tactgtgt atggaagaat tagaccacac ccca acaca 660 
agctttgaaa gagatgtggt gtgtgaagac gaagatgcct tttgtttgtc tttggagaat 720 
attgcaacac tacaaaagtt gctacggcag ctggaaatgc agaaatcaca aaatgaagca 780 
gtgtgtgagg ggctgcgtac tcaaatccga gagctctggg acaggttgca aatacctgaa 840 
gaagaaagag aagctgtggc caccattatg tctgggtcaa aggccaaggt ccggaaagcg 900 
ctgcaattag aagtggatcg gttggaagaa ctgaaaatgc aaaacatgaa gaaagtgatt 960 
gaggcaattc gagtggagct ggttcagtac tgggaccagt gcttttatag ccaggagcag 1020 
S " gcccct " ctgtgctgag gactacacag aaagtctgc? ccagctccac JSIS 
qtccaoaaot aaZlT" aaaaaactac tatgaagttc acaaggaact ctttgaaggt 1140 
L; ca tf 9a t g Ctggaggctt ttcttagagt ttgagagaaa agcttcagat 1200 

crccaqafaa f' 9 ag ? a ^ aaat cttctaaaag aagaaaaaca acgagccaag 1260 

ctccagaaaa tgctgcccaa gctggaagaa gagttgaagg cacgaattga attqtqoaaa 1320 
caggaacatt caaaggcatt tatggtgaat gggcagaaa? tcatggagta tgtggcagaa 138^ 
aaaaaacaaa tgCatCgatt 99-gaaagag agagccaagc aggaaagaca actgaagaaS 1440 
aaaaaacaga cagagacaga gatgctgtat ggcagcgctc ctcgaacacc tagcaagcgq 1500 
cgaggactgg ctcccaatac accgggcaaa gcacgtaagc tgaacactac caccatgtcc 1560 

gtg^c cglc TtllllZil 9 r ttCg9CCt atCtttggag ^ a cagtcta ccactccccc 1620 
gtgtctcgac ttcctccttc tggcagcaag ccagtcgctg cttccacctg ttcagggaaq 1680 

atcctaaoto 9taCtggC3g g -t99agcc aacaaggaga acctggagc? caacggS^ 7^0 
atcctgagtg gtgggtaccc tggctcggcc cccctccagc gcaacttcag cattaattct 1800 
9 " 9 " agCa cctattc tga gtttgcgaag gatccgtccc tctctgacag ttccactgS 1860 
gggcttcagc gagaactttc aaaggcttcc aaatctgatg ctacttctgg aatcctcaat 1920 

ta^ctoqlc ctaST? agaagCCCtg -tcagtcaac cagctgtggc ttcctg^gcc llll 
tagactggac ctaattatat gggggtgact ttagtttttc ttcagcttag gagtgcttga 2040 
aaccttggcc aggttccatg accatgggcc taacttaaag atgtgaatga gtgttacag? 2100 
aa^St™ ItT '"J" agtggtCCta ^-^^tgg ttttgactL Ltacatgaa 2160 
aag "^ tgg caa 9 a agtgc aaattttagc atatggggcc tgacttctct accacataat 2220 
tctacttgct gaagcatgat caaagcttgt tttatttcac cactgtagga aaatgattga 2280 
ttStttao oS g r gt aattttggCa tgtatacctg taactag'aa ttaacatm 234u 
c«a?ccccl g ^ tgttCaa "aatgctgt agctatcata gctttgctct tacctgaagc 2400 
ttoaaataoo ^ aCaCagga c agccttcct cctgaagaga atgtctttgt gtgtccgaag 2460 
aJta?at?ca T,Tt ,1 ggCaaagagg tgacaggaag gctgggagca gctttgttaa 2520 
acacaSct? lllnll a " gtgcatt 9«ctttgtt gggggtatgc atgtatgaac 2580 

! t C " ^ ^ggaacgc tttctcggcg tttgtccctt ggctctcatc tcccccattc 2640 
ctgtgcctac tttgcctgag ttcttctacc cccgcagttg ccagccagat tgggagtctg 2700 
tttgttccaa tgggttgagc tgtctttgtc gtggagatct ggaactt?gc a t g ca 2760 
Sc? SSS? 9 ** CtagCttcca -g-tgaggcg ccctctttac ctatcctctc 2820 
t ' 3 a C ! IT^T^ " Ctattatt tattcttccg ctgtctgcct gcagcagtac 2880 
acqctStta tltTJ ^ ttctcaa a agcttacca gtgtggactt ggtgttagcc 2940 

^o^" aCa9 tacgtgtcct gtttttaaaa tatacaatta ttcttaaaaa 3000 
taaattaaaa tcgtatactt acatttcaaa aaaaaaaaaa aaaa 3044 

<210> 433 
<211> 313 
<212> PRT 

<213> Homo sapiens 
<400> 433 

Met Pro Val Ala Gly Ser Glu Leu Pro Arg Arg Pro Leu Pro Pro Ala
1 5 10 15

Ala Gln Glu Arg Asp Ala Glu Pro Arg Pro Pro His Gly Glu Leu Gln
20 25 30

Tyr Leu Gly Gln Ile Gln His Ile Leu Arg Cys Gly Val Arg Lys Asp
35 40 45

Asp Arg Thr Gly Thr Gly Thr Leu Ser Val Phe Gly Met Gln Ala Arg
50 55 60

Tyr Ser Leu Arg Asp Glu Phe Pro Leu Leu Thr Thr Lys Arg Val Phe
65 70 75 80

Trp Lys Gly Val Leu Glu Glu Leu Leu Trp Phe Ile Lys Gly Ser Thr
85 90 95

Asn Ala Lys Glu Leu Ser Ser Lys Gly Val Lys Ile Trp Asp Ala Asn
100 105 110

Gly Ser Arg Asp Phe Leu Asp Ser Leu Gly Phe Ser Thr Arg Glu Glu
115 120 125

Gly Asp Leu Gly Pro Val Tyr Gly Phe Gln Trp Arg His Phe Gly Ala
130 135 140

Glu Tyr Arg Asp Met Glu Ser Asp Tyr Ser Gly Gln Gly Val Asp Gln
145 150 155 160

Leu Gln Arg Val Ile Asp Thr Ile Lys Thr Asn Pro Asp Asp Arg Arg
165 170 175

Ile Ile Met Cys Ala Trp Asn Pro Arg Asp Leu Pro Leu Met Ala Leu
180 185 190

Pro Pro Cys His Ala Leu Cys Gln Phe Tyr Val Val Asn Ser Glu Leu
195 200 205

Ser Cys Gln Leu Tyr Gln Arg Ser Gly Asp Met Gly Leu Gly Val Pro
210 215 220

Phe Asn Ile Ala Ser Tyr Ala Leu Leu Thr Tyr Met Ile Ala His Ile
225 230 235 240

Thr Gly Leu Lys Pro Gly Asp Phe Ile His Thr Leu Gly Asp Ala His
245 250 255

Ile Tyr Leu Asn His Ile Glu Pro Leu Lys Ile Gln Leu Gln Arg Glu
260 265 270

Pro Arg Pro Phe Pro Lys Leu Arg Ile Leu Arg Lys Val Glu Lys Ile
275 280 285

Asp Asp Phe Lys Ala Glu Asp Phe Gln Ile Glu Gly Tyr Asn Pro His
290 295 300

Pro Thr Ile Lys Met Glu Met Ala Val
305 310

<210> 434
<211> 1536
<212> DNA
<213> Homo sapiens
<400> 434

gggggggggg ggaccacttg gcctgcctcc gtcccgccgc gccacttggc ctgcctccgt 60
cccgccgcgc cacttcgcct gcctccgtcc cccgcccgcc gcgccatgcc tgtggccggc 120
tcggagctgc cgcgccggcc cttgcccccc gccgcacagg agcgggacgc cgagccgcgt 180
ccgccgcacg gggagctgca gtacctgggg cagatccaac acatcctccg ctgcggcgtc 240
aggaaggacg accgcacggg caccggcacc ctgtcggtat tcggcatgca ggcgcgctac 300
agcctgagag atgaattccc tctgctgaca accaaacgtg tgttctggaa gggtgttttg 360
gaggagttgc tgtggtttat caagggatcc acaaatgcta aagagctgtc ttccaaggga 420
gtgaaaatct gggatgccaa tggatcccga gactttttgg acagcctggg attctccacc 480
agagaagaag gggacttggg cccagtttat ggcttccagt ggaggcattt tggggcagaa 540
tacagagata tggaatcaga ttattcagga cagggagttg accaactgca aagagtgatt 600
gacaccatca aaaccaaccc tgacgacaga agaatcatca tgtgcgcttg gaatccaaga 660
gatcttcctc tgatggcgct gcctccatgc catgccctct gccagttcta tgtggtgaac 720
agtgagctgt cctgccagct gtaccagaga tcgggagaca tgggcctcgg tgtgcctttc 780
aacatcgcca gctacgccct gctcacgtac atgattgcgc acatcacggg cctgaagcca 840
ggtgacttta tacacacttt gggagatgca catatttacc tgaatcacat cgagccactg 900
aaaattcagc ttcagcgaga acccagacct ttcccaaagc tcaggattct tcgaaaagtt 960
gagaaaattg atgacttcaa agctgaagac tttcagattg aagggtacaa tccgcatcca 1020
actattaaaa tggaaatggc tgtttagggt gctttcaaag gagcttgaag gatattgtca 1080
gtctttaggg gttgggctgg atgccgaggt aaaagttctt tttgctctaa aagaaaaagg 1140
aactaggtca aaaatctgtc cgtgacctat cagttattaa tttttaagga tgttgccact 1200
ggcaaatgta actgtgccag ttctttccat aataaaaggc tttgagttaa ctcactgagg 1260
gtatctgaca atgctgaggt tatgaacaaa gtgaggagaa tgaaatgtat gtgctcttag 1320
caaaaacatg tatgtgcatt tcaatcccac gtacttataa agaaggttgg tgaatttcac 1380
aagctatttt tggaatattt ttagaatatt ttaagaattt cacaagctat tccctcaaat 1440
ctgagggagc tgagtaacac catcgatcat gatgtagagt gtggttatga actttatagt 1500
tgttttatat gttgctataa taaagaagtg ttctgc 1536

<210> 435
<211> 225
<212> PRT
<213> Homo sapiens
<400> 435

Met Asn Ser Asn Val Glu Asn Leu Pro Pro His Ile Ile Arg Leu Val
1 5 10 15

Tyr Lys Glu Val Thr Thr Leu Thr Ala Asp Pro Pro Asp Gly Ile Lys
20 25 30

Val Phe Pro Asn Glu Glu Asp Leu Thr Asp Leu Gln Val Thr Ile Glu
35 40 45

Gly Pro Glu Gly Thr Pro Tyr Ala Gly Gly Leu Phe Arg Met Lys Leu
50 55 60

Leu Leu Gly Lys Asp Phe Pro Ala Ser Pro Pro Lys Gly Tyr Phe Leu
65 70 75 80

Thr Lys Ile Phe His Pro Asn Val Gly Ala Asn Gly Glu Ile Cys Val
85 90 95

Asn Val Leu Lys Arg Asp Trp Thr Ala Glu Leu Gly Ile Arg His Val
100 105 110

Leu Leu Thr Ile Lys Cys Leu Leu Ile His Pro Asn Pro Glu Ser Ala
115 120 125

Leu Asn Glu Glu Ala Gly Arg Leu Leu Leu Glu Asn Tyr Glu Glu Tyr
130 135 140

Ala Ala Arg Ala Arg Leu Leu Thr Glu Ile His Gly Gly Ala Gly Gly
145 150 155 160

Pro Ser Gly Arg Ala Glu Ala Gly Arg Ala Leu Ala Ser Gly Thr Glu
165 170 175

Ala Ser Ser Thr Asp Pro Gly Ala Pro Gly Gly Pro Gly Gly Ala Glu
180 185 190

Gly Pro Met Ala Lys Lys His Ala Gly Glu Arg Asp Lys Lys Leu Ala
195 200 205

Ala Lys Lys Lys Thr Asp Lys Lys Arg Ala Leu Arg Ala Leu Arg Arg
210 215 220

Leu
225



<210> 436 

<211> 890 

<212> DNA 

<213> Homo sapiens 

<400> 436 

ggcggaccga agaacgcagg aagggggccg gggggacccg cccccggccg gccgcagcca 60
tgaactccaa cgtggagaac ctacccccgc acatcatccg cctggtgtac aaggaggtga 120
cgacactgac cgcagaccca cccgatggca tcaaggtctt tcccaacgag gaggacctca 180
ccgacctcca ggtcaccatc gagggccctg aggggacccc atatgctgga ggtctgttcc 240
gcatgaaact cctgctgggg aaggacttcc ctgcctcccc acccaagggc tacttcctga 300
ccaagatctt ccacccgaac gtgggcgcca atggcgagat ctgcgtcaac gtgctcaaga 360
gggactggac ggctgagctg ggcatccgac acgtactgct gaccatcaag tgcctgctga 420
tccaccctaa ccccgagtct gcactcaacg aggaggcggg ccgcctgctc ttggagaact 480
acgaggagta tgcggctcgg gcccgtctgc tcacagagat ccacgggggc gccggcgggc 540
ccagcggcag ggccgaagcc ggtcgggccc tggccagtgg cactgaagct tcctccaccg 600
accctggggc cccagggggc ccgggagggg ctgagggtcc catggccaag aagcatgctg 660
gcgagcgcga taagaagctg gcggccaaga aaaagacgga caagaagcgg gcgctgcggg 720
cgctgcggcg gctgtagtgg gctctcttcc tccttccacc gtgaccccaa cctctcctgt 780
cccctccctc caactctgtc tctaagttat ttaaattatg gctggggtcg gggagggtac 840
agggggcact gggacctgga tttgtttttc taaataaagt tggaaaagca 890



<210> 437 
<211> 197 
<212> PRT 

<213> Homo sapiens 



<400> 437 
Met Gin Arg Ala 
1 

Glu Pro Pro Pro 
20 

Asp Leu Arg Ala 
35 

Gly Val Phe Lys 
50 

Pro Pro Gin lie 
65 

Ser Ala Gly Arg 

Ala Trp Arg Pro 
100 

Leu Leu Met Ser 
115 

Ser Ser Glu Phe 
130 

Gln Trp Thr Glu
145

Glu Met Leu Asp

Ser Thr Gln Lys
180 

Phe His Pro Asp 
195 



Ser Arg Leu Lys 
5 

Gly Ile Thr Cys

Gln Ile Leu Gly
40

Leu Glu Val Ile
55

Arg Phe Leu Thr
70

Ile Cys Leu Asp
85

Ser Leu Asn Ile

Glu Pro Asn Pro 
120 

Lys Tyr Asn Lys 
135 

Lys His Ala Arg 
150 

Asn Leu Pro Glu 
165 

Arg Lys Ala Ser 
Val 



Arg Glu Leu His 
10 

Trp Gln Asp Lys
25

Gly Ala Asn Thr

Ile Pro Glu Arg
60

Pro Ile Tyr His

75 

Val Leu Lys Leu 
90 

Ala Thr Val Leu 
105 

Asp Asp Pro Leu 

Pro Ala Phe Leu 
140 

Gln Lys Gln Lys
155

Ala Gly Asp Ser
170

Gln Leu Val Gly
185 



Met Leu Ala Thr 
15 

Asp Gln Met Asp
30

Pro Tyr Glu Lys
45

Tyr Pro Phe Glu

Pro Asn Ile Asp
80

Pro Pro Lys Gly
95

Thr Ser Ile Gln
110

Met Ala Asp Ile
125

Lys Asn Ala Arg

Ala Asp Glu Glu
160

Arg Val His Asn
175

Ile Glu Lys Lys
190 



<210> 438 

<211> 928 

<212> DNA 

<213> Homo sapiens 



<400> 438 

gcgcgcagcg 

agtgcatccc 

gggatcatgc 

cccccaggca 

ttaggtggag 

gagaggtacc 

attgattctg 

agaccatccc 

aaccctgatg 

ttcctcaaga 

gaggaagaga 



ctggtacccc 
aggcagctct 
agagagcttc 
tcacatgttg 
ccaacacacc 
catttgaacc 
ctggaaggat 
tcaacatcgc 
acccgctcat 
atgccagaca 
tgcttgataa 



gttggtccgc 
tagtgtggag 
acgtctgaag 
gcaagataaa 
ttatgagaaa 
tcctcagatc 
ttgtctggat 
aactgtgttg 
ggctgacata 
gtggacagag 
tctaccagag 



gcgttgctgc 
cagtgaactg 
agagagctgc 
gaccaaatgg 
ggtgttttta 
cgatttctca 
gttctcaaat 
acctctattc 
tcctcagaat 
aagcatgcaa 
gctggtgact 



gttgtgaggg 
tgtgtggttc 
acatgttagc 
atgacctgcg 
agctagaagt 
ctccaattta
tgccaccaaa 
agctgctcat 
ttaaatataa 
gacagaaaca 
ccagagtaca 



gtgtcagctc 
cttctacttg 
cacagagcca 
agctcaaata 
tatcattcct 
tcatccaaac 
aggtgcttgg 
gtcagaaccc 
taagccagcc 
aaaggctgat 
caactcaaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660
cagaaaagga aggccagtca gctagtaggc atagaaaaga aatttcatcc tgatgtttag 720

gggacttgtc ctggttcatc ttagttaatg tgttctttgc caaggtgatc taagttgcct 780

accttgaatt tttttttaaa tatatttgat gacataattt ttgtgtagtt tatttatctt 840 

gtacatatgt attttgaaat cttttaaacc tgaaaaataa atagtcattt aatgttgaaa 900

aaaaaaaaaa aaaaaaaaaa aaaaaaaa 928 

<210> 439 
<211> 91 
<212> PRT 

<213> Homo sapiens 
<400> 439 

Met Ala Ala Asn Ala Thr Thr Asn Pro Ser Gln Leu Leu Pro Leu Glu

1 5 10 15

Leu Val Asp Lys Cys Ile Gly Ser Arg Ile His Ile Val Met Lys Ser

20 25 30

Asp Lys Glu Ile Val Gly Thr Leu Leu Gly Phe Asp Asp Phe Val Asn

35 40 45

Met Val Leu Glu Asp Val Thr Glu Phe Glu Ile Thr Pro Glu Gly Arg

50 55 60

Arg Ile Thr Lys Leu Asp Gln Ile Leu Leu Asn Gly Asn Asn Ile Thr

65 70 75 80 

Met Leu Val Pro Gly Gly Glu Gly Pro Glu Val 
85 90 



<210> 440 
<211> 749 
<212> DNA 
<213> Homo sapiens 



<400> 440 

atggcggcta 

tgtataggat 

ctaggatttg 

ccagaaggaa 

atgctggttc 

attttgtttt 

aaatcccata 

ttgtaagtta 

aaaataaagg 

acaaagtttg 

catttacgtg 

atttgaacag 

tgttgttttg 



acgctactac 
caagaattca 
atgactttgt 
gaaggattac 
ctggaggaga 
ggcttataat 
aagctaagtt 
atcatgatta 
tgtttttggt 
tagttatttt 
accatttgat 
gtgcatttaa 
taaaaaaaaa 



caacccgtcg 
catcgtgatg 
caatatggta 
taaattagat 
aggacctgaa 
gacaagaaaa 
tcccgttaaa 
tcctggaaaa 
taactgtcat 
gccacttact 
tctcaaacaa 
acaaccggaa 
aaaaaaaaa 



cagctgctgc 
aagagtgata 
ctggaagatg 
cagattttgc 
gtgtgaatga 
tggaattttt 
gggaagtgct 
agaagaaaag 
tttgtttatt 
tttctgtcat 
aagttgttcc 
atgatcactt 



cgttagagct 
aggaaattgt 
tcactgagtt 
taaatggaaa 
gtttccttga 
tttcccactt 
ttgaagatgt 
aacttcttct 
ctactgcagt 
tatatgctta 
aaacaaaatg 
agaaaattca 



tgtggacaaa 
tggtactctt 
tgaaatcaca 
taatataaca 
cttacactag 
tctaatgttt 
gtacccattt 
tttgcagatg 
agccagtgga 
tttgttttgt 
atgaactttg 
attaaaatgc 



<210> 441 
<211> 642 
<212> PRT 

<213> Homo sapiens 
<400> 441 

Met Lys Arg Ser Ser Val Ser Ser Gly Gly Ala Gly Arg Leu Ser Met 

1 5 10 15

Gln Glu Leu Arg Ser Gln Asp Val Asn Lys Gln Gly Leu Tyr Thr Pro

20 25 30

Gln Thr Lys Glu Lys Pro Thr Phe Gly Lys Leu Ser Ile Asn Lys Pro

35 40 45

Thr Ser Glu Arg Lys Val Ser Leu Phe Gly Lys Arg Thr Ser Gly His

50 55 60

Gly Ser Arg Asn Ser Gln Leu Gly Ile Phe Ser Ser Ser Glu Lys Ile



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

749
65

Lys Asp Pro Arg 

Arg Gln Leu Cys
100 

Ser Met Lys Ser 
115 

Phe Thr Phe Leu 
130 

Thr Lys Phe Glu 
145 

Pro Phe Ala Leu 

Thr Trp Pro His 
180 

Lys Ile His Thr
195

Gln Pro Trp Gly
210 

Phe Leu Asp Tyr 
225 

Asp Ser Phe Asp 

Leu Phe Asn Val 

260 

Arg Ala Leu Asn 
275 

Glu Pro Asn Arg 
290 

Gly Asp Val Gln
305

Ser Ala Ile Leu

Arg Val Glu Leu
340

Gln Asn Ile Ile
355

Ile Asn His Glu
370 

Lys Asp Leu Glu 
385 

Tyr Ala Arg Gly 

Lys Leu Ala Arg 
420 

Lys Gly Tyr Asp 
435

Cys Leu Val Lys
450

Leu Asn Glu Thr
465

Gly Leu Glu Asp

Lys Arg Ser Val
500

Leu Tyr Gln Gln
515

Ala Ser Glu Leu
530

Thr Val Asn Gln
545 



70 

Pro Leu Asn Asp 
85 

Glu Phe Leu Thr 

Leu Gin Ala Pro 
120 

Tyr Gly Phe Leu 
135 

Glu Glu Val Pro 
150 

Ser Lys Ser Ser 
165 

Ile Val Ala Ala

Ala Met Lys Glu
200

Glu Glu Thr Glu
215

Thr Ile Lys Cys
230

Glu Met Asn Ala
245

Asp Ala Phe Lys

Glu Gln Ile Ala
280

Leu Glu Ser Leu
295

Lys Tyr Gln Ala
310

Asp Gln Lys Leu
325 

Glu Cys Glu Thr 

Asp Asn Gln Lys
360

Arg Asn Glu Leu
375

Ala Glu Gln Gln
390

Lys Glu Ala Ile
405

Lys Leu Lys Leu

Phe Glu Ile Lys
440

Tyr Arg Ala Gln
455

Glu Glu Glu Ile
470

Thr Leu Glu Gln
485

Arg Thr Leu Lys

Lys Ile Lys Glu
520 

Glu Ser Leu Glu 
535 

Gly Leu Ser Glu 
550 



75 

Lys Ala Phe Ile
90 

Glu Asn Gly Tyr 
105 

Ser Val Lys Asp 

Cys Pro Ser Tyr 
140 

Arg Ile Phe Lys
155 

Met Tyr Thr Val 
170 

Leu Val Trp Leu 
185 

Ser Ser Pro Leu 

Asp Gly Ile Met
220

Tyr Glu Ser Phe
235

Glu Leu Gln Ser
250

Leu Glu Ser Leu

265

Arg Leu Glu Gln

Arg Lys Leu Lys 
300 

Tyr Met Ser Asn 
315 

Asn Gly Leu Asn 
330 

Ile Lys Gln Glu
345

Tyr Ser Val Ala

Gln Gln Thr Ile
380

Lys Leu Trp Asn
395

Glu Thr Gln Leu
410

Ile Pro Lys Gly
425 

Phe Asn Pro Glu 

Val Tyr Val Pro 
460 

Asn Lys Ala Leu 
475 

Leu Asn Ala Met 
490

Glu Glu Val Gln
505 

Ala Glu Glu Glu 

Lys His Lys His 
540 

Ala Met Asn Glu 
555 



80 

Gln Gln Cys Ile
95

Ala His Asn Val
110

Phe Leu Lys Ile
125

Glu Leu Pro Asp

Asp Leu Gly Tyr
160

Gly Ala Pro His
175

Ile Asp Cys Ile
190 

Phe Asp Asp Gly 
205 

His Asn Lys Leu 

Met Ser Gly Ala 
240

Lys Leu Lys Asp
255

Glu Ala Lys Asn

270
Glu Arg Glu Lys
285

Ala Ser Leu Gln

Leu Glu Ser His
320

Glu Glu Ile Ala
335

Asn Thr Arg Leu
350

Asp Ile Glu Arg
365 

Asn Lys Leu Thr 

Glu Glu Leu Lys 
400 

Ala Glu Tyr His 
415 

Ala Glu Asn Ser 
430 

Ala Gly Ala Asn 
445 

Leu Lys Glu Leu 

Asn Lys Lys Met 
480 

Ile Thr Glu Ser
495

Lys Leu Asp Asp 
510 

Asp Glu Lys Cys 
525 

Leu Leu Glu Ser 

Leu Asp Ala Val
560
Gln Arg Glu Tyr

Lys Val Gly Asn 
580 

Val Gly Ser Val 
595 

Arg Glu Tyr Glu 
610 

Glu Ile Arg Asp
625

Glu Glu



Gln Leu Val Val
565

Asn Leu Gln Arg

Glu Lys His Leu 
600 

Glu Cys Met Ser 
615 

Lys Tyr Glu Lys 
630 



Gln Thr Thr Thr
570

Leu Leu Glu Met
585

Glu Glu Gln Ile

Glu Asp Leu Ser 
620 

Lys Ala Thr Leu 
635 



Glu Glu Arg Arg 
575 

Val Ala Thr His 
590 

Ala Lys Val Asp 
605 

Glu Asn Ile Lys

Ile Lys Ser Ser
640 



<210> 442 
<211> 2150 
<212> DNA 

<213> Homo sapiens 



<400> 442 
ctcgagccac 
cggtccctgg 
tttccagcgg 
aacaaggcct 
acaaaccgac 
cccggaatag 
ttaatgacaa 
atggttatgc 
tgaagatctt 
agtttgaaga 
aaagctccat 
tttggctaat 
atgatgggca 
tggactacac 
tgaatgcaga 
aatcattaga 
gagaaaaaga 
atgttcaaaa 
agaaattaaa 
aacaggagaa 
ttgagcgaat 
acctggaagc 
aagcgattga 
ctaaaggtgc 
gtgccaactg 
atgaaactga 
tagaacaatt 
aagttcaaaa 
aaaaatgtgc 
ttaaccaggg 
tagttgtgca 
tagagatggt 
aagttgatag 
ttagagataa 
atgttgatca 
aaaaaaaaaa 



gaaggccccg 
gtcgtgtcag 
tggtgctggc 
ctatacccct 
atctgaaaga 
tcaacttggt 
agcattcatt 
acataatgtg 
cacatttctt 
agaggttcca 
gtacacagtg 
agactgcatc 
gccttgggga 
cataaaatgc 
gctgcagtca 
agcaaaaaac 
accgaatcgt 
gtatcaggca 
tggtctcaat 
cactcgacta 
aaatcatgaa 
tgaacaacag 
aacacaatta 
tgagaattcc 
ccttgtcaaa 
agaagaaatt 
gaatgcaatg 
gctggatgat 
cagtgagctt 
gctcagtgaa 
aaccacgact 
tgctacacat 
agaatatgaa 
gtatgagaag 
tgtatatata 
aaaaaaaaaa 



ctgtcctgtc 
gaaactggaa 
cgcctctcca 
caaaccaaag 
aaagtctcgc 
atattttcca 
cagcagtgta 
tccatgaaat 
tatggcttcc 
agaatcttta 
ggggctcctc 
aagatacata 
gaagaaactg 
tatgagagtt 
aaactgaagg 
agagcattga 
ctagagtcgt 
tacatgagca 
gaggaaattg 
cagaatatca 
agaaatgaat 
aagttgtgga 
gcagagtatc 
aaaggttatg 
tacagggctc 
aataaagccc 
ataacagaaa 
ctttaccaac 
gagtccttgg 
gctatgaatg 
gaagaaagac 
gttgggtctg 
gaatgcatgt 
aaagctactc 
tccatagtga 
aaaaaaaaaa 



tagcagatac 
aaaaggtcat 
tgcaggagtt 
agaaaccaac 
tatttggcaa 
gttctgagaa 
ttcgacaact 
ctctacaagc 
tgtgcccctc 
aagaccttgg 
atacatggcc 
ctgccatgaa 
aagatggaat 
ttatgagtgg 
atttatttaa 
atgaacagat 
tgagaaaact 
atttggagtc 
ctagagtaga 
ttgacaacca 
tgcagcagac 
atgaggagtt 
acaaattggc 
actttgaaat 
aagtttatgt 
taaataaaaa 
gcaagagaag 
aaaaaattaa 
agaaacacaa 
aattagatgc 
gaaaagtggg 
tagagaaaca 
cagaagatct 
taattaagtc 
ataaaattgt 
aaaaaaaaaa 



ttgcacggtt 
aagcatgaag 
aagatcccag 
ctttggaaag 
aagaactagt 
aatcaaggac 
ctgtgagttt 
tccctctgtt 
atacgaactt 
gtatcctttt 
tcacattgtg 
agaaagctca 
tatgcataat 
tgccgacagc 
tgtggatgct 
tgcaagattg 
gaaggcttcc 
tcattcagcc 
actagaatgt 
gaagtactca 
tattaataaa 
aaaatatgcc 
tagaaaatta 
taagtttaat 
acctcttaag 
aatgggtttg 
tgtgagaact 
ggaagcagag 
gcacctgcta 
tgttcagcgg 
aaataacttg 
tcttgaggag
ctcggaaaat 
ttctgaagaa 
ctcagtaaaa 
aaaaaaaaaa 



tacagaaatt 
cgcagttcag 
gatgtaaata 
ttgagtataa 
ggacatggat 
ccgagaccac 
cttacagaaa 
aaagacttcc 
cctgacacaa 
gcactatcca 
gcagccttag 
cctttatttg 
aagttgtttt 
tttgatgaga 
tttaagctgg 
gaacaagaaa 
ttacaaggag 
attcttgacc 
gaaacaataa 
gttgcagaca 
ttaaccaagg 
agaggcaaag 
aaacttattc 
cccgaggctg 
gaactcctga 
gaggatactt 
ctgaaagaag 
gaagaggatg 
gaaagtactg 
gaataccaac 
caacgtctgt 
cagattgcta 
attaaagaga 
tgaagataaa 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2150 



<210> 443 
<211> 380 
<212> PRT 

<213> Homo sapiens 
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<400> 443 



Mot- 
ive L. 


Gly 


He 


bill 


p 1 v 
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He 
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1 3 
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He 
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Phe 


Met 


Cys 


p 1 n 

Lily 


GlU 


Lys 


bin 


rne 


Ser 


fit-, 

blU 


blU 


Arg 


305 










310 










315 










320 


Ile


Arg 


Ser 


Gly 


Val 


Lys 


Arg 


Leu 


Ser 


Lys 


Ser 


Arg 


Gln


Gly 


Ser 


Thr 










325 










330 










335 




Gin Gly Arg 


Leu 


Asp Asp 


Phe 


Phe 


Lys 


Val 


Thr 


Gly 


Ser 


Leu 


Ser 


Ser 








340 










345 










350 






Ala 


Lys 


Arg 


Lys 


Glu 


Pro 


Glu 


Pro 


Lys 


Gly 


Ser 


Thr 


Lys 


Lys 


Lys 


Ala 






355 










360 










365 








Lys 


Thr 


Gly 


Ala 


Ala 


Gly 


Lys 


Phe 


Lys 


Arg 


Gly 


Lys 












370 










375 










380 











<210> 444 
<211> 2265 
<212> DNA 

<213> Homo sapiens 
<400> 444 

agtcctgcga tttcgggtgt agagggagca ggggcctgcg gggacctggt gtgggtggag 60
tggggacaag cggtggagaa gggtacgcca gggtcgctga gagactctgt tctccctgga 120
gggactggtt gccatgagag cagccgtctg aggggacgca gcctgcacta cgcgccccaa 180
gaggctgtgc gtggcgagca ggtcacgtga cgggagcgcg ggctttggaa ggcggctgaa 240
cgtcaggcca cccgccgcta agctgagaag ggagagcgag cttaggaccg cctgcccggg 300 
gcaaccccga accaagcttt agccgccgag gccgcgtgtc ccaaaggcca gtcatccctc 360 
ctctgtgttg ccatgggaat tcaaggcctg gccaaactaa ttgctgatgt ggcccccagt 420 
gccatccggg agaatgacat caagagctac tttggccgta aggtggccat tgatgcctct 480 
atgagcattt atcagttcct gattgctgtt cgccagggtg gggatgtgct gcagaatgag 540 
gagggtgaga ccaccagcca cctgatgggc atgttctacc gcaccattcg catgatggag 600 
aacggcatca agcccgtgta tgtctttgat ggcaagccgc cacagctcaa gtcaggcgag 660 
ctggccaaac gcagtgagcg gcgggctgag gcagagaagc agctgcagca ggctcaggct 720 
gctggggccg agcaggaggt ggaaaaattc actaagcggc tggtgaaggt cactaagcag 780 
cacaatgatg agtgcaaaca tctgctgagc ctcatgggca tcccttatct tgatgcaccc 840 
agtgaggcag aggccagctg tgctgccctg gtgaaggctg gcaaagtcta tgctgcggct 900 
accgaggaca tggactgcct caccttcggc agccctgtgc taatgcgaca cctgactgcc 960 
agtgaagcca aaaagctgcc aatccaggaa ttccacctga gccggattct gcaggagctg 1020 
ggcctgaacc aggaacagtt tgtggatctg tgcatcctgc taggcagtga ctactgtgag 1080 
agtatccggg gtattgggcc caagcgggct gtggacctca tccagaagca caagagcatc 1140 
gaggagatcg tgcggcgact tgaccccaac aagtaccctg tgccagaaaa ttggctccac 1200 
aaggaggctc accagctctt cttggaacct gaggtgctgg acccagagtc tgtggagctg 1260 
aagtggagcg agccaaatga agaagagctg atcaagttca tgtgtggtga aaagcagttc 1320 
tctgaggagc gaatccgcag tggggtcaag aggctgagta agagccgcca aggcagcacc 1380 
cagggccgcc tggatgattt cttcaaggtg accggctcac tctcttcagc taagcgcaag 1440
gagccagaac ccaagggatc cactaagaag aaggcaaaga ctggggcagc agggaagttt 1500 
aaaaggggaa aataaatgtg tttccccatt atacctcctt caccccagaa tatttgccgt 1560 
cttgtaccct taagagctac agctagagaa accttcacgg ggtggagaga ggattctaag 1620 
gcttttctag cgtgaccctt ttcagtagtg ctagtccctt ttttacttga tcttaatggc 1680 
aagaaggcca cagaggtact tttccttttt tagctcagga aaatatgtca ggctcaaacc 1740 
acttctcagg cagtttaatg gacactaagt ccattgttac atgaaagtga tagatagcaa 1800 
caagttttgg agaagagaga gggagataaa agggggagac aaaagatgta cagaaatgat 1860 
ttcctggctg gccaactggt ggccagtggg aggtgatggt ggacctagac tgtgcttttc 1920 
tgtcttgttc agccttgacc caccttgaga gagagccacc aggaaggcgc atcttagcag 1980 
atgggaggaa ctgctgagag aagatgggca gaaagctgga gcccctggag ttggctgtgt 2040 
ctgtgtttgt gactgattac tggctgtgtc ttgggtgggc agaaactcga acttgctatg 2100 
taatttgtgt ctagttattc agaggagtaa gatggtgatg ttcacctggc aatcagctga 2160 
gttgagactt tggaataaga cactggtttt catgcgctgt ttttgtttta aagttatgaa 2220 
gaaaaaagtc aataaaattc taaaagtaaa aaaaaaaaaa aaaaa 2265 

<210> 445 
<211> 277 
<212> PRT 
<213> Homo sapiens 



<400> 445 



Met 


Glu 


Ala 


Ala 


Glu 


Thr 


Glu 


Ala 


Glu 


Ala 


Ala 


Ala 


Leu 


Glu 


Val 


Leu 


1 


Glu 






5 










10 










15 




Ala 


Val 


Ala 


Gly 


Ile


Leu 


Glu 


Pro 


Val 


Gly 


Leu 


Gln


Glu 


Glu 


Ala 


Glu 






20 










25 










30 






Leu 


Pro 


Ala 


Lys 


Ile


Leu 


Val 


Glu 


Phe 


Val 


Val 


Asp 


Ser 


Gln


Lys 






35 










40 










45 






Lys 


Asp 


Lys 


Leu 


Leu 


Cys 


Ser 


Gln


Leu


Gln


Val 


Ala 


Asp 


Phe 


Leu 


Gln




50 










55 










60 








Asn 


Ile


Leu 


Ala 


Gln


Glu 


Asp 


Thr 


Ala 


Lys 


Gly Leu 


Asp 


Pro 


Leu 


Ala 


65 


Glu 








70 










75 








80 


Ser 


Asp 


Thr 


Ser 


Arg 


Gln


Lys 


Ala 


Ile


Ala 


Ala 


Lys 


Glu 


Gln


Trp 


Lys 


Glu 






85 










90 










95 


Leu 


Lys 


Ala 


Thr 


Tyr 


Arg 


Glu 


His 


Val 


Glu 


Ala 


Ile


Lys


Ile


Gly 






100 










105 










110 




Leu 


Thr 


Lys 


Ala 


Leu 


Thr 


Gln


Met 


Glu 


Glu 


Ala 


Gln


Arg 


Lys 


Arg 


Thr 




115 










120 










125 


Gln


Leu 


Arg 


Glu 


Ala 


Phe 


Glu 


Gln


Leu


Gln


Ala 


Lys 


Lys 


Gln


Met 


Ala 


130 










135 










140 






Met 


Glu 


Lys 


Arg 


Arg 


Ala 


Val 


Gln


Asn


Gln


Trp


Gln


Leu


Gln


Gln
145

Glu Lys His Leu 

Arg Lys Thr Gly 
180 

Gly Asn Leu Lys 
195 

Tyr Gln Thr Phe
210

Phe Pro Glu Ala
225

Gln Gln Pro Thr

Arg Asp Pro Gly 
260 

Asp Val Asn Leu 
275 



150 

Gln His Leu Ala
165

Thr Gln Gln Glu

Gln Gln Ala Glu
200

Leu Gln Leu Leu
215

Glu Ala Glu Ala

230

Arg Pro Gln Glu
245

Val Ser Phe Lys
Pro 



155 

Glu Val Ser Ala 
170 

Leu Asp Arg Val 
185 

Gln Glu Arg Asp

Tyr Thr Leu Gln
220

Glu Asn Leu Pro
235

Gln Ser Thr Gly
250 

Ala Val Gly Leu 
265 









160 


Glu 


Val 


Arg 


Glu 






175 




Phe 


Gln


Lys 


Leu 




190 






Lys 


Leu 


Gln


Arg 


205 








Gly


Lys


Leu


Leu


Asp 


Asp 


Lys 


Pro 








240 


Asp 


Thr 


Met 


Gly 






255 




Gln


Pro 


Ala 


Gly 




270







<210> 446

<211> 1658

<212> DNA 

<213> Homo sapiens 



<400> 446 

ggcacgagct 

ctagaggtcc 

gaactgccag 

ctctgcagcc 

gctaagggtc 

aaggaacaat 

ggcctcacca 

gaagcctttg 

cagaaccagt 

gaggtgaggg 

ggaaacctga 

ctccagcttc

gagaatcttc 

gacaccatgg 

gatgtaaatt 

cctctgactt 

agcttgattt 

ctctctcagt 

tactcagaag 

ggggaggggg 

aggaggagaa 

aaatgtttgt 

gtgagctcat 

tgtgccttat 

ttcaaagatt 

ctagtgatac 

tgtcaactct 

caaataaata 



cggctcctgg 
tggctgaggt 
ccaagatcct 
agcttcaggt 
tcgacccctt 
ggaaagagct 
aggccctgac 
agcagctcca 
ggcagctaca 
agcgtaagac 
agcagcaggc 
tgtataccct 
cagatgataa 
ggagagaccc 
tgccatgact 
ccctcacctc 
ctgactacat 
tgtgtatttg 
ccaataacat 
tactggcctt 
tgttttaggc 
tgagtccttt 
ctgactgttt 
ctaaaacatt 
cagagattgg 
cttgatcttt 
tacttttatc 
tttaaggtta 



aaagatggag 
ggcaggcatc 
ggttgagttt 
agcggatttc 
ggcttctgaa 
gaaggccacc 
tcagatggag 
ggccaagaaa 
acaggagaag 
agggactcag 
agaacaggag 
gcagggtaag 
accccagcag 
tggtgtgtcc 
tcctggagga 
ccaaccatca 
cccagcaagc 
ttcatcttca 
gacaggagct 
gggcctccta 
aggtggttat 
ctttgtgcta 
taggatgtac 
tctcgatcac 
cttttgtcat 
cccactttct 
agatgatcaa 
aaaaaaaaaa 



gcagcggaga 
ttggaacctg 
gtggtggact 
ctgcagaaca 
gacacgagcc 
tacagggagc 
gaagcccaga 
caaatggcca 
catctgcagc 
caggagcttg 
cgggacaagc 
ctgttgttcc 
ccgactcgac 
ttcaaggctg 
cagcagcatg 
ttacaggaaa 
tctggcatct 
tatgctggca 
gggactggtt 
tgatgcagac 
atgtgggaag 
aggttcttgc 
agcctagtgt 
tggtttcaga 
ccactattgt 
gttttcggat 
ctcacgtatt 
aaaaaaaa 



cagaggcgga 
taggcctgca 
ctcagaagaa 
tcctggctca 
gacagaaggc 
acgtagaggc 
ggaaacggac 
tggagaaacg 
atctggcgga 
acagggtgtt 
tgcagaggta 
ctgaggctga 
cccaggagca 
ttggtctaca 
gagaaagatc 
gactgtgaac 
gtggattaaa 
ggaacaacta 
tgaacacagg 
atggtgaatt 
ataattttat 
ggtgaaccag 
taacattctt 
tgttcattta 
atgttttgtt 
tggagaagat 
tggatcttta 



agctgcagcc 
ggaggaggca 
agacaagctg 
ggaggacact 
aattgcagct 
catcaaaatt 
acaactccgg 
cagagcagtc 
ggtttctgca 
tcagaaactt 
tcagaccttc 
ggctgaggca 
gagtacagga 
acctgctgga 
ctagaaaagg 
tcctgagttc 
atccctggat 
ttaatacaga 
gtgtgcagat 
taattcaagg 
tcatggatcc 
aattataaca 
ggtatctttt 
ttatattctt 
tcattgacct 
gtaccttttt 
tttgttttct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1658 



<210> 447 
<211> 277 
<212> PRT 

<213> Homo sapiens 
<400> 447 

Met Glu Ala Ala Glu Thr Glu Ala Glu Ala Ala Ala Leu Glu Val Leu
1 5 10 15

Ala Glu Val Ala Gly Ile Leu Glu Pro Val Gly Leu Gln Glu Glu Ala

20 25 30

Glu Leu Pro Ala Lys Ile Leu Val Glu Phe Val Val Asp Ser Gln Lys

35 40 45

Lys Asp Lys Leu Leu Cys Ser Gln Leu Gln Val Ala Asp Phe Leu Gln

50 55 60

Asn Ile Leu Ala Gln Glu Asp Thr Ala Lys Gly Leu Asp Pro Leu Ala
65 70 75 80

Ser Glu Asp Thr Ser Arg Gln Lys Ala Ile Ala Ala Lys Glu Gln Trp

85 90 95

Lys Glu Leu Lys Ala Thr Tyr Arg Glu His Val Glu Ala Ile Lys Ile

100 105 110

Gly Leu Thr Lys Ala Leu Thr Gln Met Glu Glu Ala Gln Arg Lys Arg

115 120 125

Thr Gln Leu Arg Glu Ala Phe Glu Gln Leu Gln Ala Lys Lys Gln Met

130 135 140

Ala Met Glu Lys Arg Arg Ala Val Gln Asn Gln Trp Gln Leu Gln Gln
145 150 155 160

Glu Lys His Leu Gln His Leu Ala Glu Val Ser Ala Glu Val Arg Glu

165 170 175

Arg Lys Thr Gly Thr Gln Gln Glu Leu Asp Arg Val Phe Gln Lys Leu

180 185 190

Gly Asn Leu Lys Gln Gln Ala Glu Gln Glu Arg Asp Lys Leu Gln Arg

195 200 205

Tyr Gln Thr Phe Leu Gln Leu Leu Tyr Thr Leu Gln Gly Lys Leu Leu

210 215 220

Phe Pro Glu Ala Glu Ala Glu Ala Glu Asn Leu Pro Asp Asp Lys Pro
225 230 235 240

Gln Gln Pro Thr Arg Pro Gln Glu Gln Ser Thr Gly Asp Thr Met Gly

245 250 255

Arg Asp Pro Gly Val Ser Phe Lys Ala Val Gly Leu Gln Pro Ala Gly

260 265 270 

Asp Val Asn Leu Pro 
275 



<210> 448 

<211> 1851 

<212> DNA 

<213> Homo sapiens 



<400> 448 

ggcacgagct 

ctagaggtcc 

gaactgccag 

ctctgcagcc 

gctaagggtc 

aaggaacaat 

ggcctcacca 

gaagcctttg 

cagaaccagt 

gaggtgaggg 

ggaaacctga 

ctccagcttc 

gagaatcttc 

gacaccatgg 

gatgtaaatt 

tcagacccaa 

gtttacctct 

gaaagagctt 



cggctcctgg 

tggctgaggt 

ccaagatcct 

agcttcaggt 

tcgacccctt 

ggaaagagct 

aggccctgac 

agcagctcca 

ggcagctaca 

agcgtaagac 

agcagcaggc 

tgtataccct 

cagatgataa 

ggagagaccc 

tgccatgact 

ctcaggcctt 

ctctgcatct 

atgttacacc 



aaagatggag 

ggcaggcatc 

ggttgagttt 

agcggatttc 

ggcttctgaa 

gaaggccacc 

tcagatggag 

ggccaagaaa 

acaggagaag 

agggactcag 

agaacaggag 

gcagggtaag 

accccagcag 

tggtgtgtcc 

tcctggagga 

ggtgtccctg 

gagctctgct 

tcattcccat 



gcagcggaga 
ttggaacctg 
gtggtggact 
ctgcagaaca 
gacacgagcc 
tacagggagc 
gaagcccaga 
caaatggcca 
catctgcagc 
caggagcttg 
cgggacaagc 
ctgttgttcc 
ccgactcgac 
ttcaaggctg 
cagcagcatg 
gactgcaagt 
acccatggag 
gcttagccca 



cagaggcgga 
taggcctgca 
ctcagaagaa 
tcctggctca 
gacagaaggc 
acgtagaggc 
ggaaacggac 
tggagaaacg 
atctggcgga 
acagggtgtt 
tgcagaggta 
ctgaggctga 
cccaggagca 
ttggtctaca 
gagaaagatc 
gtggaaggag 
cagatggatg 
cccagagcta 



agctgcagcc 
ggaggaggca 
agacaagctg 
ggaggacact 
aattgcagct 
catcaaaatt 
acaactccgg 
cagagcagtc 
ggtttctgca 
tcagaaactt 
tcagaccttc 
ggctgaggca 
gagtacagga 
acctgctgga 
ctagaaaagg 
ggaaagcctg 
gtgggaacag 
acccctgtct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080
tcttccccag
ctcctgagtt 
aatccctgga 
attaatacag 
ggtgtgcaga 
ttaattcaag 
ttcatggatc 
ttataacagt 
tatctttttg 
atattctttt 
attgacctct 
accttttttg 
tgttttctca 



gcctctgact 
cagcttgatt 
tctctctcag 
atactcagaa 
tggggagggg 
gaggaggaga 
caaatgtttg 
gagctcatct 
tgccttatct 
caaagattca 
agtgatacct 
tcaactctta 
aataaatatt 



tccctcacct 
tctgactaca 
ttgtgtattt 
gccaataaca 
gtactggcct 
atgttttagg 
ttgagtcctt 
gactgtttta 
aaaacatttc 
gagattggct 
tgatctttcc 
cttttatcag 
taaggttaaa 



cccaaccatc 
tcccagcaag 
gttcatcttc 
tgacaggagc 
tgggcctcct 
caggtggtta 
ttgtgctaag 
ggatgtacag 
tcgatcactg 
tttgtcatcc 
cactttctgt
atgatcaact 
aaaaaaaaaa 



attacaggaa 
ctctggcatc 
atatgctggc 
tgggactggt 
atgatgcaga 
tatgtgggaa 
gttcttgcgg 
cctagtgtta 
gtttcagatg 
actattgtat 
tttcggattg 
cacgtatttg 
aaaaaaaaaa 



agactgtgaa 
tgtggattaa 
aggaacaact 
ttgaacacag 
catggtgaat 
gataatttta 
tgaaccagaa 
acattcttgg
ttcatttatt 
gttttgtttc 
gagaagatgt 
gatctttatt 
a 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1851 



<210> 449 
<211> 211 
<212> PRT 

<213> Homo sapiens 
<400> 449 

Met Ala Ala Arg Arg Gly Ala Leu Ile Val Leu Glu Gly Val Asp Arg

1 5 10 15

Ala Gly Lys Ser Thr Gln Ser Arg Lys Leu Val Glu Ala Leu Ser Arg

20 25 30

Gly Pro Pro Pro Glu Leu Leu Arg Phe Pro Glu Arg Ser Thr Glu Ile

35 40 45

Gly Lys Leu Leu Ser Ser Tyr Leu Gln Lys Lys Ser Asp Val Glu Asp

50 55 60

His Ser Val His Leu Leu Phe Ser Ala Asn Arg Trp Glu Gln Val Pro
65 70 75 80

Leu Ile Lys Glu Lys Leu Ser Gln Gly Val Thr Leu Val Val Asp Arg

85 90 95

Tyr Ala Phe Ser Gly Val Ala Phe Thr Gly Ala Lys Glu Asn Phe Ser

100 105 110

Leu Asp Trp Cys Lys Gln Pro Asp Val Gly Leu Pro Lys Pro Asp Leu

115 120 125

Val Leu Phe Leu Gln Leu Gln Leu Ala Asp Ala Ala Lys Arg Gly Ala

130 135 140

Phe Gly His Glu Arg Tyr Glu Asn Gly Ala Phe Gln Glu Arg Ala Leu
145 150 155 160

Arg Cys Phe His Gln Leu Met Lys Asp Thr Thr Leu Asn Trp Lys Met

165 170 175

Val Asp Ala Ser Lys Arg Leu Glu Ala Val His Glu Glu Leu Arg Val

180 185 190

Leu Ser Glu Asp Ala Ile Arg Thr Ala Thr Glu Lys Pro Leu Gly Glu

195 200 205 

Leu Trp Lys 
210 



<210> 450 
<211> 1000 
<212> DNA 

<213> Homo sapiens 



<400> 450 

aattcgccag 

gagggcgtgg 

cgcgggccac 

ctgagttcct 

tctgcaaatc 



cggcgcggtg 
accgcgccgg 
cgcccgaact 
acttgcaaaa 
gctgggaaca 



gacagtcatg gcggcccggc gcggggctct catagtgctg 60 

gaagagcacg cagagccgca agctggtgga agcgctgtcg 120 

gctccggttc ccggaaagat caactgaaat cggcaaactt 180 

gaaaagtgac gtggaggatc actcggtgca cctgcttttt 240 

agtgccgtta attaaggaaa agttgagcca gggcgtgacc 300
ctcgtcgtgg
tccctagact 
ctccagttac 
aacggggctt 
ttgaactgga 
gtgctctctg 
tgacccaagg 
cgcggaaggc 
cagcagagat 
tggtggggac 
ccctgtcctg 
gtttcattgt 



acagatacgc 

ggtgtaaaca 

agctggcgga 

tccaggagcg 

agatggtgga 

aggacgccat 

ctgcccactg 

cccgtcccca 

ctgcaggctg 

cttggcagtg 

atggcactct 

gatcactgta 



attttctggt 

gccagacgtg 

tgctgccaag 

ggcgctccgg 

tgcttccaaa 

ccgcactgcc 

gagacgcctc 

gcggagtcca 

cctcttctgc 

tggacatgag 

gttgtatttt 

aatggtaatc 



gtggccttca
ggccttccca 
cggggagcgt
tgtttccacc 
agactcgaag 
acagagaagc 
tccctgcagt 
gaccccacaa 
cccggagctg 
cagagcgatg
cttactgaag 
agttggaatt 



ccggtgccaa 
aacccgacct 
ttggccatga 
agctcatgaa 
ctgtccatga 
cgctggggga 
cccccgagag 
cttcaggagc 
gggtgcactg 
gagcagtctc 
ttcagtgata 



ggagaatttt 
ggtcctgttc 
gcgctatgag
agacacgact 
ggaactccgc 
gctatggaag 
gtgggagact 
tctttcccgg 
gggacccccg 
ctgccctctc 
actctgagca 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1000 



<210> 451 
<211> 282 
<212> PRT 
<213> Homo sapiens 



<400> 451 
Met Pro Leu 
1 

Leu Val Ala 

Ala Ile His

35

Ile Lys Val
50

Ile Gln Ala
65 

Thr Phe Arg 

Gly Ser Ser 

Gin Gly Tyr 
115 

Val Thr Val 

130 
Phe Asp Phe 
145 

Glu Gly Leu 
Leu Gln Ile



Phe Gly Asn 
195 

Leu Met Glu 

210 
Ile Ser Leu
225

Val Ser Ile

Ile Arg Asn

Pro Asp Glu 
275 



Leu Thr Gln

5 

Ser Leu Asp 
20 

Phe Arg Glu 

Thr Val Glu 

Gly Ile Phe
70

Ile Asn Leu
85 

Pro Met Pro 
100 

Gly Tyr Pro 

Cys Lys Ile

Cys Ser Thr 
150 

Arg Glu Ala 

165 
Thr Met Ser 
180 

Ala Gly Ser 



Gln Ile

Asn Val 

His Ala 
40 

Asn Ala 
55 

Gln Glu

Thr Val 

Gly Thr 

Leu Met 
120 
Asn Thr 
135 

Asn Val 
Phe Ser 
Pro Asp 



Gln Asp Glu Asp Asp Gln
10 

Arg Asn Leu Ser 
25 

Thr Cys Phe Ala 



Ala Phe His 

Leu Lys Pro 
230 

Arg Thr Asp 

245 
Glu Asp Gly 
260 

Glu Val Pro 



Ser His 
200 
Cys Asn 
215 

Ser Thr 

Asn Arg 

Gln Ile

Glu Ser 
280 



Lys Cys Val Gln
60

Phe Lys Val Gln
75 

Leu Leu Asp Cys 
90 

Leu Thr Ala Leu 
105 

Leu Phe Leu Glu 

Gln Glu Pro Glu
140

Ile Asn Lys Ile
155 

Glu Leu Asp Met 
170 

Lys Pro Tyr Phe 
185 

Leu Asp Tyr Pro 

Gln Thr Gln Val
220 

Lys Ala Leu Val 
235 

Gly Phe Leu Ser 
250 

Cys Phe Val Glu 
265 
Glu Ser 



Thr Ile
30

Thr Lys

45

Ala Asn 



Glu Glu 
Leu Ser 



Arg Met 
110 
Glu Gly 
125 

Glu Thr 

Ile Leu

Thr Ser 

Arg Leu 
190 
Lys Asp 
205 

Asn Arg 

Leu Ser 

Leu Gln

Tyr Tyr 
270 



Tyr Ser 
15 

Leu Lys 

Asn Gly 

Ala Phe 

Ser Val 

80 
Ile Phe
95 

Cys Tyr 

Gly Val 

Leu Asp 

Gln Ser
160 
Glu Val 
175 

Ser Thr 

Ser Asp 

Tyr Lys 

Cys Lys 
240 
Tyr Met 
255 

Cys Cys 



<210> 452 

<211> 1776 

<212> DNA 

<213> Homo sapiens 
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<400> 452 

ctctatggtt 

ctccctccgg 

tgacggtggc 

gcggttcctc 

gtccagaccc

atccctcggg 

aggatgatca 

tgaaagctat 

taacagtgga 

aggagtttaa 

actgtttatc 

gttaccaagg

tctgcaaaat 

atgttattaa 

atatgacgag 

ctacttttgg 

aagcatttca 

ctacaaaggc 

tttcattaca 

gctgccctga 

atgtgtacat 

tattggattt 

taatagttac 

gtagtcatag 

tgtgagtcag 

agcactttgg 

accaacgtgg 

acatgcctgt 

ggcggaggtt 

aaactctgcc 



cggcggctct 
tctcggtggc 
cgaggtggag 
ggagccgcct 
caccgcaaag 
gccgaatgcg 
gtacagcctt 
tcatttccga 
aaatgcaaag 
agttcaggaa 
tatttttgga 
ttatggttac 
caatacacag 
taaaattatt 
tgaagtccta 
aaatgcagga 
ttgtaatcag 
attagtccta 
gtatatgatt 
tgaagaagtt 
ttatgataga 
tctatagaga 
tatgttttgt 
attattcagg 
cgtaacattt 
gaacccgagg 
agaaacccca 
aatgccagct 
gtggtgagct 
tcaaaaaaaa 



aagcgctcag 
gcggcacgcg 
ggccggtctg 
cgctcctctt 
gtgtttggcg 
cagtggacga 
gtggccagcc 
gaacatgcca 
tgtgtgcaag 
gagtctgtta 
tcaagtccta 
cctttgatgc 
gaacctgagg 
ctgcagtcag 
caaattacca 
agttcccacc 
acccaagtca 
tcttgtaagg 
agaaatgaag 
cctgaatctg 
tgaagttctt 
agaagcacaa 
aaatccattt 
actgtccttt 
caagagtcta 
tgggcggatc 
tctctactaa 
actcgggagg 
gaaatgacgc 
aaaaaaaaaa 



ctccgcggtc 
cggctctcta 
aagagtggcg 
cagggacttt 
atccgccgag 
tgccccttct 
ttgacaacgt 
cgtgtttcgc 
caaatgcttt 
cttttcgaat 
tgccagggac 
tgttcctgga 
agaccctgga 
aggggctccg 
tgtctcctga 
ttgactatcc 
acagatacaa 
tatctattcg 
atggacaaat 
agtcttgagt 
attctgagta 

tggggaagat 
tgtagagggc 
agttctgtct 
aaggtggccg 
acctgaggtc 
aaatacaaaa 
ctgaggcagg 
cattgcactc 
aaaaaa 



gtcctccagg 
ggcctccttc 
ggactggctt 
gctgagaagg 
aagttgttgg 
gacccaacag 
taggaatctc 
aactaaaaat 
tattcaggct 
taatttaact 
tttaactgca 
agaaggagga 
ctttgatttc 
tgaagcattt 
caagccttat 
caaagattct
gatttcctta 
gacagataac 
atgttttgtg 
atgacaattc 
cagtactctt 
aggagcaagg 
atgtaaataa 
tttgaactca 
ggtgtggtgg 
aggagttcga 
aattagccgg 
aaaattgctt 
cagcctaggc 



tctgtgccgc 
agctctgtgg 
cacttcctcc
gctctcgggc 
ccccaggagc 
atccaagacg
tccactatct 
ggtatcaaag
ggaatatttc 
gtccttttag 
cttcgaatgt 
gtggtgacag 
tgcagcacca 
tctgaattgg 
ttcaggttat 
gatttgatgg 
ctgaaaccct 
agaggcttcc 
gaatattact 
actgatattt 
tgtcatttca 
tcatgtaccc 
atgttttcct 
tgggaataat 
ctttaatccc 
aaccagcctg 
acgtggtggc 
gaacccggga 
aataagagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1776 



<210> 453 

<211> 838 

<212> PRT 

<213> Homo sapiens 



<400> 453 

Met Ser Leu Gln Val

1 5 
Thr Glu Asn Cys Asp 
20 

Ser Ser Val Leu Arg 
35 

Leu Ala Lys Ala Met 
50 

Gln Thr His Arg Ile
65

Pro Phe Thr Gln Asp
85

Trp Thr Gln Lys Glu
100

Thr Thr His Gly Ile
115 

Leu Gly Asp Ala Ser 
130 

Gly Pro Gly Ala Leu 
145 

Pro Gly Ser Ser Glu 
165 



Leu Asn Asp Lys 
Phe Leu 
Val Ser 



Lys Val 

55 
Leu Ser 
70 

Asp Thr 

Asn Gln

Leu Gln

Pro Ala 
135 
Ala Asp 
150 

Asn Gln



Phe Ser 

25 
Gln Lys
40 

Thr Phe 



Pro Ser 

Leu Gly 

Gln Leu
105 
Lys Pro 
120 

Phe Gly 
Leu Asp 
Met Val 



Asn Val 
10 

Pro Pro 

Glu Asn 

Gln Thr

Met Ala 

75 
Leu Glu 
90 

Ile Lys

Val Glu 

Ser Gly 

Cys Ser 
155 
Ser Pro 
170 



Ser Asn Glu 

Glu Val Thr 
30 

Val Pro Pro 
45 

Pro Leu Arg 
60 

Ser Lys Leu 

Asn Ser His 

Glu Val Asp 
110 

Ala Asp Thr 

125 
Ser Ser Ser 
140 

Ser Ser Ser 
Gly Lys Val 



Lys 

15 

Gly 

Lys 

Asp 

Glu 

Pro 

95 

Ala 

Asp 

Glu 

Gln

Ser 
175 



Asn 

Arg 

Asn 

Pro 

Ala 

80 

Val 

Lys 

Leu 

Ser 

Ser 
160 
Gly
Ser Pro Glu Gln Ala Val Glu Glu Asn Leu Ser Ser Tyr Ser Leu Asp

180 185 190

Arg Arg Val Thr Pro Ala Ser Glu Thr Leu Glu Asp Pro Cys Arg Thr

195 200 205

Glu Ser Gln His Lys Ala Glu Thr Pro His Gly Ala Glu Glu Glu Cys

210 215 220

Lys Ala Glu Thr Pro His Gly Ala Glu Glu Glu Cys Arg His Gly Gly
225 230 235 240

Val Cys Ala Pro Ala Ala Val Ala Thr Ser Pro Pro Gly Ala Ile Pro

245 250 255

Lys Glu Ala Cys Gly Gly Ala Pro Leu Gln Gly Leu Pro Gly Glu Ala

260 265 270

Leu Gly Cys Pro Ala Gly Val Gly Thr Pro Val Pro Ala Asp Gly Thr

275 280 285

Gln Thr Leu Thr Cys Ala His Thr Ser Ala Pro Glu Ser Thr Ala Pro

290 295 300

Thr Asn His Leu Val Ala Gly Arg Ala Met Thr Leu Ser Pro Gln Glu
305 310 315 320

Glu Val Ala Ala Gly Gln Met Ala Ser Ser Ser Arg Ser Gly Pro Val

325 330 335

Lys Leu Glu Phe Asp Val Ser Asp Gly Ala Thr Ser Lys Arg Ala Pro

340 345 350

Pro Pro Arg Arg Leu Gly Glu Arg Ser Gly Leu Lys Pro Pro Leu Arg
355 360 365

Lys Ala Ala Val Arg Gln Gln Lys Ala Pro Gln Glu Val Glu Glu Asp
370 375 380

Asp Gly Arg Ser Gly Ala Gly Glu Asp Pro Pro Met Pro Ala Ser Arg
385 390 395 400

Gly Ser Tyr His Leu Asp Trp Asp Lys Met Asp Asp Pro Asn Phe Ile

405 410 415

Pro Phe Gly Gly Asp Thr Lys Ser Gly Cys Ser Glu Ala Gln Pro Pro

420 425 430

Glu Ser Pro Glu Thr Arg Leu Gly Gln Pro Ala Ala Glu Gln Leu His

435 440 445

Ala Gly Pro Ala Thr Glu Glu Pro Gly Pro Cys Leu Ser Gln Gln Leu

450 455 460

His Ser Ala Ser Ala Glu Asp Thr Pro Val Val Gln Leu Ala Ala Glu
465 470 475 480

Thr Pro Thr Ala Glu Ser Lys Glu Arg Ala Leu Asn Ser Ala Ser Thr 

485 490 495 

Ser Leu Pro Thr Ser Cys Pro Gly Ser Glu Pro Val Pro Thr His Gin 

500 505 510 

Gin Gly Gin Pro Ala Leu Glu Leu Lys Glu Glu Ser Phe Arg Asp Pro 

515 520 525 

Ala Glu Val Leu Gly Thr Gly Ala Glu Val Asp Tyr Leu Glu Gin Phe 

530 535 540 

Gly Thr Ser Ser Phe Lys Glu Ser Ala Leu Arg Lys Gin Ser Leu Tyr 
545 550 555 560 

Leu Lys Phe Asp Pro Leu Leu Arg Asp Ser Pro Gly Arg Pro Val Pro 

565 570 "* 575 

Val Ala Thr Glu Thr Ser Ser Met His Gly Ala Asn Glu Thr Pro Ser 

580 585 590 

Gly Arg Pro Arg Glu Ala Lys Leu Val Glu Phe Asp Phe Leu Gly Ala 

595 600 605 

Leu Asp He Pro Val Pro Gly Pro Pro Pro Gly Val Pro Ala Pro Glv 

610 615 620 

Gly Pro Pro Leu Ser Thr Gly Pro lie Val Asp Leu Leu Gin Tyr Ser 
" 5 _ 630 635 640 

Gin Lys Asp Leu Asp Ala Val Val Lys Ala Thr Gin Glu Glu Asn Arg 

64 5 650 655 

Glu Leu Arg Ser Arg Cys Glu Glu Leu His Gly Lys Asn Leu Glu Leu
660 665 670

Gly Lys Ile Met Asp Arg Phe Glu Glu Val Val Tyr Gln Ala Met Glu
675 680 685

Glu Val Gln Lys Gln Lys Glu Leu Ser Lys Ala Glu Ile Gln Lys Val
690 695 700

Leu Lys Glu Lys Asp Gln Leu Thr Thr Asp Leu Asn Ser Met Glu Lys
705 710 715 720

Ser Phe Ser Asp Leu Phe Lys Arg Phe Glu Lys Gln Lys Glu Val Ile
725 730 735

Glu Gly Tyr Arg Lys Asn Glu Glu Ser Leu Lys Lys Cys Val Glu Asp
740 745 750

Tyr Leu Ala Arg Ile Thr Gln Glu Gly Gln Arg Tyr Gln Ala Leu Lys
755 760 765

Ala His Ala Glu Glu Lys Leu Gln Leu Ala Asn Glu Glu Ile Ala Gln
770 775 780

Val Arg Ser Lys Ala Gln Ala Glu Ala Leu Ala Leu Gln Ala Ser Leu
785 790 795 800

Arg Lys Glu Gln Met Arg Ile Gln Ser Leu Glu Lys Thr Val Glu Gln
805 810 815

Lys Thr Lys Glu Asn Glu Glu Leu Thr Arg Ile Cys Asp Asp Leu Ile
820 825 830

Ser Lys Met Glu Lys Ile
835







<210> 454 

<211> 2788 

<212> DNA 

<213> Homo sapiens 

<400> 454 

ggcggcggta gcagccaggc ttggcccccg gcgtggagca gacgcggacc cctccttcct 60
ggcggcggcg gcgcgggctc agagcccggc aacgggcggg cgggcagaat gagtctgcag 120
gtcttaaacg acaaaaatgt cagcaatgaa aaaaatacag aaaattgcga cttcctgttt 180
tcgccaccag aagttaccgg aagatcgtct gttcttcgtg tgtcacagaa agaaaatgtg 240
ccacccaaga acctggccaa agctatgaag gtgacttttc agacacctct gcgggatcca 300
cagacgcaca ggattctaag tcctagcatg gccagcaaac ttgaggctcc tttcactcag 360
gatgacaccc ttggactgga aaactcacac ccggtctgga cacagaaaga gaaccaacag 420
ctcatcaagg aagtggatgc caaaactact catggaattc tacagaaacc agtggaggct 480
gacaccgacc tcctggggga tgcaagccca gcctttggga gtggcagctc cagcgagtct 540
ggcccaggtg ccctggctga cctggactgc tcaagctctt cccagagccc aggaagttct 600
gagaaccaaa tggtgtctcc aggaaaagtg tctggcagcc ctgagcaagc cgtggaggaa 660
aaccttagtt cctattcctt agacagaaga gtgacacccg cctctgagac cctagaagac 720
ccttgcagga cagagtccca gcacaaagcg gagactccgc acggagccga ggaagaatgc 780
aaagcggaga ctccgcacgg agccgaggag gaatgccggc acggtggggt ctgtgctccc 840
gcagcagtgg ccacttcgcc tcctggtgca atccctaagg aagcctgcgg aggagcaccc 900
ctgcagggtc tgcctggcga agccctgggc tgccctgcgg gtgtgggcac ccccgtgcca 960
gcagatggca ctcagaccct tacctgtgca cacacctctg ctcctgagag cacagcccca 1020
accaaccacc tggtggctgg cagggccatg accctgagtc ctcaggaaga agtggctgca 1080
ggccaaatgg ccagctcctc gaggagcgga cctgtaaaac tagaatttga tgtatctgat 1140
ggcgccacca gcaaaagggc acccccacca aggagactgg gagagaggtc cggcctcaag 1200
cctcccttga ggaaagcagc agtgaggcag caaaaggccc cgcaggaggt ggaggaggac 1260
gacggtagga gcggagcagg agaggacccc cccatgccag cttctcgggg ctcttaccac 1320
ctcgactggg acaaaatgga tgacccaaac ttcatcccgt tcggaggtga caccaagtct 1380
ggttgcagtg aggcccagcc cccagaaagc cctgagacca ggctgggcca gccagcggct 1440
gaacagttgc atgctgggcc tgccacggag gagccaggtc cctgtctgag ccagcagctg 1500
cattcagcct cagcggagga cacgcctgtg gtgcagttgg cagccgagac cccaacagca 1560
gagagcaagg agagagcctt gaactctgcc agcacctcgc ttcccacaag ctgtccaggc 1620
agtgagccag tgcccaccca tcagcagggg cagcctgcct tggagctgaa agaggagagc 1680
ttcagagacc ccgctgaggt tctaggcacg ggcgcggagg tggattacct ggagcagttt 1740
ggaacttcct cgtttaagga gtcggccttg aggaagcagt ccttatacct caagttcgac 1800
cccctcctga gggacagtcc tggtagacca gtgcccgtgg ccaccgagac cagcagcatg 1860
cacggtgcaa atgagactcc ctcaggacgt ccgcgggaag ccaagcttgt ggagttcgat 1920
ttcttgggag cactggacat tcctgtgcca ggcccacccc caggtgttcc cgcgcctggg 1980
ggcccacccc tgtccaccgg acctatagtg gacctgctcc agtacagcca gaaggacctg 2040
gatgcagtgg taaaggcgac acaggaggag aaccgggagc tgaggagcag gtgtgaggag 2100
ctccacggga agaacctgga actggggaag atcatggaca ggttcgaaga ggttgtgtac 2160
caggccatgg aggaagttca gaagcagaag gaactttcca aagctgaaat ccagaaagtt 2220
ctaaaagaaa aagaccaact taccacagat ctgaactcca tggagaagtc cttctccgac 2280
ctcttcaagc gttttgagaa acagaaagag gtgatcgagg gctaccgcaa gaacgaagag 2340
tcactgaaga agtgcgtgga ggattacctg gcaaggatca cccaggaggg ccagaggtac 2400
caagccctga aggcccacgc ggaggagaag ctgcagctgg caaacgagga gatcgcccag 2460
gtccggagca aggcccaggc ggaagcgttg gccctccagg ccagcctgag gaaggagcag 2520
atgcgcatcc agtcgctgga gaagacagtg gagcagaaga ctaaagagaa cgaggagctg 2580
accaggatct gcgacgacct catctccaag atggagaaga tctgacctcc acggagccgc 2640
tgtccccgcc cccctgctcc cgtctgtctg tcctgtctga ttctcttagg tgtcatgttc 2700
ttttttctgt cttgtcttca acttttttta aaactagatt gctttgaaaa catgactcaa 2760
taaaagtttc ctttcaattt aaaaaaaa 2788

<210> 455

<211> 720

<212> PRT

<213> Homo sapiens

<400> 455

Met Asp Asp Pro Lys Lys Glu Asp Ile Leu Leu Leu Ala Asp Glu Lys
1 5 10 15

Phe Asp Phe Asp Leu Ser Leu Ser Ser Ser Ser Ala Asn Glu Asp Asp
20 25 30

Glu Val Phe Phe Gly Pro Phe Gly His Lys Glu Arg Cys Ile Ala Ala
35 40 45

Ser Leu Glu Leu Asn Asn Pro Val Pro Glu Gln Pro Pro Leu Pro Thr
50 55 60

Ser Glu Ser Pro Phe Ala Trp Ser Pro Leu Ala Gly Glu Lys Phe Val
65 70 75 80

Glu Val Tyr Lys Glu Ala His Leu Leu Ala Leu His Ile Glu Ser Ser
85 90 95

Ser Arg Asn Gln Ala Ala Gln Ala Ala Lys Pro Glu Asp Pro Arg Ser
100 105 110

Gln Gly Val Glu Arg Phe Ile Gln Glu Ser Lys Leu Lys Ile Asn Leu
115 120 125

Phe Glu Lys Glu Lys Glu Met Lys Lys Ser Pro Thr Ser Leu Lys Arg
130 135 140

Glu Thr Tyr Tyr Leu Ser Asp Ser Pro Leu Leu Gly Pro Pro Val Gly
145 150 155 160

Glu Pro Arg Leu Leu Ala Ser Ser Pro Ala Leu Pro Ser Ser Gly Ala
165 170 175

Gln Ala Arg Leu Thr Arg Ala Pro Gly Pro Pro His Ser Ala His Ala
180 185 190

Leu Pro Arg Glu Ser Cys Thr Ala His Ala Ala Ser Gln Ala Ala Thr
195 200 205

Gln Arg Lys Pro Gly Thr Lys Leu Leu Leu Pro Arg Ala Ala Ser Val
210 215 220

Arg Gly Arg Ser Ile Pro Gly Ala Ala Glu Lys Pro Lys Lys Glu Ile
225 230 235 240

Pro Ala Ser Pro Ser Arg Thr Lys Ile Pro Ala Glu Lys Glu Ser His
245 250 255

Arg Asp Val Leu Pro Asp Lys Pro Ala Pro Gly Ala Val Asn Val Pro
260 265 270

Ala Ala Gly Ser His Leu Gly Gln Gly Lys Arg Ala Ile Pro Val Pro
275 280 285

Asn Lys Leu Gly Leu Lys Lys Thr Leu Leu Lys Ala Pro Gly Ser Thr
290 295 300

Ser Asn Leu Ala Arg Lys Ser Ser Ser Gly Pro Val Trp Ser Gly Ala
305 310 315 320

Ser Ser Ala Cys Thr Ser Pro Ala Val Gly Lys Ala Lys Ser Ser Glu
325 330 335

Phe Ala Ser Ile Pro Ala Asn Ser Ser Arg Pro Leu Ser Asn Ile Ser
340 345 350

Lys Ser Gly Arg Met Gly Pro Ala Met Leu Arg Pro Ala Leu Pro Ala
355 360 365

Gly Pro Val Gly Ala Ser Ser Trp Gln Ala Lys Arg Val Asp Val Ser
370 375 380

Glu Leu Ala Ala Glu Gln Leu Thr Ala Pro Pro Ser Ala Ser Pro Thr
385 390 395 400

Gln Pro Gln Thr Pro Glu Gly Gly Gly Gln Trp Leu Asn Ser Ser Cys
405 410 415

Ala Trp Ser Glu Ser Ser Gln Leu Asn Lys Thr Arg Ser Ile Arg Arg
420 425 430

Arg Asp Ser Cys Leu Asn Ser Lys Thr Lys Val Met Pro Thr Pro Thr
435 440 445

Asn Gln Phe Lys Ile Pro Lys Phe Ser Ile Gly Asp Ser Pro Asp Ser
450 455 460

Ser Thr Pro Lys Leu Ser Arg Ala Gln Arg Pro Gln Ser Cys Thr Ser
465 470 475 480

Val Gly Arg Val Thr Val His Ser Thr Pro Val Arg Arg Ser Ser Gly
485 490 495

Pro Ala Pro Gln Ser Leu Leu Ser Ala Arg Arg Val Ser Ala Leu Pro
500 505 510

Thr Pro Ala Ser Arg Arg Cys Ser Gly Leu Pro Pro Met Thr Pro Lys
515 520 525

Thr Met Pro Arg Ala Val Gly Ser Pro Leu Cys Val Pro Ala Arg Arg
530 535 540

Arg Ser Ser Glu Pro Arg Lys Asn Ser Ala Met Arg Thr Glu Pro Thr
545 550 555 560

Arg Glu Ser Asn Arg Lys Thr Asp Ser Arg Leu Val Asp Val Ser Pro
565 570 575

Asp Arg Gly Ser Pro Pro Ser Arg Val Pro Gln Ala Leu Asn Phe Ser
580 585 590

Pro Glu Glu Ser Asp Ser Thr Phe Ser Lys Ser Thr Ala Thr Glu Val
595 600 605

Ala Arg Glu Glu Ala Lys Pro Gly Gly Asp Ala Ala Pro Ser Glu Ala
610 615 620

Leu Leu Val Asp Ile Lys Leu Glu Pro Leu Ala Val Thr Pro Asp Ala
625 630 635 640

Ala Ser Gln Pro Leu Ile Asp Leu Pro Leu Ile Asp Phe Cys Asp Thr
645 650 655

Pro Glu Ala His Val Ala Val Gly Ser Glu Ser Arg Pro Leu Ile Asp
660 665 670

Leu Met Thr Asn Thr Pro Asp Met Asn Lys Asn Val Ala Lys Pro Ser
675 680 685

Pro Val Val Gly Gln Leu Ile Asp Leu Ser Ser Pro Leu Ile Gln Leu
690 695 700

Ser Pro Glu Ala Asp Lys Glu Asn Val Asp Ser Pro Leu Leu Lys Phe
705 710 715 720



<210> 456 
<211> 2917 
<212> DNA 

<213> Homo sapiens 
<400> 456
ggcacgaggg cggcggccgc gatgagcctt cagcctgccg ggcaggggac gtgaacatgg 60
atgaccctaa gaaggaagac attcttcttt tggccgatga aaaatttgac ttcgatcttt 120 
cattgtcttc ttcgagtgca aatgaagatg atgaagtctt cttcggaccc tttggacata 180 
aagaaagatg tattgctgcc agcttggaat taaataatcc ggttcccgaa cagcctccqt 240 
S^r» CatC tgagagtCCC tttgcctgga gccctctggc cggggagaag ttcgtggagg 300 
tgtacaaaga agctcactta ctggctttac acattgagag cagcagccgg aaccaggca 360 
cccaagctgc caagcctgaa gaccctcgga gccagggcgt ggaaagattc atacaggagt 420 
caaaattaaa aataaacctc tttgagaaag aaaaggaaat gaagaaaagc cccacgtctc 480 
ttaaaaggga gacatactac ctgtcagaca gccccttgct ggggccccct gtgggtgagc 540 
ctcggctctt ggcctcctcc ccggccctgc ccagctctgg tgcccaggcc cgcctcaccc 600 
Itntlt^" 9CCtCCgCac tctgctcatg ctttgcccag ggaatcatgc actgctcatg 660 
ctgcaagtca ggcagcgact cagaggaagc ccgggaccaa attgctgctg cctcgagcgg 720 
^^' gt ^ 9 a 99 aa 9 aa 9 c atccctgggg ctgcggagaa gcccaagaaa gagattccag 780 
ctagtccttc caggacaaaa atcccagctg agaaggaatc ccaccgggat gttctccctg 840 
acaaacctgc cccgggtgct gtcaatgtgc cggccgccgg aagccacttg ggccagggca 900 
agcgggcgat ccctgttcca aacaagttgg ggctgaagaa gaccctgtta aaagcacccg 960 
IroaltnTJ " atCtCgca ^gaagtcct cctcggggcc tgtttggagc ggggcatcca 1020 
gtgcgtgcac atccccagca gtgggcaaag ctaaatcaag tgaatttgca agtattcctg 1080 
caaatagctc ccggcctctg tcaaacatca gcaagtcagg cagaatggga cccgccatgc 1140 
a?a?ttctL C tC , tgCCtgCa «ccctgtgg gggcatcctc ctggcaggcc aagcgggt^g 1200
atgtttctga gctggcagcg gagcagctca cggcaccccc ctcagcatcc cccacccaac 1260 
ggaaggtggc ggccagtggc tgaactccag ttgcgcttgg tcagaatctt 1320 
ctcaattgaa taagactaga agtatcagac ggcgagattc ctgtctaaat tccaagacaa 1380 
aggttatgcc tactcctaca aatcaattta aaattcctaa gttttctatt ggtgactccc 1440 
cggacagctc aacaccaaag ctttcgcggg cacagcggcc gcagtcgtgc acgtcagttg 1500 
gcagggtcac tgtccacagc accccggtta gacgctcatc tgggccagca ccacaaagcc 1560 
l g trln,n 9 J aCggcgtgtg tcagccttgc ccacacccgc cagccggcgc tgctctggcc 1620 
ttccaccgat gacccccaaa acgatgccca gggccgtggg ctctcccctg tgtgtgccag 1680 
aoa^r 09 ttCCtCtgag «=cccgcaaga actctgcaat gagaactgaa ccaacaaggg 1740 
agagcaacag aaagacagat tccaggctgg tggatgtgtc ccctgacagg ggttctcctc 1800 
cttcccgtgt gcctcaggca cttaactttt ctccagagga aagcgattc? act^tctcca 1860 
atoaoacJ^ ^acagaagta gctcgggagg aagccaagcc gggtggagat gcagccccta 1920 
llltat^t tC " gtagat ^caaactgg aaccactcgc ggtcactcca gatgctgcaa 1980 
? a C a C a C cattgacctt cctctcatcg acttctgcga taccccagaa gcacacgtgg 2040 
till? 99 tgaaagcagg cctctgatcg acctcatgac aaacactcca gacatgaata 2100 
aaaatgtggc caaaccttca ccggtggtgg gacagctcat agacctgagc tcccctctga 2160 
tccagctgag ccctgaggct gacaaggaga acgtggattc cccactcctc aagttctaaq 2220 
ccgaaccaaa tcctttgcct tgaaagaaca gccctaaagt ggttttcaac cctcagaaac 2280 
aagctttagg ctggtcgcag tggcttacac ttgtaaccct agaacttggg aggctgaggt 2340 
gggcggatta cttgagccca ggagttcggg accagcctgg gaaatatag? ga'aactcc? 2400 
tccctacaaa aagtacaaaa attagccggg tgtggtagtg catgcctgta gtcccagcta 2460
^" 99ga . 99C tgaa g fc ggg a ggatggcctg agctcaagga gatgcaggct gcagtgggct 2520 
gtgattgtgc cactgcactc cagcctgggc accaatgtga gaacctgtct tggaaaaaaa 2580 
tatted aa " tgtttt -gtagaagtt ttatttgaaa aagaaaaata agcataaata 2640 
tttt C °" gt g ct g9 a gagg gtgggctgag ggactggggc cagcacggac cacccaaggc 2700 
S ^ ccgcca ^ ctcctcgctg ccattctctg ggctggaatg tgaagectea 2760 
= t tg l a9 Tl ttCttttgaa tgttttgtat gtaaaatagc aagtggctat 2820 
aagtttgtat aaatagttag atattctaga tttacattaa attgtaaaat 2880 
aaatggactt attgaagcat aaaaaaaaaa aaaaaaa 2917 

<210> 457 

<211> 389 

<212> PRT 

<213> Homo sapiens 

<400> 457 

Met Leu Ser Leu Arg Val Pro Leu Ala Pro Ile Thr Asp Pro Gln Gln
5 10 15

Leu Gln Leu Ser Pro Leu Lys Gly Leu Ser Leu Val Asp Lys Glu Asn
20 25 30

Thr Pro Pro Ala Leu Ser Gly Thr Arg Val Leu Ala Ser Lys Thr Ala
35 40 45

Arg Arg Ile Phe Gln Glu Pro Thr Glu Pro Lys Thr Lys Ala Ala Ala
50 55 60

Pro Gly Val Glu Asp Glu Pro Leu Leu Arg Glu Asn Pro Arg Arg Phe
65 70 75 80

Val Ile Phe Pro Ile Glu Tyr His Asp Ile Trp Gln Met Tyr Lys Lys
85 90 95

Ala Glu Ala Ser Phe Trp Thr Ala Glu Glu Val Asp Leu Ser Lys Asp
100 105 110

Ile Gln His Trp Glu Ser Leu Lys Pro Glu Glu Arg Tyr Phe Ile Ser
115 120 125

His Val Leu Ala Phe Phe Ala Ala Ser Asp Gly Ile Val Asn Glu Asn
130 135 140

Leu Val Glu Arg Phe Ser Gln Glu Val Gln Ile Thr Glu Ala Arg Cys
145 150 155 160

Phe Tyr Gly Phe Gln Ile Ala Met Glu Asn Ile His Ser Glu Met Tyr
165 170 175

Ser Leu Leu Ile Asp Thr Tyr Ile Lys Asp Pro Lys Glu Arg Glu Phe
180 185 190

Leu Phe Asn Ala Ile Glu Thr Met Pro Cys Val Lys Lys Lys Ala Asp
195 200 205

Trp Ala Leu Arg Trp Ile Gly Asp Lys Glu Ala Thr Tyr Gly Glu Arg
210 215 220

Val Val Ala Phe Ala Ala Val Glu Gly Ile Phe Phe Ser Gly Ser Phe
225 230 235 240

Ala Ser Ile Phe Trp Leu Lys Lys Arg Gly Leu Met Pro Gly Leu Thr
245 250 255

Phe Ser Asn Glu Leu Ile Ser Arg Asp Glu Gly Leu His Cys Asp Phe
260 265 270

Ala Cys Leu Met Phe Lys His Leu Val His Lys Pro Ser Glu Glu Arg
275 280 285

Val Arg Glu Ile Ile Ile Asn Ala Val Arg Ile Glu Gln Glu Phe Leu
290 295 300

Thr Glu Ala Leu Pro Val Lys Leu Ile Gly Met Asn Cys Thr Leu Met
305 310 315 320

Lys Gln Tyr Ile Glu Phe Val Ala Asp Arg Leu Met Leu Glu Leu Gly
325 330 335

Phe Ser Lys Val Phe Arg Val Glu Asn Pro Phe Asp Phe Met Glu Asn
340 345 350

Ile Ser Leu Glu Gly Lys Thr Asn Phe Phe Glu Lys Arg Val Gly Glu
355 360 365

Tyr Gln Arg Met Gly Val Met Ser Ser Pro Thr Glu Asn Ser Phe Thr
370 375 380

Leu Asp Ala Asp Phe
385



<210> 458 

<211> 2500 

<212> DNA 

<213> Homo sapiens 



<400> 458 

cccaggcgca gccaatggga agggtcggag gcatggcaca gccaatggga agggccgggg 60
caccaaagcc aatgggaagg gccgggagcg cgcggcgcgg gagatttaaa ggctgctgga 120
gtgaggggtc gcccgtgcac cctgtcccag ccgtcctgtc ctggctgctc gctctgcttc 180
gctgcgcctc cactatgctc tccctccgtg tcccgctcgc gcccatcacg gacccgcagc 240
agctgcagct ctcgccgctg aaggggctca gcttggtcga caaggagaac acgccgccgg 300
ccctgagcgg gacccgcgtc ctggccagca agaccgcgag gaggatcttc caggagccca 360
cggagccgaa aactaaagca gctgcccccg gcgtggagga tgagccgctg ctgagagaaa 420
acccccgccg ctttgtcatc ttccccatcg agtaccatga tatctggcag atgtataaga 480
aggcagaggc ttccttttgg accgccgagg aggttgacct ctccaaggac attcagcact 540
gggaatccct gaaacccgag gagagatatt ttatatccca tgttctggct ttctttgcag 600 
caagcgatgg catagtaaat gaaaacttgg tggagcgatt tagccaagaa gttcagatta 660 
cagaagcccg ctgtttctat ggcttccaaa ttgccatgga aaacatacat tctgaaatgt 720 
atagtcttct tattgacact tacataaaag atcccaaaga aagggaattt ctcttcaatg 780 
ccattgaaac gatgccttgt gtcaagaaga aggcagactg ggccttgcgc tggattgggg 840 
acaaagaggc tacctatggt gaacgtgttg tagcctttgc tgcagtggaa ggcattttct 900 
tttccggttc ttttgcgtcg atattctggc tcaagaaacg aggactgatg cctggcctca 960 
cattttctaa tgaacttatt agcagagatg agggtttaca ctgtgatttt gcttgcctga 1020 
tgttcaaaca cctggtacac aaaccatcgg aggagagagt aagagaaata attatcaatg 1080 
ctgttcggat agaacaggag ttcctcactg aggccttgcc tgtgaagctc attgggatga 1140 
attgcactct aatgaagcaa tacattgagt ttgtggcaga cagacttatg ctggaactgg 1200 
gttttagcaa ggttttcaga gtagagaacc catttgactt tatggagaat atttcactgg 1260 
aaggaaagac taacttcttt gagaagagag taggcgagta tcagaggatg ggagtgatgt 1320 
caagtccaac agagaattct tttaccttgg atgctgactt ctaaatgaac tgaagatgtg 1380 
cccttacttg gctgattttt tttttccatc tcataagaaa aatcagctga agtgttacca 1440 
actagccaca ccatgaattg tccgtaatgt tcattaacag catctttaaa actgtgtagc 1500 
tacctcacaa ccagtcctgt ctgtttatag tgctggtagt atcacctttt gccagaaggc 1560
ctggctggct gtgacttacc atagcagtga caatggcagt cttggcttta aagtgagggg 1620 
tgacccttta gtgagcttag cacagcggga ttaaacagtc ctttaaccag cacagccagt 1680 
taaaagatgc agcctcactg cttcaacgca gattttaatg tttacttaaa tataaacctg 1740 
gcactttaca aacaaataaa cattgttttg tactcacggc ggcgataata gcttgattta 1800 
tttggtttct acaccaaata cattctcctg accactaatg ggagccaatt cacaattcac 1860 
taagtgacta aagtaagtta aacttgtgta gactaagcat gtaattttta agttttattt 1920 
taatgaatta aaatatttgt taaccaactt taaagtcagt cctgtgtata cctagatatt 1980 
agtcagttgg tgccagatag aagacaggtt gtgtttttat cctgtggctt gtgtagtgtc 2040 
ctgggattct ctgccccctc tgagtagagt gttgtgggat aaaggaatct ctcagggcaa 2100
ggagcttctt aagttaaatc actagaaatt taggggtgat ctgggccttc atatgtgtga 2160 
gaagccgttt cattttattt ctcactgtat tttcctcaac gtctggttga tgagaaaaaa 2220 
ttcttgaaga gttttcatat gtgggagcta aggtagtatt gtaaaatttc aagtcatcct 2280 
taaacaaaat gatccaccta agatcttgcc cctgttaagt ggtgaaatca actagaggtg 2340 
9 "r! a f! a ^tgttcatt ctagttttgt ttggtgtaag taggttgtgt gagttaattc 2400 
atttatattt actatgtctg ttaaatcaga aattttttat tatctatgtt cttctagatt 2460 
ttacctgtag ttcataaaaa aaaaaaaaaa aaaaaaaaaa 2500



<210> 459 
<211> 890 
<212> PRT 

<213> Homo sapiens 
<400> 459 

Met Ser Gln Gly Ile Leu Ser Pro Pro Ala Gly Leu Leu Ser Asp Asp
1 5 10 15

Asp Val Val Val Ser Pro Met Phe Glu Ser Thr Ala Ala Asp Leu Gly
20 25 30

Ser Val Val Arg Lys Asn Leu Leu Ser Asp Cys Ser Val Val Ser Thr
35 40 45

Ser Leu Glu Asp Lys Gln Gln Val Pro Ser Glu Asp Ser Met Glu Lys
50 55 60

Val Lys Val Tyr Leu Arg Val Arg Pro Leu Leu Pro Ser Glu Leu Glu
65 70 75 80

Arg Gln Glu Asp Gln Gly Cys Val Arg Ile Glu Asn Val Glu Thr Leu
85 90 95

Val Leu Gln Ala Pro Lys Asp Ser Phe Ala Leu Lys Ser Asn Glu Arg
100 105 110

Gly Ile Gly Gln Ala Thr His Arg Phe Thr Phe Ser Gln Ile Phe Gly
115 120 125

Pro Glu Val Gly Gln Ala Ser Phe Phe Asn Leu Thr Val Lys Glu Met
130 135 140

Val Lys Asp Val Leu Lys Gly Gln Asn Trp Leu Ile Tyr Thr Tyr Gly
145 150 155 160

Val Thr Asn Ser Gly Lys Thr His Thr Ile Gln Gly Thr Ile Lys Asp
165 170 175

Gly Gly Ile Leu Pro Arg Ser Leu Ala Leu Ile Phe Asn Ser Leu Gln
180 185 190

Gly Gln Leu His Pro Thr Pro Asp Leu Lys Pro Leu Leu Ser Asn Glu
195 200 205

Val Ile Trp Leu Asp Ser Lys Gln Ile Arg Gln Glu Glu Met Lys Lys
210 215 220

Leu Ser Leu Leu Asn Gly Gly Leu Gln Glu Glu Glu Leu Ser Thr Ser
225 230 235 240

Leu Lys Arg Ser Val Tyr Ile Glu Ser Arg Ile Gly Thr Ser Thr Ser
245 250 255

Phe Asp Ser Gly Ile Ala Gly Leu Ser Ser Ile Ser Gln Cys Thr Ser
260 265 270

Ser Ser Gln Leu Asp Glu Thr Ser His Arg Trp Ala Gln Pro Asp Thr
275 280 285

Ala Pro Leu Pro Val Pro Ala Asn Ile Arg Phe Ser Ile Trp Ile Ser
290 295 300

Phe Phe Glu Ile Tyr Asn Glu Leu Leu Tyr Asp Leu Leu Glu Pro Pro
305 310 315 320

Ser Gln Gln Arg Lys Arg Gln Thr Leu Arg Leu Cys Glu Asp Gln Asn
325 330 335

Gly Asn Pro Tyr Val Lys Asp Leu Asn Trp Ile His Val Gln Asp Ala
340 345 350

Glu Glu Ala Trp Lys Leu Leu Lys Val Gly Arg Lys Asn Gln Ser Phe
355 360 365

Ala Ser Thr His Leu Asn Gln Asn Ser Ser Arg Ser His Ser Ile Phe
370 375 380

Ser Ile Arg Ile Leu His Leu Gln Gly Glu Gly Asp Ile Val Pro Lys
385 390 395 400

Ile Ser Glu Leu Ser Leu Cys Asp Leu Ala Gly Ser Glu Arg Cys Lys
405 410 415

Asp Gln Lys Ser Gly Glu Arg Leu Lys Glu Ala Gly Asn Ile Asn Thr
420 425 430

Ser Leu His Thr Leu Gly Arg Cys Ile Ala Ala Leu Arg Gln Asn Gln
435 440 445

Gln Asn Arg Ser Lys Gln Asn Leu Val Pro Phe Arg Asp Ser Lys Leu
450 455 460

Thr Arg Val Phe Gln Gly Phe Phe Thr Gly Arg Gly Arg Ser Cys Met
465 470 475 480

Ile Val Asn Val Asn Pro Cys Ala Ser Thr Tyr Asp Glu Thr Leu His
485 490 495

Val Ala Lys Phe Ser Ala Ile Ala Ser Gln Leu Val His Ala Pro Pro
500 505 510

Met Gln Leu Gly Phe Pro Ser Leu His Ser Phe Ile Lys Glu His Ser
515 520 525

Leu Gln Val Ser Pro Ser Leu Glu Lys Gly Ala Lys Ala Asp Thr Gly
530 535 540

Leu Asp Asp Asp Ile Glu Asn Glu Ala Asp Ile Ser Met Tyr Gly Lys
545 550 555 560

Glu Glu Leu Leu Gln Val Val Glu Ala Met Lys Thr Leu Leu Leu Lys
565 570 575

Glu Arg Gln Glu Lys Leu Gln Leu Glu Met His Leu Arg Asp Glu Ile
580 585 590

Cys Asn Glu Met Val Glu Gln Met Gln Gln Arg Glu Gln Trp Cys Ser
595 600 605

Glu His Leu Asp Thr Gln Lys Glu Leu Leu Glu Glu Met Tyr Glu Glu
610 615 620

Lys Leu Asn Ile Leu Lys Glu Ser Leu Thr Ser Phe Tyr Gln Glu Glu
625 630 635 640

Ile Gln Glu Arg Asp Glu Lys Ile Glu Glu Leu Glu Ala Leu Leu Gln
645 650 655

Glu Ala Arg Gln Gln Ser Val Ala His Gln Gln Ser Gly Ser Glu Leu
660 665 670

Ala Leu Arg Arg Ser Gln Arg Leu Ala Ala Ser Ala Ser Thr Gln Gln
675 680 685

Leu Gln Glu Val Lys Ala Lys Leu Gln Gln Cys Lys Ala Glu Leu Asn
690 695 700

Ser Thr Thr Glu Glu Leu His Lys Tyr Gln Lys Met Leu Glu Pro Pro
705 710 715 720

Pro Ser Ala Lys Pro Phe Thr Ile Asp Val Asp Lys Lys Leu Glu Glu
725 730 735

Gly Gln Lys Asn Ile Arg Leu Leu Arg Thr Glu Leu Gln Lys Leu Gly
740 745 750

Glu Ser Leu Gln Ser Ala Glu Arg Ala Cys Cys His Ser Thr Gly Ala
755 760 765

Gly Lys Leu Arg Gln Ala Leu Thr Thr Cys Asp Asp Ile Leu Ile Lys
770 775 780

Gln Asp Gln Thr Leu Ala Glu Leu Gln Asn Asn Met Val Leu Val Lys
785 790 795 800

Leu Asp Leu Arg Lys Lys Ala Ala Cys Ile Ala Glu Gln Tyr His Thr
805 810 815

Val Leu Lys Leu Gln Gly Gln Val Ser Ala Lys Lys Arg Leu Gly Thr
820 825 830

Asn Gln Glu Asn Gln Gln Pro Asn Gln Gln Pro Pro Gly Lys Lys Pro
835 840 845

Phe Leu Arg Asn Leu Leu Pro Arg Thr Pro Thr Cys Gln Ser Ser Thr
850 855 860

Asp Cys Ser Pro Tyr Ala Arg Ile Leu Arg Ser Arg Arg Ser Pro Leu
865 870 875 880

Leu Lys Ser Gly Pro Phe Gly Lys Lys Tyr
885 890



<210> 460 
<211> 2972 
<212> DNA 

<213> Homo sapiens 
<400> 460 

tcttcggacc taggctgccc tgccgtcatg tcgcaaggga tcctttctcc gccagcgggc 60 
ttgctgtccg atgacgatgt cgtagtttct cccatgtttg agtccacagc tgcagatttg 120 
gggtctgtgg tacgcaagaa cctgctatca gactgctctg tcgtctctac ctccctagag 180 
gacaagcagc aggttccatc tgaggacagt atggagaagg tgaaagtata cttgagggtt 240 
aggcccttgt taccttcaga gttggaacga caggaagatc agggttgtgt ccgtattgag 300 
aatgtggaga cccttgttct acaagcaccc aaggactcgt ttgccctgaa gagcaatgaa 360 
cggggaattg gccaagccac acacaggttc accttttccc agatctttgg gccagaagtg 420 
ggacaggcat ccttcttcaa cctaactgtg aaggagatgg taaaggatgt actcaaaggg 480 
cagaactggc tcatctatac atatggagtc actaactcag ggaaaaccca cacgattcaa 540 
ggtaccatca aggatggagg gattctcccc cggtccctgg cgctgatctt caatagcctc 600 
caaggccaac ttcatccaac acctgatctg aagcccttgc tctccaatga ggtaatctgg 660 
ctagacagca agcagatccg acaggaggaa atgaagaagc tgtccctgct aaatggaggc 720 
ctccaagagg aggagctgtc cacttccttg aagaggagtg tctacatcga aagtcggata 780 
ggtaccagca ccagcttcga cagtggcatt gctgggctct cttctatcag tcagtgtacc 840 
agcagtagcc agctggatga aacaagtcat cgatgggcac agccagacac tgccccacta 900 
cctgtcccgg caaacattcg cttctccatc tggatctcat tctttgagat ctacaacgaa 960 
ctgctttatg acctattaga accgcctagc caacagcgca agaggcagac tttgcggcta 1020 
tgcgaggatc aaaatggcaa tccctatgtg aaagatctca actggattca tgtgcaagat 1080 
gctgaggagg cctggaagct cctaaaagtg ggtcgtaaga accagagctt tgccagcacc 1140 
cacctcaacc agaactccag ccgcagtcac agcatcttct caatcaggat cctacacctt 1200 
cagggggaag gagatatagt ccccaagatc agcgagctgt cactctgtga tctggctggc 1260 
tcagagcgct gcaaagatca gaagagtggt gaacggttga aggaagcagg aaacattaac 1320
acctctctac acaccctggg ccgctgtatt gctgcccttc gtcaaaacca gcagaaccgg 1380
tcaaagcaga acctggttcc cttccgtgac agcaagttga ctcgagtgtt ccaaggtttc 1440 
ttcacaggcc gaggccgttc ctgcatgatt gtcaatgtga atccctgtgc atctacctat 1500 
gatgaaactc ttcatgtggc caagttctca gccattgcta gccagcttgt gcatgcccca 1560 
cctatgcaac tgggattccc atccctgcac tcgttcatca aggaacatag tcttcaggta 1620 
tcccccagct tagagaaagg ggctaaggca gacacaggcc ttgatgatga tattgaaaat 1680 
gaagctgaca tctccatgta tggcaaagag gagctcctac aagttgtgga agccatgaag 1740 
acactgcttt tgaaggaacg acaggaaaag ctacagctgg agatgcatct ccgagatgaa 1800 
atttgcaatg agatggtaga acagatgcaa cagcgggaac agtggtgcag tgaacatttg 1860 
gacacccaaa aggaactatt ggaggaaatg tatgaagaaa aactaaatat cctcaaggag 1920 
tcactgacaa gtttttacca agaagagatt caggagcggg atgaaaagat tgaagagcta 1980 
gaagctctct tgcaggaagc cagacaacag tcagtggccc atcagcaatc agggtctgaa 2040 
ttggccctac ggcggtcaca aaggttggca gcttctgcct ccacccagca gcttcaggag 2100 
gttaaagcta aattacagca gtgcaaagca gagctaaact ctaccactga agagttgcat 2160 
aagtatcaga aaatgttaga accaccaccc tcagccaagc ccttcaccat tgatgtggac 2220 
aagaagttag aagagggcca gaagaatata aggctgttgc ggacagagct tcagaaactt 2280 
ggtgagtctc tccaatcagc agagagagct tgttgccaca gcactggggc aggaaaactt 2340 
cgtcaagcct tgaccacttg tgatgacatc ttaatcaaac aggaccagac tctggctgaa 2400 
ctgcagaaca acatggtgct agtgaaactg gaccttcgga agaaggcagc atgtattgct 2460
gagcagtatc atactgtgtt gaaactccaa ggccaggttt ctgccaaaaa gcgccttggt 2520 
accaaccagg aaaatcagca accaaaccaa caaccaccag ggaagaaacc attccttcga 2580 
aatttacttc cccgaacacc aacctgccaa agctcaacag actgcagccc ttatgcccgg 2640 
atcctacgct cacggcgttc ccctttactc aaatctgggc cttttggcaa aaagtactaa 2700 
ggctgtgggg aaagagaaga gcagtcatgg ccctgaggtg ggtcagctac tctcctgaag 2760 
aaataggtct cttttatgct ttaccatata tcaggaatta tatccaggat gcaatactca 2820 
gacactagct tttttctcac ttttgtatta taaccaccta tgtaatctca tgttgttgtt 2880 
tttttttatt tacttatatg atttctatgc acacaaaaac agttatatta aagatattat 2940 
tgttcacatt ttttattgaa aaaaaaaaaa aa 2972 

<210> 461 
<211> 719 
<212> PRT 

<213> Homo sapiens 



<400> 461 



Met Ala Leu Lys Asp Tyr Ala Leu Glu Lys Glu Lys Val Lys Lys Phe
1 5 10 15

Leu Gln Glu Phe Tyr Gln Asp Asp Glu Leu Gly Lys Lys Gln Phe Lys
20 25 30

Tyr Gly Asn Gln Leu Val Arg Leu Ala His Arg Glu Gln Val Ala Leu
35 40 45

Tyr Val Asp Leu Asp Asp Val Ala Glu Asp Asp Pro Glu Leu Val Asp
50 55 60

Ser Ile Cys Glu Asn Ala Arg Arg Tyr Ala Lys Leu Phe Ala Asp Ala
65 70 75 80

Val Gln Glu Leu Leu Pro Gln Tyr Lys Glu Arg Glu Val Val Asn Lys
85 90 95

Asp Val Leu Asp Val Tyr Ile Glu His Arg Leu Met Met Glu Gln Arg
100 105 110

Ser Arg Asp Pro Gly Met Val Arg Ser Pro Gln Asn Gln Tyr Pro Ala
115 120 125

Glu Leu Met Arg Arg Phe Glu Leu Tyr Phe Gln Gly Pro Ser Ser Asn
130 135 140

Lys Pro Arg Val Ile Arg Glu Val Arg Ala Asp Ser Val Gly Lys Leu
145 150 155 160

Val Thr Val Arg Gly Ile Val Thr Arg Val Ser Glu Val Lys Pro Lys
165 170 175

Met Val Val Ala Thr Tyr Thr Cys Asp Gln Cys Gly Ala Glu Thr Tyr
180 185 190

Gln Pro Ile Gln Ser Pro Thr Phe Met Pro Leu Ile Met Cys Pro Ser
195 200 205
Gln Glu Cys Gln Thr Asn Arg Ser Gly Gly Arg Leu Tyr Leu Gln Thr
210 215 220

Arg Gly Ser Arg Phe Ile Lys Phe Gln Glu Met Lys Met Gln Glu His
225 230 235 240

Ser Asp Gln Val Pro Val Gly Asn Ile Pro Arg Ser Ile Thr Val Leu
245 250 255

Val Glu Gly Glu Asn Thr Arg Ile Ala Gln Pro Gly Asp His Val Ser
260 265 270

Val Thr Gly Ile Phe Leu Pro Ile Leu Arg Thr Gly Phe Arg Gln Val
275 280 285

Val Gln Gly Leu Leu Ser Glu Thr Tyr Leu Glu Ala His Arg Ile Val
290 295 300

Lys Met Asn Lys Ser Glu Asp Asp Glu Ser Gly Ala Gly Glu Leu Thr
305 310 315 320

Arg Glu Glu Leu Arg Gln Ile Ala Glu Glu Asp Phe Tyr Glu Lys Leu
325 330 335

Ala Ala Ser Ile Ala Pro Glu Ile Tyr Gly His Glu Asp Val Lys Lys
340 345 350

Ala Leu Leu Leu Leu Leu Val Gly Gly Val Asp Gln Ser Pro Arg Gly
355 360 365

Met Lys Ile Arg Gly Asn Ile Asn Ile Cys Leu Met Gly Asp Pro Gly
370 375 380

Val Ala Lys Ser Gln Leu Leu Ser Tyr Ile Asp Arg Leu Ala Pro Arg
385 390 395 400

Ser Gln Tyr Thr Thr Gly Arg Gly Ser Ser Gly Val Gly Leu Thr Ala
405 410 415

Ala Val Leu Arg Asp Ser Val Ser Gly Glu Leu Thr Leu Glu Gly Gly
420 425 430

Ala Leu Val Leu Ala Asp Gln Gly Val Cys Cys Ile Asp Glu Phe Asp
435 440 445

Lys Met Ala Glu Ala Asp Arg Thr Ala Ile His Glu Val Met Glu Gln
450 455 460

Gln Thr Ile Ser Ile Ala Lys Ala Gly Ile Leu Thr Thr Leu Asn Ala
465 470 475 480

Arg Cys Ser Ile Leu Ala Ala Ala Asn Pro Ala Tyr Gly Arg Tyr Asn
485 490 495

Pro Arg Arg Ser Leu Glu Gln Asn Ile Gln Leu Pro Ala Ala Leu Leu
500 505 510

Ser Arg Phe Asp Leu Leu Trp Leu Ile Gln Asp Arg Pro Asp Arg Asp
515 520 525

Asn Asp Leu Arg Leu Ala Gln His Ile Thr Tyr Val His Gln His Ser
530 535 540

Arg Gln Pro Pro Ser Gln Phe Glu Pro Leu Asp Met Lys Leu Met Arg
545 550 555 560

Arg Tyr Ile Ala Met Cys Arg Glu Lys Gln Pro Met Val Pro Glu Ser
565 570 575

Leu Ala Asp Tyr Ile Thr Ala Ala Tyr Val Glu Met Arg Arg Glu Ala
580 585 590

Trp Ala Ser Lys Asp Ala Thr Tyr Thr Ser Ala Arg Thr Leu Leu Ala
595 600 605

Ile Leu Arg Leu Ser Thr Ala Leu Ala Arg Leu Arg Met Val Asp Val
610 615 620

Val Glu Lys Glu Asp Val Asn Glu Ala Ile Arg Leu Met Glu Met Ser
625 630 635 640

Lys Asp Ser Leu Leu Gly Asp Lys Gly Gln Thr Ala Arg Thr Gln Arg
645 650 655

Pro Ala Asp Val Ile Phe Ala Thr Val Arg Glu Leu Val Ser Gly Gly
660 665 670

Arg Ser Val Arg Phe Ser Glu Ala Glu Gln Arg Cys Val Ser Arg Gly
675 680 685

Phe Thr Pro Ala Gln Phe Gln Ala Ala Leu Asp Glu Tyr Glu Glu Leu
690 695 700

Asn Val Trp Gln Val Asn Ala Ser Arg Thr Arg Ile Thr Phe Val
705 710 715

<210> 462
<211> 2821
<212> DNA
<213> Homo sapiens

<400> 462 

cgccccttcc cagccccaag ggtctaggat acagtctttg tagatgagcg ggtccccctt 60 
ggaggacaga atgaagaatt gggaaatcat ggccgttctg gagagtagac aagaagacgg 120 
cgaaagtcgg gcctgccccg ccctgcggcc ccggaacaaa agaacgcgtg tgcgctggcc 180 

ctttaagagc gattctcctc cgcccgcgcc agctcggacc gcgggaaacc cggcgcctgc 240

actaccccgc ccggagattc ccttccgacg cccgcaccgc ctccccgtca ctcattctag 300 
gcccgcacgg tgattggctt gcggctagcg ggaggtgaag aaggccgcct tgtccgattg 360 

gcccgcacgc agtggcgccg gtcacgtggg gggcgacgtt tcgcgccaat ttcggttggc 420

cggccacagt ccaccgcgcg gagattctca gcttccccag gagcaagacc tctgagcccg 480 

ccaagcgcgg ccgcacggcc ctcggcagcg atggcactga aggactacgc gctagagaag 540

gaaaaggtta agaagttctt acaagagttc taccaggatg atgaactcgg gaagaagcag 600 
ttcaagtatg ggaaccagtt ggttcggctg gctcatcggg aacaggtggc tctgtatgtg 660 

gacctggacg acgtagccga ggatgacccc gagttggtgg actcaatttg tgagaatgcc 720

aggcgctacg cgaagctctt tgctgatgcc gtacaagagc tgctgcctca gtacaaggag 780 
agggaagtgg taaataaaga tgtcctggac gtttacattg agcatcggct aatgatggag 840 
cagcggagtc gggaccctgg gatggtccga agcccccaga accagtaccc tgctgaactc 900 
atgcgcagat ttgagctgta ttttcaaggc cctagcagca acaagcctcg tgtgatccgg 960 
gaagtgcggg ctgactctgt ggggaagttg gtaactgtgc gtggaatcgt cactcgtgtc 1020 
tctgaagtca aacccaagat ggtggtggcc acttacactt gtgaccagtg tggggcagag 1080 
acctaccagc cgatccagtc tcccactttc atgcctctga tcatgtgccc aagccaggag 1140 
tgccaaacca accgctcagg agggcggctg tatctgcaga cacggggctc cagattcatc 1200 
aaattccagg agatgaagat gcaagaacat agtgatcagg tgcctgtggg aaatatccct 1260 
cgtagtatca cggtgctggt agaaggagag aacacaagga ttgcccagcc tggagaccac 1320 
gtcagcgtca ctggtatttt cttgccaatc ctgcgcactg ggttccgaca ggtggtacag 1380 
ggtttactct cagaaaccta cctggaagcc catcggattg tgaagatgaa caagagtgag 1440 
gatgatgagt ctggggctgg agagctcacc agggaggagc tgaggcaaat tgcagaggag 1500 
gatttctacg aaaagctggc agcttcaatc gccccagaaa tatacgggca tgaagatgtg 1560 

aagaaggcac tgctgctcct gctagtcggg ggtgtggacc agtctcctcg aggcatgaaa 1620

atccggggca acatcaacat ctgtctgatg ggggatcctg gtgtggccaa gtctcagctc 1680 
ctgtcataca ttgatcgact ggcgcctcgc agccagtaca caacaggccg gggctcctca 1740 
ggagtggggc ttacggcagc tgtgctgaga gactccgtga gtggagaact gaccttagag 1800 
ggtggggccc tggtgctggc tgaccagggt gtgtgctgca ttgatgagtt cgacaagatg 1860 
gctgaggccg accgcacagc catccacgag gtcatggagc agcagaccat ctccattgcc 1920 
aaggccggca ttctcaccac actcaatgcc cgctgctcca tcctggctgc cgccaaccct 1980 
gcctacgggc gctacaaccc tcgccgcagc ctggagcaga acatacagct acctgctgca 2040
ctgctctccc ggtttgacct cctctggctg attcaggacc ggcccgaccg agacaatgac 2100 
ctacggttgg cccagcacat cacctatgtg caccagcaca gccggcagcc cccctcccag 2160 
tttgaacctc tggacatgaa gctcatgagg cgttacatag ccatgtgccg cgagaagcag 2220 
cccatggtgc cagagtctct ggctgactac atcacagcag catacgtgga gatgaggcga 2280 
gaggcttggg ctagtaagga tgccacctat acttctgccc ggaccctgct ggctatcctg 2340 
cgcctttcca ctgctctggc acgtctgaga atggtggatg tggtggagaa agaagatgtg 2400 
aatgaagcca tcaggctaat ggagatgtca aaggactctc ttctaggaga caaggggcag 2460
acagctagga ctcagagacc agcagatgtg atatttgcca ccgtccgtga actggtctca 2520 
gggggccgaa gtgtccggtt ctctgaggca gagcagcgct gtgtatctcg tggcttcaca 2580 
cccgcccagt tccaggcggc tctggatgaa tatgaggagc tcaatgtctg gcaggtcaat 2640
gcttcccgga cacggatcac ttttgtctga ttccagcctg cttgcaaccc tggggtcctc 2700 
ttgttccctg ctggcctgcc ccttgggaag gggcagtgat gcctttgagg ggaaggagga 2760 
gcccctcttt ctcccatgct gcacttactc cttttgctaa taaaagtgtt tgtagattgt 2820 
c 2821 

<210> 463 
<211> 1124

<212> PRT 

<213> Homo sapiens 

<400> 463 

Met Asp Pro Phe Thr Glu Lys Leu Leu Glu Arg Thr Arg Ala Arg Arg
1 5 10 15

Glu Asn Leu Gln Arg Lys Met Ala Glu Arg Pro Thr Ala Ala Pro Arg
20 25 30

Ser Met Thr His Ala Lys Arg Ala Arg Gln Pro Leu Ser Glu Ala Ser
35 40 45

Asn Gln Gln Pro Leu Ser Gly Gly Glu Glu Lys Ser Cys Thr Lys Pro
50 55 60

Ser Pro Ser Lys Lys Arg Cys Ser Asp Asn Thr Glu Val Glu Val Ser
65 70 75 80

Asn Leu Glu Asn Lys Gln Pro Val Glu Ser Thr Ser Ala Lys Ser Cys
85 90 95

Ser Pro Ser Pro Val Ser Pro Gln Val Gln Pro Gln Ala Ala Asp Thr
100 105 110

Ile Ser Asp Ser Val Ala Val Pro Ala Ser Leu Leu Gly Met Arg Arg
115 120 125

Gly Leu Asn Ser Arg Leu Glu Ala Thr Ala Ala Ser Ser Val Lys Thr
130 135 140

Arg Met Gln Lys Leu Ala Glu Gln Arg Arg Arg Trp Asp Asn Asp Asp
145 150 155 160

Met Thr Asp Asp Ile Pro Glu Ser Ser Leu Phe Ser Pro Met Pro Ser
165 170 175

Glu Glu Lys Ala Ala Ser Pro Pro Arg Pro Leu Leu Ser Asn Ala Ser
180 185 190

Ala Thr Pro Val Gly Arg Arg Gly Arg Leu Ala Asn Leu Ala Ala Thr
195 200 205

Ile Cys Ser Trp Glu Asp Asp Val Asn His Ser Phe Ala Lys Gln Asn
210 215 220

Ser Val Gln Glu Gln Pro Gly Thr Ala Cys Leu Ser Lys Phe Ser Ser
225 230 235 240

Ala Ser Gly Ala Ser Ala Arg Ile Asn Ser Ser Ser Val Lys Gln Glu
245 250 255

Ala Thr Phe Cys Ser Gln Arg Asp Gly Asp Ala Ser Leu Asn Lys Ala
260 265 270

Leu Ser Ser Ser Ala Asp Asp Ala Ser Leu Val Asn Ala Ser Ile Ser
275 280 285

Ser Ser Val Lys Ala Thr Ser Pro Val Lys Ser Thr Thr Ser Ile Thr
290 295 300

Asp Ala Lys Ser Cys Glu Gly Gln Asn Pro Glu Leu Leu Pro Lys Thr
305 310 315 320

Pro Ile Ser Pro Leu Lys Thr Gly Val Ser Lys Pro Ile Val Lys Ser
325 330 335

Thr Leu Ser Gln Thr Val Pro Ser Lys Gly Glu Leu Ser Arg Glu Ile
340 345 350

Cys Leu Gln Ser Gln Ser Lys Asp Lys Ser Thr Thr Pro Gly Gly Thr
355 360 365

Gly Ile Lys Pro Phe Leu Glu Arg Phe Gly Glu Arg Cys Gln Glu His
370 375 380

Ser Lys Glu Ser Pro Ala Arg Ser Thr Pro His Arg Thr Pro Ile Ile
385 390 395 400

Thr Pro Asn Thr Lys Ala Ile Gln Glu Arg Leu Phe Lys Gln Asp Thr
405 410 415

Ser Ser Ser Thr Thr His Leu Ala Gln Gln Leu Lys Gln Glu Arg Gln
420 425 430

Lys Glu Leu Ala Cys Leu Arg Gly Arg Phe Asp Lys Gly Asn Ile Trp
435 440 445

Ser Ala Glu Lys Gly Gly Asn Ser Lys Ser Lys Gln Leu Glu Thr Lys
450 455 460

Gln Glu Thr His Cys Gln Ser Thr Pro Leu Lys Lys His Gln Gly Val
465 470 475 480

Ser Lys Thr Gln Ser Leu Pro Val Thr Glu Lys Val Thr Glu Asn Gln
485 490 495

Ile Pro Ala Lys Asn Ser Ser Thr Glu Pro Lys Gly Phe Thr Glu Cys
500 505 510

Glu Met Thr Lys Ser Ser Pro Leu Lys Ile Thr Leu Phe Leu Glu Glu
515 520 525

Asp Lys Ser Leu Lys Val Thr Ser Asp Pro Lys Val Glu Gln Lys Ile
530 535 540

Glu Val Ile Arg Glu Ile Glu Met Ser Val Asp Asp Asp Asp Ile Asn
545 550 555 560

Ser Ser Lys Val Ile Asn Asp Leu Phe Ser Asp Val Leu Glu Glu Gly
565 570 575

Glu Leu Asp Met Glu Lys Ser Gln Glu Glu Met Asp Gln Ala Leu Ala
580 585 590

Glu Ser Ser Glu Glu Gln Glu Asp Ala Leu Asn Ile Ser Ser Met Ser
595 600 605

Leu Leu Ala Pro Leu Ala Gln Thr Val Gly Val Val Ser Pro Glu Ser
610 615 620

Leu Val Ser Thr Pro Arg Leu Glu Leu Lys Asp Thr Ser Arg Ser Asp
625 630 635 640

Glu Ser Pro Lys Pro Gly Lys Phe Gln Arg Thr Arg Val Pro Arg Ala
645 650 655

Glu Ser Gly Asp Ser Leu Gly Ser Glu Asp Arg Asp Leu Leu Tyr Ser
660 665 670

Ile Asp Ala Tyr Arg Ser Gln Arg Phe Lys Glu Thr Glu Arg Pro Ser
675 680 685

Ile Lys Gln Val Ile Val Arg Lys Glu Asp Val Thr Ser Lys Leu Asp
690 695 700

Glu Lys Asn Asn Ala Phe Pro Cys Gln Val Asn Ile Lys Gln Lys Met
705 710 715 720

Gln Glu Leu Asn Asn Glu Ile Asn Met Gln Gln Thr Val Ile Tyr Gln
725 730 735

Ala Ser Gln Ala Leu Asn Cys Cys Val Asp Glu Glu His Gly Lys Gly
740 745 750

Ser Leu Glu Glu Ala Glu Ala Glu Arg Leu Leu Leu Ile Ala Thr Gly
755 760 765

Lys Arg Thr Leu Leu Ile Asp Glu Leu Asn Lys Leu Lys Asn Glu Gly
770 775 780

Pro Gln Arg Lys Asn Lys Ala Ser Pro Gln Ser Glu Phe Met Pro Ser
785 790 795 800

Lys Gly Ser Val Thr Leu Ser Glu Ile Arg Leu Pro Leu Lys Ala Asp
805 810 815

Phe Val Cys Ser Thr Val Gln Lys Pro Asp Ala Ala Asn Tyr Tyr Tyr
820 825 830

Leu Ile Ile Leu Lys Ala Gly Ala Glu Asn Met Val Ala Thr Pro Leu
835 840 845

Ala Ser Thr Ser Asn Ser Leu Asn Gly Asp Ala Leu Thr Phe Thr Thr
850 855 860

Thr Phe Thr Leu Gln Asp Val Ser Asn Asp Phe Glu Ile Asn Ile Glu
865 870 875 880

Val Tyr Ser Leu Val Gln Lys Lys Asp Pro Ser Gly Leu Asp Lys Lys
885 890 895

Lys Lys Thr Ser Lys Ser Lys Ala Ile Thr Pro Lys Arg Leu Leu Thr
900 905 910

Ser Ile Thr Thr Lys Ser Asn Ile His Ser Ser Val Met Ala Ser Pro
915 920 925

Gly Gly Leu Ser Ala Val Arg Thr Ser Asn Phe Ala Leu Val Gly Ser
930 935 940

Tyr Thr Leu Ser Leu Ser Ser Val Gly Asn Thr Lys Phe Val Leu Asp
945 950 955 960

Lys Val Pro Phe Leu Ser Ser Leu Glu Gly His Ile Tyr Leu Lys Ile
965 970 975

Lys Cys Gln Val Asn Ser Ser Val Glu Glu Arg Gly Phe Leu Thr Ile
980 985 990

Phe Glu Asp Val Ser Gly Phe Gly Ala Trp His Arg Arg Trp Cys Val
995 1000 1005

Leu Ser Gly Asn Cys Ile Ser Tyr Trp Thr Tyr Pro Asp Asp Glu Lys
1010 1015 1020

Arg Lys Asn Pro Ile Gly Arg Ile Asn Leu Ala Asn Cys Thr Ser Arg
1025 1030 1035 1040

Gln Ile Glu Pro Ala Asn Arg Glu Phe Cys Ala Arg Arg Asn Thr Phe
1045 1050 1055

Glu Leu Ile Thr Val Arg Pro Gln Arg Glu Asp Asp Arg Glu Thr Leu
1060 1065 1070

Val Ser Gln Cys Arg Asp Thr Leu Cys Val Thr Lys Asn Trp Leu Ser
1075 1080 1085

Ala Asp Thr Lys Glu Glu Arg Asp Leu Trp Met Gln Lys Leu Asn Gln
1090 1095 1100

Val Leu Val Asp Ile Arg Leu Trp Gln Pro Asp Ala Cys Tyr Lys Pro
1105 1110 1115 1120

Ile Gly Lys Pro



<210> 464 

<211> 4786 

<212> DNA 

<213> Homo sapiens 

<400> 464 

ctcggcgctg aaattcaaat ttgaacggct gcagaggccg agtccgtcac tggaagccga 60
gaggagagga cagctggttg tgggagagtt cccccgcctc agactcctgg ttttttccag 120
gagacacact gagctgagac tcacttttct cttcctgaat ttgaaccacc gtttccatcg 180
tctcgtagtc cgacgcctgg ggcgatggat ccgtttacgg agaaactgct ggagcgaacc 240
cgtgccaggc gagagaatct tcagagaaaa atggctgaga ggcccacagc agctccaagg 300
tctatgactc atgctaagcg agctagacag ccactttcag aagcaagtaa ccagcagccc 360
ctctctggtg gtgaagagaa atcttgtaca aaaccatcgc catcaaaaaa acgctgttct 420
gacaacactg aagtagaagt ttctaacttg gaaaataaac aaccagttga gtcgacatct 480
gcaaaatctt gttctccaag tcctgtgtct cctcaggtgc agccacaagc agcagatacc 540
atcagtgatt ctgttgctgt cccggcatca ctgctgggca tgaggagagg gctgaactca 600
agattggaag caactgcagc ctcctcagtt aaaacacgta tgcaaaaact tgcagagcaa 660
cggcgccgtt gggataatga tgatatgaca gatgacattc ctgaaagctc actcttctca 720
ccaatgccat cagaggaaaa ggctgcttcc cctcccagac ctctgctttc aaatgcctcg 780
gcaactccag ttggcagaag gggccgtctg gccaatcttg ctgcaactat ttgctcctgg 840
gaagatgatg taaatcactc atttgcaaaa caaaacagtg tacaagaaca gcctggtacc 900
gcttgtttat ccaaattttc ctctgcaagt ggagcatctg ctaggatcaa tagcagcagt 960
gttaagcagg aagctacatt ctgttcccaa agggatggcg atgcctcttt gaataaagcc 1020
ctatcctcaa gtgctgatga tgcgtctttg gttaatgcct caatttccag ctctgtgaaa 1080
gctacttctc cagtgaaatc tactacatct atcactgatg ctaaaagttg tgagggacaa 1140
aatcctgagc tacttccaaa aactcctatt agtcctctga aaacgggggt atcgaaacca 1200
attgtgaagt caactttatc ccagacagtt ccatccaagg gagaattaag tagagaaatt 1260
tgtctgcaat ctcaatctaa agacaaatct acgacaccag gaggaacagg aattaagcct 1320
ttcctggaac gctttggaga gcgttgtcaa gaacatagca aagaaagtcc agctcgtagc 1380
acaccccaca gaacccccat tattactcca aatacaaagg ccatccaaga aagattattc 1440
aagcaagaca catcttcatc tactacccat ttagcacaac agctcaagca ggaacgtcaa 1500
aaagaactag catgtcttcg tggccgattt gacaagggca atatatggag tgcagaaaaa 1560
ggcggaaact caaaaagcaa acaactagaa accaaacagg aaactcactg tcagagcact 1620
cccctcaaaa aacaccaagg tgtttcaaaa actcagtcac ttccagtaac agaaaaggtg 1680
accgaaaacc agataccagc caaaaattct agtacagaac ctaaaggttt cactgaatgc 1740
gaaatgacga aatctagccc tttgaaaata acattgtttt tagaagagga caaatcctta 1800 
aaagtaacat cagacccaaa ggttgagcag aaaattgaag tgatacgtga aattgagatg 1860
agtgtggatg atgatgatat caatagttcg aaagtaatta atgacctctt cagtgatgtc 1920 
ctagaggaag gtgaactaga tatggagaag agccaagagg agatggatca agcattagca 1980 
gaaagcagcg aagaacagga agatgcactg aatatctcct caatgtcttt acttgcacca 2040 
ttggcacaaa cagttggtgt ggtaagtcca gagagtttag tgtccacacc tagactggaa 2100 
ttgaaagaca ccagcagaag tgatgaaagt ccaaaaccag gaaaattcca aagaactcgt 2160 
gtccctcgag ctgaatctgg tgatagcctt ggttctgaag atcgtgatct tctttacagc 2220 
attgatgcat atagatctca aagattcaaa gaaacagaac gtccatcaat aaagcaggtg 2280 
attgttcgga aggaagatgt tacttcaaaa ctggatgaaa aaaataatgc ctttccttgt 2340 
caagttaata tcaaacagaa aatgcaggaa ctcaataacg aaataaatat gcaacagaca 2400 
gtgatctatc aagctagcca ggctcttaac tgctgtgttg atgaagaaca tggaaaaggg 2460 
tccctagaag aagctgaagc agaaagactt cttctaattg caactgggaa gagaacactt 2520 
ttgattgatg aattgaataa attgaagaac gaaggacctc agaggaagaa taaggctagt 2580 
ccccaaagtg aatttatgcc atccaaagga tcagttactt tgtcagaaat ccgcttgcct 2640 
ctaaaagcag attttgtctg cagtacggtt cagaaaccag atgcagcaaa ttactattac 2700 
ttaattatac taaaagcagg agctgaaaat atggtagcca caccattagc aagtacttca 2760 
aactctctta acggtgatgc tctgacattc actactacat ttactctgca agatgtatcc 2820 
aatgactttg aaataaatat tgaagtttac agcttggtgc aaaagaaaga tccctcaggc 2880 
cttgataaga agaaaaaaac atccaagtcc aaggctatta ctccaaagcg actcctcaca 2940 
tctataacca caaaaagcaa cattcattct tcagtcatgg ccagtccagg aggtcttagt 3000 
gctgtgcgaa ccagcaactt cgcccttgtt ggatcttaca cattatcatt gtcttcagta 3060 
ggaaatacta agtttgttct ggacaaggtc ccctttttat cttctttgga aggtcatatt 3120 
tatttaaaaa taaaatgtca agtgaattcc agtgttgaag aaagaggttt tctaaccata 3180 
tttgaagatg ttagtggttt tggtgcctgg catcgaagat ggtgtgttct ttctggaaac 3240 
tgtatatctt attggactta tccagatgat gagaaacgca agaatcccat aggaaggata 3300 
aatctggcta attgtaccag tcgtcagata gaaccagcca acagagaatt ttgtgcaaga 3360 
cgcaacactt ttgaattaat tactgtccga ccacaaagag aagatgaccg agagactctt 3420 
gtcagccaat gcagggacac actctgtgtt accaagaact ggctgtctgc agatactaaa 3480 
gaagagcggg atctctggat gcaaaaactc aatcaagttc ttgttgatat tcgcctctgg 3540 
caacctgatg cttgctacaa acctattgga aagccttaaa ccgggaaatt tccatgctat 3600 
ctagaggttt ttgatgtcat cttaagaaac acacttaaga gcatcagatt tactgattgc 3660 
attttatgct ttaagtacga aagggtttgt gccaatattc actacgtatt atgcagtatt 3720 
tatatctttt gtatgtaaaa ctttaactga tttctgtcat tcatcaatga gtagaagtaa 3780 
atacattata gttgattttg ctaaatctta atttaaaagc ctcattttcc tagaaatcta 3840 
attattcagt tattcatgac aatatttttt taaaagtaag aaattctgag ttgtcttctt 3900 
ggagctgtag gtcttgaagc agcaacgtct ttcaggggtt ggagacagaa acccattctc 3960 
caatctcagt agttttttcg aaaggctgtg atcatttatt gatcgtgata tgacttgtta 4020 
ctagggtact gaaaaaaatg tctaaggcct ttacagaaac atttttagta atgaggatga 4080 
gaactttttc aaatagcaaa tatatattgg cttaaagcat gaggctgtct tcagaaaagt 4140
gatgtggaca taggaggcaa tgtgtgagac ttgggggttc aatattttat atagaagagt 4200 
taataagcac atggtttaca tttactcagc tactatatat gcagtgtggt gcacattttc 4260 
acagaattct ggcttcatta agatcattat ttttgctgcg tagcttacag acttagcata 4320 
ttagtttttt ctactcctac aagtgtaaat tgaaaaatct ttatattaaa aaagtaaact 4380 
gttatgaagc tgctatgtac taataatact ttgcttgcca aagtgtttgg gttttgttgt 4440 
tgtttgtttg tttgtttgtt tttggttcat gaacaacagt gtctagaaac ccattttgaa 4500 
agtggaaaat tattaagtca cctatcacct ttaaacgcct ttttttaaaa ttataaaata 4560 
ttgtaaagca gggtctcaac ttttaaatac actttgaact tcttctctga attattaaag 4620 
ttctttatga cctcatttat aaacactaaa ttctgtcacc tcctgtcatt ttatttttta 4680 
ttcattcaaa tgtatttttt cttgtgcata ttataaaaat atattttatg agctcttact 4740 
caaataaata cctgtaaatg tctaaaggaa aaaaaaaaaa aaaaaa 4786 

<210> 465 
<211> 76 
<212> PRT 

<213> Homo sapiens 
<400> 465 

Met Ser Lys Ala His Pro Pro Glu Leu Lys Lys Phe Met Asp Lys Lys
1 5 10 15

Leu Ser Leu Lys Leu Asn Gly Gly Arg His Val Gln Gly Ile Leu Arg
20 25 30

Gly Phe Asp Pro Phe Met Asn Leu Val Ile Asp Glu Cys Val Glu Met
35 40 45

Ala Thr Ser Gly Gln Gln Asn Asn Ile Gly Met Val Val Ile Arg Gly
50 55 60

Asn Ser Ile Ile Met Leu Glu Ala Leu Glu Arg Val
65 70 75



<210> 466 
<211> 606 
<212> DNA 
<213> Homo sapiens 



<400> 466 

agacgcaaga cgccgggcct acagcgggag cgtgaggaaa gccgtgcgtt gcgttccaag 60
gcatctgtga gcccgcggag tatacaccat gagcaaagct caccctcccg agttgaaaaa 120
atttatggac aagaagttat cattgaaatt aaatggtggc agacatgtcc aaggaatatt 180
gcggggattt gatcccttta tgaaccttgt gatagatgaa tgtgtggaga tggcgactag 240
tggacaacag aacaatattg gaatggtggt aatacgagga aatagtatca tcatgttaga 300
agccttggaa cgagtataaa taatggctgt tcagcagaga aacccatgtc ctctctccat 360
agggcctgtt ttactatgat gtaaaaatta ggtcatgtac attttcatat tagacttttt 420
gttaaataaa cttttgtaat agtcaaaaat gctttctcag atgttctgaa tatagaatat 480
cagctctcat tccagttttt tctaacatga attttcctgg ttgacattga tttcaaaggg 540
ttttatgcat taaagtgaaa gaatcttatt aaatgcgaaa aaaaaaaaaa aaaaaaaaaa 600
aaaaaa 606



<210> 467 
<211> 971 
<212> PRT 
<213> Homo sapiens 



<400> 467 
Met Glu Leu Ser Asp Ala Asn Leu Gln Thr Leu Thr Glu Tyr Leu Lys
1 5 10 15

Lys Thr Leu Asp Pro Asp Pro Ala Ile Arg Arg Pro Ala Glu Lys Phe
20 25 30

Leu Glu Ser Val Glu Gly Asn Gln Asn Tyr Pro Leu Leu Leu Leu Thr
35 40 45

Leu Leu Glu Lys Ser Gln Asp Asn Val Ile Lys Val Cys Ala Ser Val
50 55 60

Thr Phe Lys Asn Tyr Ile Lys Arg Asn Trp Arg Ile Val Glu Asp Glu
65 70 75 80

Pro Asn Lys Ile Cys Glu Ala Asp Arg Val Ala Ile Lys Ala Asn Ile
85 90 95

Val His Leu Met Leu Ser Ser Pro Glu Gln Ile Gln Lys Gln Leu Ser
100 105 110

Asp Ala Ile Ser Ile Ile Gly Arg Glu Asp Phe Pro Gln Lys Trp Pro
115 120 125

Asp Leu Leu Thr Glu Met Val Asn Arg Phe Gln Ser Gly Asp Phe His
130 135 140

Val Ile Asn Gly Val Leu Arg Thr Ala His Ser Leu Phe Lys Arg Tyr
145 150 155 160

Arg His Glu Phe Lys Ser Asn Glu Leu Trp Thr Glu Ile Lys Leu Val
165 170 175

Leu Asp Ala Phe Ala Leu Pro Leu Thr Asn Leu Phe Lys Ala Thr Ile
180 185 190

Glu Leu Cys Ser Thr His Ala Asn Asp Ala Ser Ala Leu Arg Ile Leu
195 200 205

Phe Ser Ser Leu Ile Leu Ile Ser Lys Leu Phe Tyr Ser Leu Asn Phe
210 215 220

Gln Asp Leu Pro Glu Phe Phe Glu Asp Asn Met Glu Thr Trp Met Asn
225 230 235 240

Asn Phe His Thr Leu Leu Thr Leu Asp Asn Lys Leu Leu Gln Thr Asp
245 250 255

Asp Glu Glu Glu Ala Gly Leu Leu Glu Leu Leu Lys Ser Gln Ile Cys
260 265 270

Asp Asn Ala Ala Leu Tyr Ala Gln Lys Tyr Asp Glu Glu Phe Gln Arg
275 280 285

Tyr Leu Pro Arg Phe Val Thr Ala Ile Trp Asn Leu Leu Val Thr Thr
290 295 300

Gly Gln Glu Val Lys Tyr Asp Leu Leu Val Ser Asn Ala Ile Gln Phe
305 310 315 320

Leu Ala Ser Val Cys Glu Arg Pro His Tyr Lys Asn Leu Phe Glu Asp
325 330 335

Gln Asn Thr Leu Thr Ser Ile Cys Glu Lys Val Ile Val Pro Asn Met
340 345 350

Glu Phe Arg Ala Ala Asp Glu Glu Ala Phe Glu Asp Asn Ser Glu Glu
355 360 365

Tyr Ile Arg Arg Asp Leu Glu Gly Ser Asp Ile Asp Thr Arg Arg Arg
370 375 380

Ala Ala Cys Asp Leu Val Arg Gly Leu Cys Lys Phe Phe Glu Gly Pro
385 390 395 400

Val Thr Gly Ile Phe Ser Gly Tyr Val Asn Ser Met Leu Gln Glu Tyr
405 410 415

Ala Lys Asn Pro Ser Val Asn Trp Lys His Lys Asp Ala Ala Ile Tyr
420 425 430

Leu Val Thr Ser Leu Ala Ser Lys Ala Gln Thr Gln Lys His Gly Ile
435 440 445

Thr Gln Ala Asn Glu Leu Val Asn Leu Thr Glu Phe Phe Val Asn His
450 455 460

Ile Leu Pro Asp Leu Lys Ser Ala Asn Val Asn Glu Phe Pro Val Leu
465 470 475 480

Lys Ala Asp Gly Ile Lys Tyr Ile Met Ile Phe Arg Asn Gln Val Pro
485 490 495

Lys Glu His Leu Leu Val Ser Ile Pro Leu Leu Ile Asn His Leu Gln
500 505 510

Ala Glu Ser Ile Val Val His Thr Tyr Ala Ala His Ala Leu Glu Arg
515 520 525

Leu Phe Thr Met Arg Gly Pro Asn Asn Ala Thr Leu Phe Thr Ala Ala
530 535 540

Glu Ile Ala Pro Phe Val Glu Ile Leu Leu Thr Asn Leu Phe Lys Ala
545 550 555 560

Leu Thr Leu Pro Gly Ser Ser Glu Asn Glu Tyr Ile Met Lys Ala Ile
565 570 575

Met Arg Ser Phe Ser Leu Leu Gln Glu Ala Ile Ile Pro Tyr Ile Pro
580 585 590

Thr Leu Ile Thr Gln Leu Thr Gln Lys Leu Leu Ala Val Ser Lys Asn
595 600 605

Pro Ser Lys Pro His Phe Asn His Tyr Met Phe Glu Ala Ile Cys Leu
610 615 620

Ser Ile Arg Ile Thr Cys Lys Ala Asn Pro Ala Ala Val Val Asn Phe
625 630 635 640

Glu Glu Ala Leu Phe Leu Val Phe Thr Glu Ile Leu Gln Asn Asp Val
645 650 655

Gln Glu Phe Ile Pro Tyr Val Phe Gln Val Met Ser Leu Leu Leu Glu
660 665 670

Thr His Lys Asn Asp Ile Pro Ser Ser Tyr Met Ala Leu Phe Pro His
675 680 685

Leu Leu Gln Pro Val Leu Trp Glu Arg Thr Gly Asn Ile Pro Ala Leu
690 695 700

Val Arg Leu Leu Gln Ala Phe Leu Glu Arg Gly Ser Asn Thr Ile Ala
705 710 715 720

Ser Ala Ala Ala Asp Lys Ile Pro Gly Leu Leu Gly Val Phe Gln Lys
725 730 735

Leu Ile Ala Ser Lys Ala Asn Asp His Gln Gly Phe Tyr Leu Leu Asn
740 745 750

Ser Ile Ile Glu His Met Pro Pro Glu Ser Val Asp Gln Tyr Arg Lys
755 760 765

Gln Ile Phe Ile Leu Leu Phe Gln Arg Leu Gln Asn Ser Lys Thr Thr
770 775 780

Lys Phe Ile Lys Ser Phe Leu Val Phe Ile Asn Leu Tyr Cys Ile Lys
785 790 795 800

Tyr Gly Ala Leu Ala Leu Gln Glu Ile Phe Asp Gly Ile Gln Pro Lys
805 810 815

Met Phe Gly Met Val Leu Glu Lys Ile Ile Ile Pro Glu Ile Gln Lys
820 825 830

Val Ser Gly Asn Val Glu Lys Lys Ile Cys Ala Val Gly Ile Thr Lys
835 840 845

Leu Leu Thr Glu Cys Pro Pro Met Met Asp Thr Glu Tyr Thr Lys Leu
850 855 860

Trp Thr Pro Leu Leu Gln Ser Leu Ile Gly Leu Phe Glu Leu Pro Glu
865 870 875 880

Asp Asp Thr Ile Pro Asp Glu Glu His Phe Ile Asp Ile Glu Asp Thr
885 890 895

Pro Gly Tyr Gln Thr Ala Phe Ser Gln Leu Ala Phe Ala Gly Lys Lys
900 905 910

Glu His Asp Pro Val Gly Gln Met Val Asn Asn Pro Lys Ile His Leu
915 920 925

Ala Gln Ser Leu His Lys Leu Ser Thr Ala Cys Pro Gly Arg Val Pro
930 935 940

Ser Met Val Ser Thr Ser Leu Asn Ala Glu Ala Leu Gln Tyr Leu Gln
945 950 955 960

Gly Tyr Leu Gln Ala Ala Ser Val Thr Leu Leu
965 970



<210> 468 

<211> 3579 

<212> DNA 

<213> Homo sapiens 



<400> 468 

tcaggctcgc tgtcgcgcca ttttgccggg gtttgaatgt gaggcggagc ggcggcagga 60
gcgggtagtg ccagctacgg tccgcggctg gggttccctc ctccgtttct gtatccccac 120
gagatcctat agcaatggaa ctcagcgatg caaatctgca aacactaaca gaatatttaa 180
agaaaacact tgatcctgat cctgccatcc gacgtccagc tgagaaattt cttgaatctg 240
ttgaaggaaa tcagaattat ccactgttgc ttttgacatt actggagaag tcccaggata 300
atgttatcaa agtatgtgct tcagtaacat tcaaaaacta tattaaaagg aactggagaa 360
ttgttgaaga tgaaccaaac aaaatttgtg aagccgatcg agtggccatt aaagccaaca 420
tagtgcactt gatgcttagc agcccagagc aaattcagaa gcagttaagt gatgcaatta 480
gcattattgg cagagaagat tttccacaga aatggcctga cttgctgaca gaaatggtga 540
atcgctttca gagtggagat ttccatgtta ttaatggagt cctccgtaca gcacattcat 600
tatttaaaag ataccgtcat gaatttaagt caaacgagtt atggactgaa attaagcttg 660
ttctggatgc ctttgctttg cctttgacta atctttttaa ggccactatt gaactctgca 720
gtacccatgc aaatgatgcc tctgccctga ggattctgtt ttcttccctg atcctgatct 780
caaaattgtt ctatagttta aactttcagg atctccctga attttttgaa gataatatgg 840
aaacttggat gaataatttt catactctct taacattgga taataagctt ttacaaactg 900
atgatgaaga ggaagccggc ttattggagc tcttaaaatc ccagatttgt gataatgccg 960
cactctatgc acaaaagtac gatgaagaat tccagcgata cctgcctcgt tttgttacag 1020
ccatctggaa tttactagtt acaacgggtc aagaggttaa atatgatttg ttggtaagta 1080
atgcaattca atttctggct tcagtttgtg agagacctca ttataagaat ctatttgagg 1140
accagaacac gctgacaagt atctgtgaaa aggttattgt gcctaacatg gaatttagag 1200
ctgctgatga agaagcattt gaagataatt ctgaggagta cataaggaga gatttggaag 1260
gatctgatat tgatactaga cgcagggctg cttgtgatct ggtacgagga ttatgcaagt 1320
tttttgaggg acctgtgaca ggaatcttct ctggttatgt taattccatg ctgcaggaat 1380
acgcaaaaaa tccatctgtc aactggaaac acaaagatgc agccatctac ctagtgacat 1440
ctttggcatc aaaagcccaa acacagaagc atggaattac acaagcaaat gaacttgtaa 1500
acctaactga gttctttgtg aatcacatcc tccctgattt aaaatcagct aatgtgaatg 1560
aatttcctgt ccttaaagct gacggtatca aatatattat gatttttaga aatcaagtgc 1620
caaaagaaca tcttttagtc tcgattcctc tcttgattaa tcatcttcaa gctgaaagta 1680
ttgttgttca tacttacgca gctcatgctc ttgaacggct ctttactatg cgagggccta 1740
acaatgccac tctctttaca gctgcagaaa tcgcaccgtt tgttgagatt ctgctaacaa 1800
accttttcaa agctctcaca cttcctggct cttcagaaaa tgaatatatt atgaaagcta 1860
tcatgagaag tttttctctc ctacaagaag ccataatccc ctacatccct actctcatca 1920
ctcagcttac acagaagcta ttagctgtta gtaagaaccc aagcaaacct cactttaatc 1980
actacatgtt tgaagcaata tgtttatcca taagaataac ttgcaaagct aaccctgctg 2040
ctgttgtaaa ttttgaggag gctttgtttt tggtgtttac tgaaatctta caaaatgatg 2100
tgcaagaatt tattccatac gtctttcaag tgatgtcttt gcttctggaa acacacaaaa 2160
atgacatccc gtcttcctat atggccttat ttcctcatct ccttcagcca gtgctttggg 2220
aaagaacagg aaatattcct gctctagtga ggcttcttca agcattctta gaacgcggtt 2280
caaacacaat agcaagtgct gcagctgaca aaattcctgg gttactaggt gtctttcaga 2340
agctgattgc atccaaagca aatgaccacc aaggttttta tcttctaaac agtataatag 2400
agcacatgcc tcctgaatca gttgaccaat ataggaaaca aatcttcatt ctgctattcc 2460
agagacttca gaattccaaa acaaccaagt ttatcaagag ttttttagtc tttattaatt 2520
tgtattgcat aaaatatggg gcactagcac tacaagaaat atttgatggt atacaaccaa 2580
aaatgtttgg aatggttttg gaaaaaatta ttattcctga aattcagaag gtatctggaa 2640
atgtagagaa aaagatctgt gcggttggca taaccaaatt actaacagaa tgtcccccaa 2700
tgatggacac tgagtatacc aaactgtgga ctccattatt acagtctttg attggtcttt 2760
ttgagttacc cgaagatgat accattcctg atgaggaaca ttttattgac atagaagata 2820
caccaggata tcagactgcc ttctcacagt tggcatttgc tgggaaaaaa gagcatgatc 2880
ctgtaggtca aatggtgaat aaccccaaaa ttcacctggc acagtcactt cacaagttgt 2940
ctaccgcctg tccaggaagg gttccatcaa tggtgagcac cagcctgaat gcagaagcgc 3000
tccagtatct ccaagggtac cttcaggcag ccagtgtgac actgctttaa actgcatttt 3060
tctaatgggc taaacccaga tggtttccta ggaaatcaca ggcttctgag cacagctgca 3120
ttaaaacaaa ggaagttctc cttttgaact tgtcacgaat tccatcttgt aaaggatatt 3180
aaatgttgct ttaacctgaa ccttgagcaa attagttggt ttgtgtgatc atacagttat 3240
gtgggtggct tctagtttgc aacttcaagg gacaagtatt aatagttcag tgtatggcgt 3300
tggtttgtgt tgagcgtttg cacggtttgg ataatcttaa attttgacgg acactgtgga 3360
gactttctgt tactaaatcc ttttgttttg aagctgttgc tatttgtatt tctcttgtcc 3420
tttatatttt ttgtctgttt atttacgctt ttattggaaa tgtgaataag taaagaatta 3480
cttgtgttac ttgccaagca gtgcacattt catagtttca aatctgtaat cagcaataaa 3540
aatcctaaaa tatgtaccta aaaaaaaaaa aaaaaaaaa 3579



<210> 469 
<211> 234 
<212> PRT 

<213> Homo sapiens 



<400> 469 




























Met Ser Cys Ile Asn Leu Pro Thr Val Leu Pro Gly Ser Pro Ser Lys
1 5 10 15

Thr Arg Gly Gln Ile Gln Val Ile Leu Gly Pro Met Phe Ser Gly Lys
20 25 30

Ser Thr Glu Leu Met Arg Arg Val Arg Arg Phe Gln Ile Ala Gln Tyr
35 40 45

Lys Cys Leu Val Ile Lys Tyr Ala Lys Asp Thr Arg Tyr Ser Ser Ser
50 55 60

Phe Cys Thr His Asp Arg Asn Thr Met Glu Ala Leu Pro Ala Cys Leu
65 70 75 80

Leu Arg Asp Val Ala Gln Glu Ala Leu Gly Val Ala Val Ile Gly Ile
85 90 95

Asp Glu Gly Gln Phe Phe Pro Asp Ile Met Glu Phe Cys Glu Ala Met
100 105 110

Ala Asn Ala Gly Lys Thr Val Ile Val Ala Ala Leu Asp Gly Thr Phe
115 120 125

Gln Arg Lys Pro Phe Gly Ala Ile Leu Asn Leu Val Pro Leu Ala Glu
130 135 140

Ser Val Val Lys Leu Thr Ala Val Cys Met Glu Cys Phe Arg Glu Ala
145 150 155 160

Ala Tyr Thr Lys Arg Leu Gly Thr Glu Lys Glu Val Glu Val Ile Gly
165 170 175

Gly Ala Asp Lys Tyr His Ser Val Cys Arg Leu Cys Tyr Phe Lys Lys
180 185 190

Ala Ser Gly Gln Pro Ala Gly Pro Asp Asn Lys Glu Asn Cys Pro Val
195 200 205

Pro Gly Lys Pro Gly Glu Ala Val Ala Ala Arg Lys Leu Phe Ala Pro
210 215 220

Gln Gln Ile Leu Gln Cys Ser Pro Ala Asn
225 230



<210> 470 
<211> 1421 
<212> DNA 

<213> Homo sapiens 



<400> 470 

acttactgcg ggacggcctt ggagagtact cgggttcgtg aacttcccgg aggcgcaatg 60 
agctgcatta acctgcccac tgtgctgccc ggctccccca gcaagacccg ggggcagatc 120 
caggtgattc tcgggccgat gttctcagga aaaagcacag agttgatgag acgcgtccgt 180 
cgcttccaga ttgctcagta caagtgcctg gtgatcaagt atgccaaaga cactcgctac 240 
agcagcagct tctgcacaca tgaccggaac accatggagg cgctgcccgc ctgcctgctc 300 
cgagacgtgg cccaggaggc cctgggcgtg gctgtcatag gcatcgacga ggggcagttt 360 
ttccctgaca tcatggagtt ctgcgaggcc atggccaacg ccgggaagac cgtaattgtg 420 
gctgcactgg atgggacctt ccagaggaag ccatttgggg ccatcctgaa cctggtgccg 480 
ctggccgaga gcgtggtgaa gctgacggcg gtgtgcatgg agtgcttccg ggaagccgcc 540 
tataccaaga ggctcggcac agagaaggag gtcgaggtga ttgggggagc agacaagtac 600 
cactccgtgt gtcggctctg ctacttcaag aaggcctcag gccagcctgc cgggccggac 660 
aacaaagaga actgcccagt gccaggaaag ccaggggaag ccgtggctgc caggaagctc 720 
tttgccccac agcagattct gcaatgcagc cctgccaact gagggacctg caagggccgc 780 
ccgctccctt cctgccactg ccgcctactg gacgctgccc tgcatgctgc ccagccactc 840 
caggaggaag tcgggaggcg tggagggtga ccacaccttg gccttctggg aactctcctt 900 
tgtgtggctg ccccacctgc cgcatgctcc ctcctctcct acccactggt ctgcttaaag 960 
cttccctctc agctgctggg acgatcgccc aggctggagc tggccccgct tggtggcctg 1020 
ggatctggca cactccctct ccttggggtg agggacagag ccccacgctg ttgacatcag 1080 
cctgcttctt cccctctgcg gctttcactg ctgagtttct gttctccctg ggaagcctgt 1140 
gccagcacct ttgagccttg gcccacactg aggcttaggc ctctctgcct gggatgggct 1200 
cccaccctcc cctgaggatg gcctggattc acgccctctt gtttcctttt gggctcaaag 1260 
cccttcctac ctctggtgat ggtttccaca ggaacaacag catctttcac caagatgggt 1320 
ggcaccaacc ttgctgggac ttggatccca ggggcttatc tcttcaagtg tggagagggc 1380 
agggtccacg cctctgctgt agcttatgaa attaactaat t 1421 

<210> 471 
<211> 792 
<212> PRT 
<213> Homo sapiens 

<400> 471 

Met His Val Ile Lys Arg Asp Gly Arg Gln Glu Arg Val Met Phe Asp 
1 5 10 15 

Lys Ile Thr Ser Arg Ile Gln Lys Leu Cys Tyr Gly Leu Asn Met Asp 
20 25 30 

Phe Val Asp Pro Ala Gln Ile Thr Met Lys Val Ile Gln Gly Leu Tyr 
35 40 45 



Ocl 


Gly Val 


Thr 
i in. 


1 i 1 X 


Val 


p i n 

O X u 


Leu 


Asp 


Thr 


Leu 


Ala 


Al A 
ni d 


PI n 

OX Li 


Th r 
1 11 X 


A 1 A 

Mid 




~> U 










55 










60 










MXd 


i fix 


Leu 


Thr 


Thr 


Lys 


His 


Pro 


Asp 


Tvr 

i y j- 


Ala 


He 


Leu 


Ala 


Ala 


Ar g 












/ u 










75 










ft n 
D u 


T I p 


Ala 
nj,d 


v a x 


Ser 


Asn 


Leu 


His 


Lys 


Glu 


Thr 


Lys 


Lys 


Val 


Phe 


Cpr- 
O e X 


nop 










ft S 










90 














Va 1 


iyjeT. 


p i n 

LaXU 


Asp 


Leu 


Tyr 


As n 


j yr 


Tip 
X X e 


Asn 


Pro 


His 


Sen 


PI V 

oxy 


T \/c 

Xjy s 


Hie 








i on 
x uu 










1 fit 
X U 3 










Tin 
X X u 








Pro 


LYier 


Val 


A l a 

MX a 


.Ly s 


Ser 


Thr 


Leu 


Asp 


He 


Val 


Leu 


Al A 
Mid 


Asn 


T 

Lys 






lie. 
ii j 




















1 9 s 








Asp 


Arg 


Leu 


Asn 




Ala 
Hid 


Tip 

lie 


Tip 
lie 


i yr 


A <?T"» 

nop 


MX y 


A c:n 


Pho 
tri le 


Cor 
O tr X 


Tyr 


Asn 




X JU 










ijj 










i 4 n 

1 *i U 










Tyr 


ir ne 


biy 


IT I le 


Lys 


Th r 

i nr 


Le u 


PI n 
Lj X LI 


Arg 


Cp y- 

Oe X 


T w r 

i yr 


Le u 


Leu 


Lys 


T 1 o 

x xe 


Asn 


IK 

x ** o 










ljU 










1 ^ S 

X D -J 










i ^n 
x ou 


Gly 


Lys 


va x 


Ala 


Glu 


rii y 


Prn 

£ X L» 


Gin 


His 


Met 


Leu 


Met 


Arg 


Va 1 

V d X 


Co r 
Oe X 


Va 1 
v d x 










X DD 










17 0 
X / u 










X / D 




Gly 


i le 


MIS 


T 

Lys 


p I n 


7A « r* 
nb(J 


Tip 
lie 


no fcJ 


Ala 


Ala 


He 


Pin 

OXU 


Th r 

1 IlX 


T w r 

i yr 


Asn 


Leu 








i on 
lOU 










185 










X i*U 








Ser 


oXU 


Arg 


T> rrv 

irp 


tr i le 


1 ilX 


n i o 


Ala 


Ser 


Prn 

IT X (J 


Thr 

1 1 IX 


Leu 


Pho 


Asn 


Ala 

mx a 






iyj 










9 nn 
z uu 










on c 

Z U D 








pi \? 
ox y 


i nr 


Asn 


Arg 


Pro 


pi t-i 

Oil 1 


Le u 


C O r 


Ser 


Cys 


Pho 
tile 


Leu 


Leu 


Ser 


Mor 


Lys 
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Z X D 










9 on 
z z u 










Asp 


Asp 


Ser 


Tip 


OX u 


pi \/ 
oiy 


T 1 o 
lie 


T \;r 

l yr 


Asp 


Thr 


Leu 


Lys 


PI n 

oi n 


Cys 


nld 


Leu 


ZZO 




















9 ^ e» 
-CJJ 










Z 4 u 


Tip 
lie 


Ser 


Lys 


Ser 


Ml d 




ox y 


Tip 
lie 


Gly Val 


Al a 

MX d 


Va 1 
V dX 


oer 


Cys 


Tie 

x xe 


Arg 










9 /l £^ 
Z4 J 










Z ju 










ZOO 




MX d 


Thr 


Gly 


Cor 
O e X 


Tyr 


lie 


MX d 


P 1 \7 

uiy 


Thr 


Asn 


PI \7 

biy 


Asn 


Cor 

oer 


Asn 


oxy 


Leu 








zou 










265 










z / u 






Vza 1 
val 


Pro 


Met 


Leu 


Arg 


\7=a 1 

v a i 


Tyr 


Asn 


Asn 


Thr 


MX a 


Arg 


Tyr 


v ax 


Asp 


bin 






275 










Z oU 










ZOO 








oiy 


Gly Asn 


Lys 


Arg 


Pro 


oxy 


/lid 


Phe 


Ala 


x x e 


Tyr 


Leu 


oXU 


Pro 


Trp 




290 










z y d 










jUU 










nii> 


Leu Asp 


Tip 

lie 


Ph O 

rile 


pi n 


php 


Leu 


Asp 


Leu 


x»y s 


Lys 


Asn 


Th r 

i nr 


pi \/ 
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Lys 


one. 

JUJ 










jlU 
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T 1 o 
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Asp 
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Va 1 
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OX U 
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,5 ft U 
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Pin 
blU 
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Pin 
oXU 






355 










^ 6n 

JOv 










jOj 








ys 


Leu 


Tyr 


riJL d 




iyr 


pi n 

ul u 


Lys 


Gin 


Gly 


mx y 


Va 1 
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A r rr 
nl y 


Lys 


Va 1 
v d X 


Va 1 
V d X 




370 










o / ~J 










JOU 
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Gin 


ull 1 


Leu 


l rp 


i yr 


Ala 
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lie 


PI 11 

OX LI 


Cpr 


Pin 
OX 11 


Thr 

1 1 1 X 


Pin 
OXU 


Thr 
1 l IX 












O U 










J? J 










H UU 


Gly 


Thr 


Pro 


i yr 


1 ic L 


Leu 


i yr 


Xjy s 
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C \T «3 

Lyo 


A c; n 
Mo 1 1 


A r rr 
MX y 


Lys 


Cor 
Oe X 


Asn 










^ Uj 










410 










A 1 
H X D 




G3 n 

Oil! 


Gin 


Asn 


Leu 


p "1 r, 
oi y 


1 1 1 X 


Tip 
lie 


X»y o 


Cys 
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As n 


Leu 


P\/e 
Lyo 


Thr 
X XIX 


Pin 
ol LI 


Tip 

1 X e 


















425 










1 jU 






Va 1 
v a x 


Glu 


Tyr 


Th r 

J. I IX 


OCX 


x»y o 


A 

no}J 


Glu 


Val 


Ala 


Va 1 
v d 1 


Lyo 


As n 


Leu 


A 1 A 

Ml d 


Ser 

Leu Ala Leu Asn Met Tyr Val Thr Ser Glu His Thr Tyr Asp Phe Lys 
450 455 460 

Lys Leu Ala Glu Val Thr Lys Val Val Val Arg Asn Leu Asn Lys Ile 
465 470 475 480 

Ile Asp Ile Asn Tyr Tyr Pro Val Pro Glu Ala Cys Leu Ser Asn Lys 
485 490 495 

Arg His Arg Pro Ile Gly Ile Gly Val Gln Gly Leu Ala Asp Ala Phe 
500 505 510 

Ile Leu Met Arg Tyr Pro Phe Glu Ser Ala Glu Ala Gln Leu Leu Asn 
515 520 525 

Lys Gln Ile Phe Glu Thr Ile Tyr Tyr Gly Ala Leu Glu Ala Ser Cys 
530 535 540 

Asp Leu Ala Lys Glu Gln Gly Pro Tyr Glu Thr Tyr Glu Gly Ser Pro 
545 550 555 560 

Val Ser Lys Gly Ile Leu Gln Tyr Asp Met Trp Asn Val Thr Pro Thr 
565 570 575 

Asp Leu Trp Asp Trp Lys Val Leu Lys Glu Lys Ile Ala Lys Tyr Gly 
580 585 590 

Ile Arg Asn Ser Leu Leu Ile Ala Pro Met Pro Thr Ala Ser Thr Ala 
595 600 605 

Gln Ile Leu Gly Asn Asn Glu Ser Ile Glu Pro Tyr Thr Ser Asn Ile 
610 615 620 

Tyr Thr Arg Arg Val Leu Ser Gly Glu Phe Gln Ile Val Asn Pro His 
625 630 635 640 

Leu Leu Lys Asp Leu Thr Glu Arg Gly Leu Trp His Glu Glu Met Lys 
645 650 655 

Asn Gln Ile Ile Ala Cys Asn Gly Ser Ile Gln Ser Ile Pro Glu Ile 
660 665 670 

Pro Asp Asp Leu Lys Gln Leu Tyr Lys Thr Val Trp Glu Ile Ser Gln 
675 680 685 

Lys Thr Val Leu Lys Met Ala Ala Glu Arg Gly Ala Phe Ile Asp Gln 
690 695 700 

Ser Gln Ser Leu Asn Ile His Ile Ala Glu Pro Asn Tyr Gly Lys Leu 
705 710 715 720 

Thr Ser Met His Phe Tyr Gly Trp Lys Gln Gly Leu Lys Thr Gly Met 
725 730 735 

Tyr Tyr Leu Arg Thr Arg Pro Ala Ala Asn Pro Ile Gln Phe Thr Leu 
740 745 750 

Asn Lys Glu Lys Leu Lys Asp Lys Glu Lys Val Ser Lys Glu Glu Glu 
755 760 765 

Glu Lys Glu Arg Asn Thr Ala Ala Met Val Cys Ser Leu Glu Asn Arg 
770 775 780 

Asp Glu Cys Leu Met Cys Gly Ser 
785 790 



<210> 472 
<211> 3117 
<212> DNA 

<213> Homo sapiens 
<400> 472 



300 
360 
420 
480 
540 



lltrnttT c 3tcgcgccc ctttgtgcgt cacgggtggc gggcgcggga aggggatttg 60 
gattgttgcg cctctgctct gaagaaagtg ctgtctggct ccaactccag ttct?tcccc 120 
tgagcagcgc ctggaaccta acccttccca ctctgtcacc ttctcgatcc cgccggcgc? ISO 
ttagagccgc agtccagtct tggatccttc agagcctcag ccactagctg cgatgcatg. 240 
aaaaotttot f! 9gCC ? CC "9"<*agt catgtttgac aaaattaca? cJcgaatcca 
aacccaaaac ttJT ^ atatggattt tgttgatcct gctcagatca cca?gaaagt 
aatccaaggc ttgtacagtg gggtcaccac agtggaacta gatactttgq ctgctqaaac 

SSScJS SSf taCta 39CaCCCtga ctatgct.tc ctggcagcca 9 ggatcgctgt 
ctacataaa? cc^^T Caaagaaa 9 C 9ttcagtgat gtgatggaag acctctataa 
S " " °"° ataa ^ 9 9 caaa <=actc tcccatggtg gccaagtcaa cattggatat 600 
caa^tSte o^* 9 gCCt 9 aattc tgctattatc tatgaccgag atttctctta 660 
" l CtC ggctttaaga cgctagagcg gtcttatttg ttgaagatca atggaaaagt 720 
taallltn 9 * C " Caacata tgttgatgag agtatctgtt gggatccaca aagfagacat 780 
lllZtaT ataatCttCt ttctgagagg tggtttactc atgct.cgcc 840 
cactctcttc aatgctggta ccaaccgccc acaactttct agctqttttc ttctaaotai- ann 
gaaagatgac agcattgaag gcatttatga cactctaaag caatgtgcat ^t^ccaa 960 
gtctgctgga ggaattggtg ttgctgtgag ttgtattcgg gctactggJa t " 1020 

lllltttTa oatcaa'o? 3tggCCttgt accgatgc?" agagta.ata alaacacagc ]°olo 
tagatatgtg gatcaaggtg ggaacaagcg tcctggggca tttgctattt acctggagcc 1140 
ttggcattta gacatctttg aattccttga tttaaagaag aacacaggaa aggaagagca 1200 
gcgtgccaga gatcttttct ttgctctttg gattccggat ctcttcatga aacgagtgga 1260 
gactaatcag gactggtctt tgatgtgtcc aaatgagtgt cctggtctgg atgaggtttg 1320 
gggagaggaa tttgagaaac tatatgcaag ttatgagaaa caaggtcgtg tccgcaaagt 1380 
tgtaaaagct cagcagcttt ggtatgccat cattgagtct cagacggaaa caggcacccc 1440 
gtatatgctc tacaaagatt cctgtaatcg aaagagcaac cagcagaacc tgggaaccat 1500 
caaatgcagc aacctgtgca cagaaatagt ggagtacacc agcaaagatg aggttgctgt 1560 
ttgtaatttg gcttccctgg ccctgaatat gtatgtcaca tcagaacaca catacgactt 1620 
taagaagttg gctgaagtca ctaaagtcgt tgtccgaaac ttgaataaaa ttattgatat 1680 
aaactactat cctgtaccag aggcatgcct atcaaataaa cgccatcgcc ccattggaat 1740 
tggggtacaa ggtctggcag atgcttttat cctgatgaga tacccttttg agagtgcaga 1800 
agcccagtta ctgaataagc agatctttga aactatttat tatggtgctc tggaagccag 1860 
ctgtgacctt gccaaggagc agggcccata cgaaacctat gagggctctc cagttagcaa 1920 
aggaattctt cagtatgata tgtggaatgt tactcctaca gacctatggg actggaaggt 1980 
tctcaaggag aagattgcaa agtatggtat aagaaacagt ttacttattg ccccgatgcc 2040 
tacagcttcc actgctcaga tcctggggaa taatgagtcc attgaacctt acaccagcaa 2100 
catctatact cgcagagtct tgtcaggaga atttcagatt gtaaatcctc acttattgaa 2160 
agatcttacc gagcggggcc tatggcatga agagatgaaa aaccagatta ttgcatgcaa 2220 
tggctctatt cagagcatac cagaaattcc tgatgacctg aagcaacttt ataaaactgt 2280 
gtgggaaatc tctcagaaaa ctgttctcaa gatggcagct gagagaggtg ctttcattga 2340 
tcaaagccaa tctttgaaca tccacattgc tgagcctaac tatggcaaac tcactagtat 2400 
gcacttctac ggctggaagc agggtttgaa gactgggatg tattatttaa ggacaagacc 2460 
agcagctaat ccaatccagt tcactctaaa taaggagaag ctaaaagata aagaaaaggt 2520 
atcaaaagag gaagaagaga aggagaggaa cacagcagcc atggtgtgct ctttggagaa 2580 
tagagatgaa tgtctgatgt gtggatcctg aggaaagact tggaagagac cagcatgtct 2640 
tcagtagcca aactacttct tgagcataga taggtatagt gggtttgctt gaggtggtaa 2700 
ggctttgctg gaccctgttg caggcaaaag gagtaattga tttaaagtac tgttaatgat 2760 
gttaatgatt tttttttaaa ctcatatatt gggattttca ccaaaataat gcttttgaaa 2820 
aaaagaaaaa aaaaacggat atattgagaa tcaaagtaga agttttagga atgcaaaata 2880 
agtcatcttg catacaggga gtggttaagt aaggtttcat cacccattta gcatgctttt 2940 
ctgaagactt cagttttgtt aaggagattt agttttactg ctttgactgg tgggtctcta 3000 
gaagcaaaac tgagtgataa ctcatgagaa gtactgatag gacctttatc tggatatggt 3060 
cctataggtt attctgaaat aaagataaac atttctaagt gaaaaaaaaa aaaaaaa 3117 



<210> 473 
<211> 674 
<212> PRT 
<213> Homo sapiens 



<400> 473 

Met Asn Ser Gly Ala Met Arg Ile His Ser Lys Gly His Phe Gln Gly 
1 5 10 15 

Gly Ile Gln Val Lys Asn Glu Lys Asn Arg Pro Ser Leu Lys Ser Leu 
20 25 30 

Lys Thr Asp Asn Arg Pro Glu Lys Ser Lys Cys Lys Pro Leu Trp Gly 
35 40 45 

Lys Val Phe Tyr Leu Asp Leu Pro Ser Val Thr Ile Ser Glu Lys Leu 
50 55 60 

Gln Lys Asp Ile Lys Asp Leu Gly Gly Arg Val Glu Glu Phe Leu Ser 
65 70 75 80 

Lys Asp Ile Ser Tyr Leu Ile Ser Asn Lys Lys Glu Ala Lys Phe Ala 
85 90 95 

Gln Thr Leu Gly Arg Ile Ser Pro Val Pro Ser Pro Glu Ser Ala Tyr 
100 105 110 

Thr Ala Glu Thr Thr Ser Pro His Pro Ser His Asp Gly Ser Ser Phe 
115 120 125 

Lys Ser Pro Asp Thr Val Cys Leu Ser Arg Gly Lys Leu Leu Val Glu 
130 135 140 

Lys Ala Ile Lys Asp His Asp Phe Ile Pro Ser Asn Ser Ile Leu Ser 
145 150 155 160 

Asn Ala Leu Ser Trp Gly Val Lys Ile Leu His Ile Asp Asp Ile Arg 
165 170 175 

Tyr Tyr Ile Glu Gln Lys Lys Lys Glu Leu Tyr Leu Leu Lys Lys Ser 
180 185 190 

Ser Thr Ser Val Arg Asp Gly Gly Lys Arg Val Gly Ser Gly Ala Gln 
195 200 205 

Lys Thr Arg Thr Gly Arg Leu Lys Lys Pro Phe Val Lys Val Glu Asp 
210 215 220 

Met Ser Gln Leu Tyr Arg Pro Phe Tyr Leu Gln Leu Thr Asn Met Pro 
225 230 235 240 

Phe Ile Asn Tyr Ser Ile Gln Lys Pro Cys Ser Pro Phe Asp Val Asp 
245 250 255 

Lys Pro Ser Ser Met Gln Lys Gln Thr Gln Val Lys Leu Arg Ile Gln 
260 265 270 

Thr Asp Gly Asp Lys Tyr Gly Gly Thr Ser Ile Gln Leu Gln Leu Lys 
275 280 285 

Glu Lys Lys Lys Lys Gly Tyr Cys Glu Cys Cys Leu Gln Lys Tyr Glu 
290 295 300 

Asp Leu Glu Thr His Leu Leu Ser Glu Gln His Arg Asn Phe Ala Gln 
305 310 315 320 

Ser Asn Gln Tyr Gln Val Val Asp Asp Ile Val Ser Lys Leu Val Phe 
325 330 335 

Asp Phe Val Glu Tyr Glu Lys Asp Thr Pro Lys Lys Lys Arg Ile Lys 
340 345 350 

Tyr Ser Val Gly Ser Leu Ser Pro Val Ser Ala Ser Val Leu Lys Lys 
355 360 365 

Thr Glu Gln Lys Glu Lys Val Glu Leu Gln His Ile Ser Gln Lys Asp 
370 375 380 

Cys Gln Glu Asp Asp Thr Thr Val Lys Glu Gln Asn Phe Leu Tyr Lys 
385 390 395 400 

Glu Thr Gln Glu Thr Glu Lys Lys Leu Leu Phe Ile Ser Glu Pro Ile 
405 410 415 

Pro His Pro Ser Asn Glu Leu Arg Gly Leu Asn Glu Lys Met Ser Asn 
420 425 430 

Lys Cys Ser Met Leu Ser Thr Ala Glu Asp Asp Ile Arg Gln Asn Phe 
435 440 445 

Thr Gln Leu Pro Leu His Lys Asn Lys Gln Glu Cys Ile Leu Asp Ile 
450 455 460 

Ser Glu His Thr Leu Ser Glu Asn Asp Leu Glu Glu Leu Arg Val Asp 
465 470 475 480 

His Tyr Lys Cys Asn Ile Gln Ala Ser Val His Val Ser Asp Phe Ser 
485 490 495 

Thr Asp Asn Ser Gly Ser Gln Pro Lys Gln Lys Ser Asp Thr Val Leu 
500 505 510 

Phe Pro Ala Lys Asp Leu Lys Glu Lys Asp Leu His Ser Ile Phe Thr 
515 520 525 

His Asp Ser Gly Leu Ile Thr Ile Asn Ser Ser Gln Glu His Leu Thr 
530 535 540 

Val Gln Ala Lys Ala Pro Phe His Thr Pro Pro Glu Glu Pro Asn Glu 
545 550 555 560 

Cys Asp Phe Lys Asn Met Asp Ser Leu Pro Ser Gly Lys Ile His Arg 
565 570 575 

Lys Val Lys Ile Ile Leu Gly Arg Asn Arg Lys Glu Asn Leu Glu Pro 
580 585 590 

Asn Ala Glu Phe Asp Lys Arg Thr Glu Phe Ile Thr Gln Glu Glu Asn 
595 600 605 

Arg Ile Cys Ser Ser Pro Val Gln Ser Leu Leu Asp Leu Phe Gln Thr 
610 615 620 

Ser Glu Glu Lys Ser Glu Phe Leu Gly Phe Thr Ser Tyr Thr Glu Lys 
625 630 635 640 

Ser Gly Ile Cys Asn Val Leu Asp Ile Trp Glu Glu Glu Asn Ser Asp 
645 650 655 

Asn Leu Leu Thr Ala Phe Phe Ser Ser Pro Ser Thr Ser Thr Phe Thr 
660 665 670 

Gly Phe 



<210> 474 
<211> 3729 
<212> DNA 

<213> Homo sapiens 
<400> 474 

ggggcacggc gcgtggcgtg agacggggcg gggcgcgcgt atcggcgccg cggccgcgtg 60 
acgcgttttc aaatcttcaa ccgccgcagc ccactcgttt gtgctttgcg ccttcctcct 120 
ccgcgccttg gagccggatc cggccccgga aacccgacct gcagacgcgg tacctctact 180 
gcgtagaggc cgtagctggc ggaaggagag aggcggccgt cctgtcaaca ggccggggga 240 
agccgtgctt tcgcggctgc ccggtgcgac actttctccg gacccagcat gtaggtgccg 300 
ggcgactgcc atgaactccg gagccatgag gatccacagt aaaggacatt tccagggtgg 360 
aatccaagtc aaaaatgaaa aaaacagacc atctctgaaa tctctgaaaa ctgataacag 420 
gccagaaaaa tccaaatgta agccactttg gggaaaagta ttttaccttg acttaccttc 480 
tgtcaccata tctgaaaaac ttcaaaagga cattaaggat ctgggagggc gagttgaaga 540 
atttctcagc aaagatatca gttatcttat ttcaaataag aaggaagcta aatttgcaca 600 
aaccttgggt cgaatttctc ctgtaccaag tccagaatct gcatatactg cagaaaccac 660 
ttcacctcat cccagccatg atggaagttc atttaagtca ccagacacag tgtgtttaag 720 
cagaggaaaa ttattagttg aaaaagctat caaggaccat gattttattc cttcaaatag 780 
tatattatca aatgccttgt catggggagt aaaaattctt catattgatg acattagata 840 
ctacattgaa caaaagaaaa aagagttgta tttactcaag aaatcaagta cttcagtaag 900 
agatgggggc aaaagagttg gtagtggtgc acaaaaaaca agaacaggaa gactcaaaaa 960 
gccttttgta aaggtggaag atatgagcca actttatagg ccattttatc ttcagctgac 1020 
caatatgcct tttataaatt attctattca gaagccctgc agtccatttg atgtagacaa 1080 
gccatctagt atgcaaaagc aaactcaggt taaactaaga atccaaacag atggcgataa 1140 
gtatggtgga acctcaattc aactccagtt gaaagagaag aagaaaaaag gatattgtga 1200 
atgttgcttg cagaaatatg aagatctaga aactcacctt ctaagtgagc aacacagaaa 1260 
ctttgcacag agtaaccagt atcaagttgt tgatgatatt gtatctaagt tagtttttga 1320 
ctttgtggaa tatgaaaagg acacacctaa aaagaaaaga ataaaataca gtgttggatc 1380 
cctttctcct gtttctgcaa gtgtcctgaa aaagactgaa caaaaggaaa aagtggaatt 1440 
gcaacatatt tctcagaaag attgccagga agatgataca acagtgaagg agcagaattt 1500 
cctgtataaa gagacccagg aaactgaaaa aaagctcctg tttatttcag agcccatccc 1560 
ccacccttca aatgaattga gagggcttaa tgagaaaatg agtaataaat gttccatgtt 1620 
aagtacagct gaagatgaca taagacagaa ttttacacag ctacctctac ataaaaacaa 1680 
acaggaatgc attcttgaca tttccgaaca cacattaagt gaaaatgact tagaagaact 1740 
aagggtagat cactataaat gtaacataca ggcatctgta catgtttctg atttcagtac 1800 
agataatagt ggatctcaac caaaacagaa gtcagatact gtgctttttc cagcaaagga 1860 
tctcaaggaa aaggaccttc attcaatatt tactcatgat tctggtctga taacaataaa 1920 
cagttcacaa gagcacctaa ctgttcaggc aaaggctcca ttccatactc ctcctgagga 1980 
acccaatgaa tgtgacttca agaatatgga tagtttacct tctggtaaaa tacatcgaaa 2040 
agtgaaaata atattaggac gaaatagaaa agaaaatctg gaaccaaatg ctgaatttga 2100 
taaaagaact gaatttatta cacaagaaga aaacagaatt tgtagttcac cggtacagtc 2160 
tttactagac ttgtttcaga ctagtgaaga gaaatcagaa tttttgggtt tcacaagcta 2220 
cacagaaaag agtggtatat gcaatgtttt agatatttgg gaagaggaaa attcagataa 2280 
tctgttaaca gcgtttttct cgtccccttc aacttctaca tttactggct tttagaattt 2340 
aaaaaatgca tacttttcag aagtgataag gatcatattc ttgaaatttt tataaatatg 2400 
tatggaaatt cttaggattt ttttaccagc tttgtttaca gacccaaatg taaatattaa 2460 
aaataaatat ttgcaatttt ctacagaatt gaatacctgt taaagaaaaa ttacagaata 2520 
aacttgtgac tggtcttgtt ttacattata tatatgttcg taattgtttc ctgagaatta 2580 
caatgaataa taatttgctt tgtcactgaa aaccaccagt gaagtgcaat ttggggaata 2640 
tcaaacttag cattatacat ttggatattc tagttgtatt gtaaatttta aaagattatc 2700 
aggataacat gaccttgcct tgaaaaacct aaatttcata cagaaaggaa aaatacattt 2760 
ctatgcttga attctggaaa tcatcagaaa tattgacact tgagggtcac tggtggacca 2820 
tgctagtatt ttggtgctga gtaattagat tatgtatagg ggtaactgga agccatgaag 2880 
gaaggactta accaggctac caagtggaat ataatttatc tgctattttt aattacttac 2940 
cttgtaaata actttaataa tattcaaaag ttgactctcc tgtggctcat gccacaggat 3000 
cccagcactt tgggaggccg aggcaggctg atcacgaggt cagaagatca agaccatcct 3060 
ggccaacatg gtgaaacctt gtctactaaa aatacaaaaa attagctggg catggcggca 3120 
tgcgcctgta gtcccagcta ctccctccag aggctgagaa ttgcttgaac ctgggaagtg 3180 
caggttgcag tgagccaaga ttgtgccact gtattccaga ctgggtgaca gagtgagact 3240 
ccatctcaaa aaaaaaaaag ttgactctac cccctaattt ggtaggagat gaaggagaaa 3300 
aggatggcat tgaattatag atacagtttt gggatatata caaggattgc cttgatctgg 3360 
cacttactga tacaagcatt tggagaagag aaaattcaaa tataagaaat ttcaatttga 3420 
atctgtggat ataaggcatt tgcctttgag tatgttcttg taggagtaag Bctaccacca 3480 
gtgacattca attggttcta ttgcatagta catgaatcta ttgtcactaa aaattaggtt 3540 
actaatctta atttttttga aatatatgat tctcagtaca ggactaattc atatgctttt 3600 
cctgcataca ttattttgat catttggata acatcaatga acactggctt tttaacacct 3660 
ttgtttttca taccagtacc tgaagtaggc tcaataaaag agtctttaca gtgaaaaaaa 3720 
aaaaaaaaa 3729 



<210> 475 
<211> 255 
<212> PRT 
<213> Homo sapiens 

<400> 475 

Met Val Lys Leu Thr Ala Glu Leu Ile Glu Gln Ala Ala Gln Tyr Thr 
1 5 10 15 

Asn Ala Val Arg Asp Arg Glu Leu Asp Leu Arg Gly Tyr Lys Ile Pro 
20 25 30 

Val Ile Glu Asn Leu Gly Ala Thr Leu Asp Gln Phe Asp Ala Ile Asp 
35 40 45 

Phe Ser Asp Asn Glu Ile Arg Lys Leu Asp Gly Phe Pro Leu Leu Arg 
50 55 60 

Arg Leu Lys Thr Leu Leu Val Asn Asn Asn Arg Ile Cys Arg Ile Gly 
65 70 75 80 

Glu Gly Leu Asp Gln Ala Leu Pro Cys Leu Thr Glu Leu Ile Leu Thr 
85 90 95 

Asn Asn Ser Leu Val Glu Leu Gly Asp Leu Asp Pro Leu Ala Ser Leu 
100 105 110 

Lys Ser Leu Thr Tyr Leu Ser Ile Leu Arg Asn Pro Val Thr Asn Lys 
115 120 125 

Lys His Tyr Arg Leu Tyr Val Ile Tyr Lys Val Pro Gln Val Arg Val 
130 135 140 

Leu Asp Phe Gln Lys Val Lys Leu Lys Glu Arg Gln Glu Ala Glu Lys 
145 150 155 160 

Met Phe Lys Gly Lys Arg Gly Ala Gln Leu Ala Lys Asp Ile Ala Arg 
165 170 175 

Arg Ser Lys Thr Phe Asn Pro Gly Ala Gly Leu Pro Thr Asp Lys Lys 
180 185 190 

Arg Gly Gly Pro Ser Pro Gly Asp Val Glu Ala Ile Lys Asn Ala Ile 
195 200 205 

Ala Asn Ala Ser Thr Leu Ala Glu Val Glu Arg Leu Lys Gly Leu Leu 
210 215 220 

Gln Ser Gly Gln Ile Pro Gly Arg Glu Arg Arg Ser Gly Pro Thr Asp 
225 230 235 240 

Asp Gly Glu Glu Glu Met Glu Glu Asp Thr Val Thr Asn Gly Ser 
245 250 255 



<210> 476 

<211> 1054 

<212> DNA 

<213> Homo sapiens 

<400> 476 

gaattccgcg ggaggccacg ggctttccac agcgcggggg aacgggaggc tgcaggatgg 60 
tcaagctgac ggcggagctg atcgagcagg cggcgcagta caccaacgcg gtgcgcgacc 120 
gggagctgga cctccggggg tataaaattc ccgtcattga aaatctaggt gctacgttag 180 
accagtttga tgctattgat ttttctgaca atgagatcag gaaactggat ggttttcctt 240 
tgttgagaag actgaaaaca ttgttagtga acaacaacag aatatgccgt ataggtgagg 300 
gacttgatca ggctctgccc tgtctgacag aactcattct caccaataat agtctcgtgg 360 
aactgggtga tctggaccct ctggcatctc tcaaatcgct gacttaccta agtatcctaa 420 
gaaatccggt aaccaataag aagcattaca gattgtatgt gatttataaa gttccgcaag 480 
tcagagtact ggatttccag aaagtgaaac taaaagagcg tcaggaagca gagaaaatgt 540 
tcaagggcaa acggggtgca cagcttgcaa aggatattgc caggagaagc aaaactttta 600 
atccaggtgc tggtttgcca actgacaaaa agagaggtgg gccatctcca ggggatgtag 660 
aagcaatcaa gaatgccata gcaaatgctt caactctggc tgaagtggag aggctgaagg 720 
ggttgctgca gtctggtcag atccctggca gagaacgcag atcagggccc actgatgatg 780 
gtgaagaaga gatggaagaa gacacagtca caaacgggtc ctgagcagtg aggcagatgt 840 
ataataatag gccctcttgg aacaagtctt gcttttcgaa catggtataa tagccttgtt 900 
tgtgttagca aagtggaatc tatcagcatt gttgaaatgc ttaagactgc tgctgataat 960 
tttgtaatat aagttttgaa atctaaatgt caattttcta caaattataa aaataaactc 1020 
cactctctat gctaaaaaaa aaaaaaagga attc 1054 

<210> 477 
<211> 241 
<212> PRT 
<213> Homo sapiens 

<400> 477 

Met Arg His Gln Gln Thr Glu Arg Gln Asp Pro Ser Gln Pro Leu Ser 
1 5 10 15 

Arg Gln His Gly Thr Tyr Arg Gln Ile Phe His Pro Glu Gln Leu Ile 
20 25 30 

Thr Gly Lys Glu Asp Ala Ala Asn Asn Tyr Ala Trp Gly His Tyr Thr 
35 40 45 

Ile Gly Lys Glu Phe Ile Asp Leu Leu Leu Asp Arg Ile Arg Lys Leu 
50 55 60 

Ala Asp Gln Cys Thr Gly Leu Gln Gly Phe Leu Val Phe His Ser Leu 
65 70 75 80 

Gly Arg Gly Thr Gly Ser Asp Val Thr Ser Phe Leu Met Glu Trp Leu 
85 90 95 

Ser Val Asn Tyr Gly Lys Lys Ser Lys Leu Gly Phe Ser Ile Tyr Pro 
100 105 110 

Ala Pro Gln Val Ser Thr Ala Met Val Gln Pro Tyr Asn Ser Ile Leu 
115 120 125 

Thr Thr His Thr Thr Leu Glu His Ser Asp Cys Ala Phe Met Val Asp 
130 135 140 

Asn Lys Ala Ile Tyr Asp Ile Cys His Arg Asn Leu Asp Ile Glu Arg 
145 150 155 160 

Pro Thr Tyr Thr Asn Leu Asn Arg Leu Ile Ser Gln Ile Val Ser Ser 
165 170 175 

Ile Thr Ala Ser Leu Arg Phe Asp Gly Ala Leu Asn Val Asp Leu Thr 
180 185 190 

Glu Phe Gln Thr Asn Leu Val Ser Tyr Leu Thr Ser Thr Ser Pro Trp 
195 200 205 

Pro Pro Met His Gln Ser Ser Leu Gln Lys Lys Tyr Thr Thr Ser Ser 
210 215 220 

Cys Arg Trp Gln Arg Leu Pro Met Pro Ala Leu Ser Leu Pro Thr Arg 
225 230 235 240 

Trp 

<210> 478 
<211> 1380 
<212> DNA 

<213> Homo sapiens 
<400> 478 

gtctctgccc atccgcgcac ccgggcttcg gctggagagg gccagctcgc ttcaggaggc 60 
cgaaccccgt tcccaccaac cctctcagct cagacgcggg gtgctgagtc acgggggggg 120 
ggtggttctg tggatagttg gaatgcatac acagaggaaa gggggatgcg gcaccagcag 180 
acagagagac aagaccccag ccagcccctg tccaggcagc atggcacata ccgccagatc 240 
ttccatccag agcagctcat cacaggcaag gaagatgctg ccaataacta tgcctggggc 300 
cactacacca ttgggaagga gttcatcgac ctgctactgg accggattcg gaagctggct 360 
gaccagtgca caggacttca gggcttcctg gtgttccaca gccttggtcg gggcactggc 420 
tctgacgtca cctcattcct gatggagtgg ctttctgtta actatggcaa gaaatccaag 480 
ctgggattct ccatctaccc agccccccag gtgtctacag ccatggtcca gccctacaac 540 
tctatcctga ccacccacac caccctggag cactcagact gtgccttcat ggtggacaac 600 
aaagcaatct atgacatctg ccaccgcaac ctagacattg agcgcccaac ctacaccaac 660 
ctcaatcgcc tcattagcca aattgtctcc tccatcacag cttctctgcg ctttgacggg 720 
gccctcaatg tggacctgac agagttccag accaacctgg tgtcctacct cacatccact 780 
tccccctggc cacctatgca ccagtcatct ctgcagaaaa agtataccac gagcagctgt 840 
cggtggcaga gattaccaat gcctgctttg agcctgccaa ccagatggtg aagtgtgatc 900 
cccggcacgg caagtacatg gcctgctgcc tgctatacca tggagatgtg gtgcccaagg 960 
atgtcaacgc tgccattgct gccatcaaga ccaagtgcag cattcagttt gtggactggt 1020 
gccccacagg ctttaaggtt gatatcaatc accagcctcc cactgtggtg cctgggagtg 1080 
acctggtaaa gtgcaacgtg ccatgtgcat gctgagcaac atgacagcca tcactatggc 1140 
ctgggcccgc ctggaccaca agtttgacct gatgtatgcc aagagggcgt ttgggcactg 1200 
atatgtgggt gagggcatgg aggagggtga gttctccaag gcccatgagg atatgactgc 1260 
cctggagaag gattacaagg aggtgggcat ggatagtgtg gagtgtgggg aagaaaagat 1320 
aggggggatg aatactaggg gaatactgtg tgtctgtcct acataaagtg ctgtggcctt 1380 



<210> 479 
<211> 175 
<212> PRT 

<213> Homo sapiens 
<400> 479 

Met Thr Asp Cys Glu Phe Gly Tyr Ile Tyr Arg Leu Ala Gln Asp Tyr 
1 5 10 15 

Leu Gln Cys Val Leu Gln Ile Pro Gln Pro Gly Ser Gly Pro Ser Lys 
20 25 30 

Thr Ser Arg Val Leu Gln Asn Val Ala Phe Ser Val Gln Lys Glu Val 
35 40 45 

Glu Lys Asn Leu Lys Ser Cys Leu Asp Asn Val Asn Val Val Ser Val 
50 55 60 

Asp Thr Ala Arg Thr Leu Phe Asn Gln Val Met Glu Lys Glu Phe Glu 
65 70 75 80 

Asp Gly Ile Ile Asn Trp Gly Arg Ile Val Thr Ile Phe Ala Phe Glu 
85 90 95 

Gly Ile Leu Ile Lys Lys Leu Leu Arg Gln Gln Ile Ala Pro Asp Val 
100 105 110 

Asp Thr Tyr Lys Glu Ile Ser Tyr Phe Val Ala Glu Phe Ile Met Asn 
115 120 125 

Asn Thr Gly Glu Trp Ile Arg Gln Asn Gly Gly Trp Glu Asn Gly Phe 
130 135 140 

Val Lys Lys Phe Glu Pro Lys Ser Gly Trp Met Thr Phe Leu Glu Val 
145 150 155 160 

Thr Gly Lys Ile Cys Glu Met Leu Ser Leu Leu Lys Gln Tyr Cys 
165 170 175 

<210> 480 
<211> 885 
<212> DNA 

<213> Homo sapiens 
<400> 480 

agcctacgca cgaaagtgac tagggaggaa ggatattata aagtgatgca aacagaaatt 60 

ccaccagcct ccatgtatca tcatgtgtca taactcagtc aagctcagtg agcattctca 120 

gcacattgcc tcaacagctt caaggtgagc cagctcaaga ctttgctctc caccaggcag 180 

aagatgacag actgtgaatt tggatatatt tacaggctgg ctcaggacta tctgcagtgc 240 

gtcctacaga taccacaacc tggatcaggt ccaagcaaaa cgtccagagt gctacaaaat 300 

gttgcgttct cagtccaaaa agaagtggaa aagaatctga agtcatgctt ggacaatgtt 360 

aatgttgtgt ccgtagacac tgccagaaca ctattcaacc aagtgatgga aaaggagttt 420 

gaagacggca tcattaactg gggaagaatt gtaaccatat ttgcatttga aggtattctc 480 

atcaagaaac ttctacgaca gcaaattgcc ccggatgtgg atacctataa ggagatttca 540 

tattttgttg cggagttcat aatgaataac acaggagaat ggataaggca aaacggaggc 600 

tgggaaaatg gctttgtaaa gaagtttgaa cctaaatctg gctggatgac ttttctagaa 660 

gttacaggaa agatctgtga aatgctatct ctcctgaagc aatactgttg accagaaagg 720 

acactccata ttgtgaaacc ggcctaattt ttctgactga tatggaaacg attgccaaca 780 

catacttcta cttttaaata aacaactttg atgatgtaac ttgaccttcc agagttatgg 840 

aaattttgtc cccatgtaat gaataaattg tatgtatttt tctct 885 

<210> 481 
<211> 104 
<212> PRT 

<213> Homo sapiens 
<400> 481 



Met Ser His Thr Ile Leu Leu Val Gln Pro Thr Lys Arg Pro Glu Gly 
1 5 10 15 

Arg Thr Tyr Ala Asp Tyr Glu Ser Val Asn Glu Cys Met Glu Gly Val 
20 25 30 

Cys Lys Met Tyr Glu Glu His Leu Lys Arg Met Asn Pro Asn Ser Pro 
35 40 45 

Ser Ile Thr Tyr Asp Ile Ser Gln Leu Phe Asp Phe Ile Asp Asp Leu 
50 55 60 

Ala Asp Leu Ser Cys Leu Val Tyr Arg Ala Asp Thr Gln Thr Tyr Gln 
65 70 75 80 

Pro Tyr Asn Lys Asp Trp Ile Lys Glu Lys Ile Tyr Val Leu Leu Arg 
85 90 95 

Arg Gln Ala Gln Gln Ala Gly Lys 
100 



<210> 482 
<211> 815 
<212> DNA 

<213> Homo sapiens 

<400> 482 

gcggcgttgt agttaagctc gtgtaacggc ggcggtgtcg gcagctgctg tagcgaagag 60 
agtttggcgc gatgtctcac accattttgc tggtacagcc taccaagagg ccagaaggca 120 
gaacttatgc tgactacgaa tctgtgaatg aatgcatgga aggtgtttgt aaaatgtatg 180 
aagaacatct gaaaagaatg aatcccaaca gtccctctat cacatatgac atcagtcagt 240 
tgtttgattt catcgatgat ctggcagacc tcagctgcct ggtttaccga gctgataccc 300 
agacatacca gccttataac aaagactgga ttaaagagaa gatctacgtg ctccttcgtc 360 
ggcaggccca acaggctggg aaataattgt gttggaagca ctgggggggt tggggtgggc 420 
ttggaacaca ggtgtgtaca gcgtgctgta gtggaagttt tgtatcatag taatcctgtt 480 
tccactttgt tatactctag ccaagattga ctgtattaga tgaaatgtga ggatcttgtt 540 
caatcggaaa cccccgttac ctcctctttt tctttctctt tctttttttt ttttttactt 600 
aaacattttt atgatgattt agatggaagt tgttcttcgt cacttaatgt tggttccagt 660 
ccttcaactg ttcatatcta ctttataaca ttcacatact aacccttctt caagatgggg 720 
tggggggtgg aaatgcagtt tagccatgtc ctcaagataa agtcttggta aaaataaata 780 
aatgtccttt agttataaaa aaaaaaaaaa aaaaa 815 

<210> 483 
<211> 857 
<212> PRT 

<213> Homo sapiens 
<400> 483 

Met Glu Ser Glu Asp Leu Ser Gly Arg Glu Leu Thr Ile Asp Ser Ile 
1 5 10 15 

Met Asn Lys Val Arg Asp Ile Lys Asn Lys Phe Lys Asn Glu Asp Leu 
20 25 30 

Thr Asp Glu Leu Ser Leu Asn Lys Ile Ser Ala Asp Thr Thr Asp Asn 
35 40 45 

Ser Gly Thr Val Asn Gln Ile Met Met Met Ala Asn Asn Pro Glu Asp 
50 55 60 

Trp Leu Ser Leu Leu Leu Lys Leu Glu Lys Asn Ser Val Pro Leu Ser 
65 70 75 80 

Asp Ala Leu Leu Asn Lys Leu Ile Gly Arg Tyr Ser Gln Ala Ile Glu 
85 90 95 

Ala Leu Pro Pro Asp Lys Tyr Gly Gln Asn Glu Ser Phe Ala Arg Ile 
100 105 110 

Gln Val Arg Phe Ala Glu Leu Lys Ala Ile Gln Glu Pro Asp Asp Ala 
115 120 125 

Arg Asp Tyr Phe Gln Met Ala Arg Ala Asn Cys Lys Lys Phe Ala Phe 
130 135 140 

Val His Ile Ser Phe Ala Gln Phe Glu Leu Ser Gln Gly Asn Val Lys 
145 150 155 160 

Lys Ser Lys Gln Leu Leu Gln Lys Ala Val Glu Arg Gly Ala Val Pro 
165 170 175 

Leu Glu Met Leu Glu Ile Ala Leu Arg Asn Leu Asn Leu Gln Lys Lys 
180 185 190 

Gln Leu Leu Ser Glu Glu Glu Lys Lys Asn Leu Ser Ala Ser Thr Val 
195 200 205 

Leu Thr Ala Gln Glu Ser Phe Ser Gly Ser Leu Gly His Leu Gln Asn 
210 215 220 

Arg Asn Asn Ser Cys Asp Ser Arg Gly Gln Thr Thr Lys Ala Arg Phe 
225 230 235 240 

Leu Tyr Gly Glu Asn Met Pro Pro Gln Asp Ala Glu Ile Gly Tyr Arg 
245 250 255 

Asn Ser Leu Arg Gln Thr Asn Lys Thr Lys Gln Ser Cys Pro Phe Gly 
260 265 270 

Arg Val Pro Val Asn Leu Leu Asn Ser Pro Asp Cys Asp Val Lys Thr 
275 280 285 

Asp Asp Ser Val Val Pro Cys Phe Met Lys Arg Gln Thr Ser Arg Ser 
290 295 300 

Glu Cys Arg Asp Leu Val Val Pro Gly Ser Lys Pro Ser Gly Asn Asp 
305 310 315 320 

Ser Cys Glu Leu Arg Asn Leu Lys Ser Val Gln Asn Ser His Phe Lys 
325 330 335 

Glu Pro Leu Val Ser Asp Glu Lys Ser Ser Glu Leu Ile Ile Thr Asp 
340 345 350 

Ser Ile Thr Leu Lys Asn Lys Thr Glu Ser Ser Leu Leu Ala Lys Leu 
355 360 365 

Glu Glu Thr Lys Glu Tyr Gln Glu Pro Glu Val Pro Glu Ser Asn Gln 
370 375 380 

Lys Gln Trp Gln Ser Lys Arg Lys Ser Glu Cys Ile Asn Gln Asn Pro 
385 390 395 400 

Ala Ala Ser Ser Asn His Trp Gln Ile Pro Glu Leu Ala Arg Lys Val 
405 410 415 

Asn Thr Glu Gln Lys His Thr Thr Phe Glu Gln Pro Val Phe Ser Val 
420 425 430 

Ser Lys Gln Ser Pro Pro Ile Ser Thr Ser Lys Trp Phe Asp Pro Lys 
435 440 445 

Ser Ile Cys Lys Thr Pro Ser Ser Asn Thr Leu Asp Asp Tyr Met Ser 
450 455 460 

Cys Phe Arg Thr Pro Val Val Lys Asn Asp Phe Pro Pro Ala Cys Gln 
465 470 475 480 

Leu Ser Thr Pro Tyr Gly Gln Pro Ala Cys Phe Gln Gln Gln Gln His 
485 490 495 

Gln Ile Leu Ala Thr Pro Leu Gln Asn Leu Gln Val Leu Ala Ser Ser 
500 505 510 

Ser Ala Asn Glu Cys Ile Ser Val Lys Gly Arg Ile Tyr Ser Ile Leu 
515 520 525 

Lys Gln Ile Gly Ser Gly Gly Ser Ser Lys Val Phe Gln Val Leu Asn 
530 535 540 

Glu Lys Lys Gln Ile Tyr Ala Ile Lys Tyr Val Asn Leu Glu Glu Ala 
545 550 555 560 

Asp Asn Gln Thr Leu Asp Ser Tyr Arg Asn Glu Ile Ala Tyr Leu Asn 
565 570 575 

Lys Leu Gln Gln His Ser Asp Lys Ile Ile Arg Leu Tyr Asp Tyr Glu 
580 585 590 

Ile Thr Asp Gln Tyr Ile Tyr Met Val Met Glu Cys Gly Asn Ile Asp 
595 600 605 

Leu Asn Ser Trp Leu Lys Lys Lys Lys Ser Ile Asp Pro Trp Glu Arg 
610 615 620 

Lys Ser Tyr Trp Lys Asn Met Leu Glu Ala Val His Thr Ile His Gln 
625 630 635 640 

His Gly Ile Val His Ser Asp Leu Lys Pro Ala Asn Phe Leu Ile Val 
645 650 655 

Asp Gly Met Leu Lys Leu Ile Asp Phe Gly Ile Ala Asn Gln Met Gln 
660 665 670 

Pro Asp Thr Thr Ser Val Val Lys Asp Ser Gln Val Gly Thr Val Asn 
675 680 685 

Tyr Met Pro Pro Glu Ala Ile Lys Asp Met Ser Ser Ser Arg Glu Asn 
690 695 700 

Gly Lys Ser Lys Ser Lys Ile Ser Pro Lys Ser Asp Val Trp Ser Leu 
705 710 715 720 

Gly Cys Ile Leu Tyr Tyr Met Thr Tyr Gly Lys Thr Pro Phe Gln Gln 
725 730 735 

Ile Ile Asn Gln Ile Ser Lys Leu His Ala Ile Ile Asp Pro Asn His 
740 745 750 

Glu Ile Glu Phe Pro Asp Ile Pro Glu Lys Asp Leu Gln Asp Val Leu 
755 760 765 

Lys Cys Cys Leu Lys Arg Asp Pro Lys Gln Arg Ile Ser Ile Pro Glu 
770 775 780 

Leu Leu Ala His Pro Tyr Val Gln Ile Gln Thr His Pro Val Asn Gln 
785 790 795 800 

Met Ala Lys Gly Thr Thr Glu Glu Met Lys Tyr Val Leu Gly Gln Leu 
805 810 815 

Val Gly Leu Asn Ser Pro Asn Ser Ile Leu Lys Ala Ala Lys Thr Leu 
820 825 830 

Tyr Glu His Tyr Ser Gly Gly Glu Ser His Asn Ser Ser Ser Ser Lys 
835 840 845 

Thr Phe Glu Lys Lys Arg Gly Lys Lys 
850 855 



<210> 484 
<211> 2984 
<212> DNA 

<213> Homo sapiens 
<400> 484 

ggaaattcaa acgtgtttgc ggaaaggagt ttgggttcca tcttttcatt tccccagcgc 60 
agctttctgt agaaatggaa tccgaggatt taagtggcag agaattgaca attgattcca 120 
taatgaacaa agtgagagac attaaaaata agtttaaaaa tgaagacctt actgatgaac 180 
taagcttgaa taaaatttct gctgatacta cagataactc gggaactgtt aaccaaatta 240 
tgatgatggc aaacaaccca gaggactggt tgagtttgtt gctcaaacta gagaaaaaca 300 
gtgttccgct aagtgatgct cttttaaata aattgattgg tcgttacagt caagcaattg 360 
aagcgcttcc cccagataaa tatggccaaa atgagagttt tgctagaatt caagtgagat 420 
ttgctgaatt aaaagctatt caagagccag atgatgcacg tgactacttt caaatggcca 480 
gagcaaactg caagaaattt gcttttgttc atatatcttt tgcacaattt gaactgtcac 540 
aaggtaatgt caaaaaaagt aaacaacttc ttcaaaaagc tgtagaacgt ggagcagtac 600 
cactagaaat gctggaaatt gccctgcgga atttaaacct ccaaaaaaag cagctgcttt 660 
cagaggagga aaagaagaat ttatcagcat ctacggtatt aactgcccaa gaatcatttt 720 
ccggttcact tgggcattta cagaatagga acaacagttg tgattccaga ggacagacta 780 
ctaaagccag gtttttatat ggagagaaca tgccaccaca agatgcagaa ataggttacc 840 
ggaattcatt gagacaaact aacaaaacta aacagtcatg cccatttgga agagtcccag 900 
ttaaccttct aaatagccca gattgtgatg tgaagacaga tgattcagtt gtaccttgtt 960 
ttatgaaaag acaaacctct agatcagaat gccgagattt ggttgtgcct ggatctaaac 1020 
caagtggaaa tgattcctgt gaattaagaa atttaaagtc tgttcaaaat agtcatttca 1080 
aggaacctct ggtgtcagat gaaaagagtt ctgaacttat tattactgat tcaataaccc 1140 
tgaagaataa aacggaatca agtcttctag ctaaattaga agaaactaaa gagtatcaag 1200 
aaccagaggt tccagagagt aaccagaaac agtggcaatc taagagaaag tcagagtgta 1260 
ttaaccagaa tcctgctgca tcttcaaatc actggcagat tccggagtta gcccgaaaag 1320 
ttaatacaga gcagaaacat accacttttg agcaacctgt cttttcagtt tcaaaacagt 1380 
caccaccaat atcaacatct aaatggtttg acccaaaatc tatttgtaag acaccaagca 1440 
gcaatacctt ggatgattac atgagctgtt ttagaactcc agttgtaaag aatgactttc 1500 
cacctgcttg tcagttgtca acaccttatg gccaacctgc ctgtttccag cagcaacagc 1560 
atcaaatact tgccactcca cttcaaaatt tacaggtttt agcatcttct tcagcaaatg 1620 
aatgcatttc ggttaaagga agaatttatt ccattttaaa gcagatagga agtggaggtt 1680 
caagcaaggt atttcaggtg ttaaatgaaa agaaacagat atatgctata aaatatgtga 1740 
acttagaaga agcagataac caaactcttg atagttaccg gaacgaaata gcttatttga 1800 
ataaactaca acaacacagt gataagatca tccgacttta tgattatgaa atcacggacc 1860 
agtacatcta catggtaatg gagtgtggaa atattgatct taatagttgg cttaaaaaga 1920 
aaaaatccat tgatccatgg gaacgcaaga gttactggaa aaatatgtta gaggcagttc 1980 
acacaatcca tcaacatggc attgttcaca gtgatcttaa accagctaac tttctgatag 2040 
ttgatggaat gctaaagcta attgattttg ggattgcaaa ccaaatgcaa ccagatacaa 2100 
caagtgttgt taaagattct caggttggca cagttaatta tatgccacca gaagcaatca 2160 
aagatatgtc ttcctccaga gagaatggga aatctaagtc aaagataagc cccaaaagtg 2220 
atgtttggtc cttaggatgt attttgtact atatgactta cgggaaaaca ccatttcagc 2280 
agataattaa tcagatttct aaattacatg ccataattga tcctaatcat gaaattgaat 2340 
ttcccgatat tccagagaaa gatcttcaag atgtgttaaa gtgttgttta aaaagggacc 2400 
caaaacagag gatatccatt cctgagctcc tggctcatcc ctatgttcaa attcaaactc 2460 
atccagttaa ccaaatggcc aagggaacca ctgaagaaat gaaatatgtt ctgggccaac 2520 
ttgttggtct gaattctcct aactccattt tgaaagctgc taaaacttta tatgaacact 2580 
atagtggtgg tgaaagtcat aattcttcat cctccaagac ttttgaaaaa aaaaggggaa 2640 
aaaaatgatt tgcagttatt cgtaatgtca aataccacct ataaaatata ttggactgtt 2700 
atactcttga atccctgtgg aaatctacat ttgaagacaa catcactctg aagtgttatc 2760 
agcaaaaaaa attcagtaga ttatctttaa aagaaaactg taaaaatagc aaccacttat 2820 
ggtactgtat atattgtaga cttgttttct ctgttttatg ctcttgtgta atctacttga 2880 
catcatttta ctcttggaat agtgggtgga tagcaagtat attctaaaaa actttgtaaa 2940 
taaagttttg tggctaaaat gacactaaaa aaaaaaaaaa aaaa 2984 

<210> 485 
<211> 725 
<212> PRT 

<213> Homo sapiens 



<400> 485 

Met Ala Met Asp Ser Ser Leu Gln Ala Arg Leu Phe Pro Gly Leu Ala 
1 5 10 15 

Ile Lys Ile Gln Arg Ser Asn Gly Leu Ile His Ser Ala Asn Val Arg 
20 25 30 

Thr Val Asn Leu Glu Lys Ser Cys Val Ser Val Glu Trp Ala Glu Gly 
35 40 45 

Gly Ala Thr Lys Gly Lys Glu Ile Asp Phe Asp Asp Val Ala Ala Ile 
50 55 60 

Asn Pro Glu Leu Leu Gln Leu Leu Pro Leu His Pro Lys Asp Asn Leu 
65 70 75 80 

Pro Leu Gln Glu Asn Val Thr Ile Gln Lys Gln Lys Arg Arg Ser Val 
85 90 95 

Asn Ser Lys Ile Pro Ala Pro Lys Glu Ser Leu Arg Ser Arg Ser Thr 
100 105 110 

Arg Met Ser Thr Val Ser Glu Leu Arg Ile Thr Ala Gln Glu Asn Asp 
115 120 125 

Met Glu Val Glu Leu Pro Ala Ala Ala Asn Ser Arg Lys Gln Phe Ser 
130 135 140 

Val Pro Pro Ala Pro Thr Arg Pro Ser Cys Pro Ala Val Ala Glu Ile 
145 150 155 160 

Pro Leu Arg Met Val Ser Glu Glu Met Glu Glu Gln Val His Ser Ile 
165 170 175 

Arg Gly Ser Ser Ser Ala Asn Pro Val Asn Ser Val Arg Arg Lys Ser 
180 185 190 

Cys Leu Val Lys Glu Val Glu Lys Met Lys Asn Lys Arg Glu Glu Lys 
195 200 205 

Lys Ala Gln Asn Ser Glu Met Arg Met Lys Arg Ala Gln Glu Tyr Asp 
210 215 220 

Ser Ser Phe Pro Asn Trp Glu Phe Ala Arg Met Ile Lys Glu Phe Arg 
225 230 235 240 

Ala Thr Leu Glu Cys His Pro Leu Thr Met Thr Asp Pro Ile Glu Glu 
245 250 255 

His Arg Ile Cys Val Cys Val Arg Lys Arg Pro Leu Asn Lys Gln Glu 
260 265 270 

Leu Ala Lys Lys Glu Ile Asp Val Ile Ser Ile Pro Ser Lys Cys Leu 
275 280 285 

Leu Leu Val His Glu Pro Lys Leu Lys Val Asp Leu Thr Lys Tyr Leu 
290 295 300 

Glu Asn Gln Ala Phe Cys Phe Asp Phe Ala Phe Asp Glu Thr Ala Ser 
305 310 315 320 

Asn Glu Val Val Tyr Arg Phe Thr Ala Arg Pro Leu Val Gln Thr Ile 
325 330 335 

Phe Glu Gly Gly Lys Ala Thr Cys Phe Ala Tyr Gly Gln Thr Gly Ser 
340 345 350 

Gly Lys Thr His Thr Met Gly Gly Asp Leu Ser Gly Lys Ala Gln Asn 
355 360 365 

Ala Ser Lys Gly Ile Tyr Ala Met Ala Ser Arg Asp Val Phe Leu Leu 
370 375 380 

Lys Asn Gln Pro Cys Tyr Arg Lys Leu Gly Leu Glu Val Tyr Val Thr 
385 390 395 400 

Phe Phe Glu Ile Tyr Asn Gly Lys Leu Phe Asp Leu Leu Asn Lys Lys 
405 410 415 

Ala Lys Leu Arg Val Leu Glu Asp Gly Lys Gln Gln Val Gln Val Val 
420 425 430 

Gly Leu Gln Glu His Leu Val Asn Ser Ala Asp Asp Val Ile Lys Met 
435 440 445 

Leu Asp Met Gly Ser Ala Cys Arg Thr Ser Gly Gln Thr Phe Ala Asn 
450 455 460 

Ser Asn Ser Ser Arg Ser His Ala Cys Phe Gln Ile Ile Leu Arg Ala 
465 470 475 480 

Lys Gly Arg Met His Gly Lys Phe Ser Leu Val Asp Leu Ala Gly Asn 
485 490 495 

Glu Arg Gly Ala Asp Thr Ser Ser Ala Asp Arg Gln Thr Arg Met Glu 
500 505 510 

Gly Ala Glu Ile Asn Lys Ser Leu Leu Ala Leu Lys Glu Cys Ile Arg 
515 520 525 

Ala Leu Gly Gln Asn Lys Ala His Thr Pro Phe Arg Glu Ser Lys Leu 
530 535 540 

Thr Gln Val Leu Arg Asp Ser Phe Ile Gly Glu Asn Ser Arg Thr Cys 
545 550 555 560 

Met Ile Ala Thr Ile Ser Pro Gly Ile Ser Ser Cys Glu Tyr Thr Leu 
565 570 575 

Asn Thr Leu Arg Tyr Ala Asp Arg Val Lys Glu Leu Ser Pro His Ser 
580 585 590 

Gly Pro Ser Gly Glu Gln Leu Ile Gln Met Glu Thr Glu Glu Met Glu 
595 600 605 

Ala Cys Ser Asn Gly Ala Leu Ile Pro Gly Asn Leu Ser Lys Glu Glu 
610 615 620 

Glu Glu Leu Ser Ser Gln Met Ser Ser Phe Asn Glu Ala Met Thr Gln 
625 630 635 640 

Ile Arg Glu Leu Glu Glu Lys Ala Met Glu Glu Leu Lys Glu Ile Ile 
645 650 655 

Gln Gln Gly Pro Asp Trp Leu Glu Leu Ser Glu Met Thr Glu Gln Pro 
660 665 670 

Asp Tyr Asp Leu Glu Thr Phe Val Asn Lys Ala Glu Ser Ala Leu Ala 
675 680 685 

Gln Gln Ala Lys His Phe Ser Ala Leu Arg Asp Val Ile Lys Ala Leu 
690 695 700 

Arg Leu Ala Met Gln Leu Glu Glu Gln Ala Ser Arg Gln Ile Ser Ser 
705 710 715 720 

Lys Lys Arg Pro Gln 
725 



<210> 486 

<211> 2825 

<212> DNA 

<213> Homo sapiens 

<400> 486 

gcgaaattga ggtttcttgg tattgcgcgt ttctcttcct tgctgactct ccgaatggcc 60 
atggactcgt cgcttcaggc ccgcctgttt cccggtctcg ctatcaagat ccaacgcagt 120 
aatggtttaa ttcacagtgc caatgtaagg actgtgaact tggagaaatc ctgtgtttca 180 
gtggaatggg cagaaggagg tgccacaaag ggcaaagaga ttgattttga tgatgtggct 240 
gcaataaacc cagaactctt acagcttctt cccttacatc cgaaggacaa tctgcccttg 300 
caggaaaatg taacaatcca gaaacaaaaa cggagatccg tcaactccaa aattcctgct 360 
ccaaaagaaa gtcttcgaag ccgctccact cgcatgtcca ctgtctcaga gcttcgcatc 420 
acggctcagg agaatgacat ggaggtggag ctgcctgcag ctgcaaactc ccgcaagcag 480 
ttttcagttc ctcctgcccc cactaggcct tcctgccctg cagtggctga aataccattg 540 
aggatggtca gcgaggagat ggaagagcaa gtccattcca tccgtggcag ctcttctgca 600 
aaccctgtga actcagttcg gaggaaatca tgtcttgtga aggaagtgga aaaaatgaag 660 
aacaagcgag aagagaagaa ggcccagaac tctgaaatga gaatgaagag agctcaggag 720 
tatgacagta gttttccaaa ctgggaattt gcccgaatga ttaaagaatt tcgggctact 780 
ttggaatgtc atccacttac tatgactgat cctatcgaag agcacagaat atgtgtctgt 840 
gttaggaaac gcccactgaa taagcaagaa ttggccaaga aagaaattga tgtgatttcc 900 
attcctagca agtgtctcct cttggtacat gaacccaagt tgaaagtgga cttaacaaag 960 
tatctggaga accaagcatt ctgctttgac tttgcatttg atgaaacagc ttcgaatgaa 1020 
gttgtctaca ggttcacagc aaggccactg gtacagacaa tctttgaagg tggaaaagca 1080 
acttgttttg catatggcca gacaggaagt ggcaagacac atactatggg cggagacctc 1140 
tctgggaaag cccagaatgc atccaaaggg atctatgcca tggcctcccg ggacgtcttc 1200 
ctcctgaaga atcaaccctg ctaccggaag ttgggcctgg aagtctatgt gacattcttc 1260 
gagatctaca atgggaagct gtttgacctg ctcaacaaga aggccaagct gcgcgtgctg 1320 
gaggacggca agcaacaggt gcaagtggtg gggctgcagg agcatctggt taactctgct 1380 
gatgatgtca tcaagatgct cgacatgggc agcgcctgca gaacctctgg gcagacattt 1440 






WO 2004/055050 



PCT/IB2003/006434 



gccaactcca attcctcccg ctcccacgcg tgcttccaaa ttattcttcg agctaaaggg 1500 
agaatgcatg gcaagttctc tttggtagat ctggcaggga atgagcgagg cgcagacact 1560 
tccagtgctg accggcagac ccgcatggag ggcgcagaaa tcaacaagag tctcttagcc 1620 
ctgaaggagt gcatcagggc cctgggacag aacaaggctc acaccccgtt ccgtgagagc 1680 
aagctgacac aggtgctgag ggactccttc attggggaga actctaggac ttgcatgatt 1740 
gccacgatct caccaggcat aagctcctgt gaatatactt taaacaccct gagatatgca 1800 
gacagggtca aggagctgag cccccacagt gggcccagtg gagagcagtt gattcaaatg 1860 
gaaacagaag agatggaagc ctgctctaac ggggcgctga ttccaggcaa tttatccaag 1920 
gaagaggagg aactgtcttc ccagatgtcc agctttaacg aagccatgac tcagatcagg 1980 
gagctggagg agaaggctat ggaagagctc aaggagatca tacagcaagg accagactgg 2040 
cttgagctct ctgagatgac cgagcagcca gactatgacc tggagacctt tgtgaacaaa 2100 
gcggaatctg ctctggccca gcaagccaag catttctcag ccctgcgaga tgtcatcaag 2160 
gccttacgcc tggccatgca gctggaagag caggctagca gacaaataag cagcaagaaa 2220 
cggccccagt gacgactgca aataaaaatc tgtttggttt gacacccagc ctcttccctg 2280 
gccctcccca gagaactttg ggtacctggt gggtctaggc agggtctgag ctgggacagg 2340 
ttctggtaaa tgccaagtat gggggcatct gggcccaggg cagctgggga gggggtcaga 2400 
gtgacatggg acactccttt tctgttcctc agttgtcgcc ctcacgagag gaaggagctc 2460 
ttagttaccc ttttgtgttg cccttctttc catcaagggg aatgttctca gcatagagct 2520 
ttctccgcag catcctgcct gcgtggactg gctgctaatg gagagctccc tggggttgtc 2580 
ctggctctgg ggagagagac ggagccttta gtacagctat ctgctggctc taaaccttct 2640 
acgcctttgg gccgagcact gaatgtcttg tactttaaaa aaatgtttct gagacctctt 2700 
tctactttac tgtctcccta gagtcctaga ggatccctac tgttttctgt tttatgtgtt 2760 
tatacattgt atgtaacaat aaagagaaaa aataaaaaaa aaaaaaaaaa aaaaaaaaaa 2820 
aaaaa 2825 



<210> 487 
<211> 566 
<212> PRT 

<213> Homo sapiens 
<400> 487 

Met Phe Val Ser Asp Phe Arg Lys Glu Phe Tyr Glu Val Val Gln Ser 
1 5 10 15 

Gln Arg Val Leu Leu Phe Val Ala Ser Asp Val Asp Ala Leu Cys Ala 
20 25 30 

Cys Lys Ile Leu Gln Ala Leu Phe Gln Cys Asp His Val Gln Tyr Thr 
35 40 45 

Leu Val Pro Val Ser Gly Trp Gln Glu Leu Glu Thr Ala Phe Leu Glu 
50 55 60 

His Lys Glu Gln Phe His Tyr Phe Ile Leu Ile Asn Cys Gly Ala Asn 
65 70 75 80 

Val Asp Leu Leu Asp Ile Leu Gln Pro Asp Glu Asp Thr Ile Phe Phe 
85 90 95 

Val Cys Asp Thr His Arg Pro Val Asn Val Val Asn Val Tyr Asn Asp 
100 105 110 

Thr Gln Ile Lys Leu Leu Ile Lys Gln Asp Asp Asp Leu Glu Val Pro 
115 120 125 

Ala Tyr Glu Asp Ile Phe Arg Asp Glu Glu Glu Asp Glu Glu His Ser 
130 135 140 

Gly Asn Asp Ser Asp Gly Ser Glu Pro Ser Glu Lys Arg Thr Arg Leu 
145 150 155 160 

Glu Glu Glu Ile Val Glu Gln Thr Met Arg Arg Arg Gln Arg Arg Glu 
165 170 175 

Trp Glu Ala Arg Arg Arg Asp Ile Leu Phe Asp Tyr Glu Gln Tyr Glu 
180 185 190 

Tyr His Gly Thr Ser Ser Ala Met Val Met Phe Glu Leu Ala Trp Met 
195 200 205 

Leu Ser Lys Asp Leu Asn Asp Met Leu Trp Trp Ala Ile Val Gly Leu 
210 215 220 

Thr Asp Gln Trp Val Gln Asp Lys Ile Thr Gln Met Lys Tyr Val Thr 
225 230 235 240 

Asp Val Gly Val Leu Gln Arg His Val Ser Arg His Asn His Arg Asn 
245 250 255 

Glu Asp Glu Glu Asn Thr Leu Ser Val Asp Cys Thr Arg Ile Ser Phe 
260 265 270 

Glu Tyr Asp Leu Arg Leu Val Leu Tyr Gln His Trp Ser Leu His Asp 
275 280 285 

Ser Leu Cys Asn Thr Ser Tyr Thr Ala Ala Arg Phe Lys Leu Trp Ser 
290 295 300 

Val His Gly Gln Lys Arg Leu Gln Glu Phe Leu Ala Asp Met Gly Leu 
305 310 315 320 

Pro Leu Lys Gln Val Lys Gln Lys Phe Gln Ala Met Asp Ile Ser Leu 
325 330 335 

Lys Glu Asn Leu Arg Glu Met Ile Glu Glu Ser Ala Asn Lys Phe Gly 
340 345 350 

Met Lys Asp Met Arg Val Gln Thr Phe Ser Ile His Phe Gly Phe Lys 
355 360 365 

His Lys Phe Leu Ala Ser Asp Val Val Phe Ala Thr Met Ser Leu Met 
370 375 380 

Glu Ser Pro Glu Lys Asp Gly Ser Gly Thr Asp His Phe Ile Gln Ala 
385 390 395 400 

Leu Asp Ser Leu Ser Arg Ser Asn Leu Asp Lys Leu Tyr His Gly Leu 
405 410 415 

Glu Leu Ala Lys Lys Gln Leu Arg Ala Thr Gln Gln Thr Ile Ala Ser 
420 425 430 

Cys Leu Cys Thr Asn Leu Val Ile Ser Gln Gly Pro Phe Leu Tyr Cys 
435 440 445 

Ser Leu Met Glu Gly Thr Pro Asp Val Met Leu Phe Ser Arg Pro Ala 
450 455 460 

Ser Leu Ser Leu Leu Ser Lys His Leu Leu Lys Ser Phe Val Cys Ser 
465 470 475 480 

Thr Lys Asn Arg Arg Cys Lys Leu Leu Pro Leu Val Met Ala Ala Pro 
485 490 495 

Leu Ser Met Glu His Gly Thr Val Thr Val Val Gly Ile Pro Pro Glu 
500 505 510 

Thr Asp Ser Ser Asp Arg Lys Asn Phe Phe Gly Arg Ala Phe Glu Lys 
515 520 525 

Ala Ala Glu Ser Thr Ser Ser Arg Met Leu His Asn His Phe Asp Leu 
530 535 540 

Ser Val Ile Glu Leu Lys Ala Glu Asp Arg Ser Lys Phe Leu Asp Ala 
545 550 555 560 

Leu Ile Ser Leu Leu Ser 
565 



<210> 488 
<211> 1938 
<212> DNA 

<213> Homo sapiens 
<400> 488 

gatttggcgg gagtcttgac cgccgccggg ctcttggtac ctcagcgcga gcgccaggcg 60 
tccggccgcc gtggctatgt tcgtgtccga tttccgcaaa gagttctacg aggtggtcca 120 
gagccagagg gtccttctct tcgtggcctc ggacgtggat gctctgtgtg cgtgcaagat 180 
ccttcaggcc ttgttccagt gtgaccacgt gcaatatacg ctggttccag tttctgggtg 240 
gcaagaactt gaaactgcat ttcttgagca taaagaacag tttcattatt ttattctcat 300 
aaactgtgga gctaatgtag acctattgga tattcttcaa cctgatgaag acactatatt 360 
ctttgtgtgt gacacccata ggccagtcaa tgtcgtcaat gtatacaacg atacccagat 420 
caaattactc attaaacaag atgatgacct tgaagttccc gcctatgaag acatcttcag 480 
ggatgaagag gaggatgaag agcattcagg aaatgacagt gatgggtcag agccttctga 540 
gaagcgcaca cggttagaag aggagatagt ggagcaaacc atgcggagga ggcagcggcg 600 
agagtgggag gcccggagaa gagacatcct ctttgactac gagcagtatg aatatcatgg 660 
gacatcgtca gccatggtga tgtttgagct ggcttggatg ctgtccaagg acctgaatga 720 
catgctgtgg tgggccatcg ttggactaac agaccagtgg gtgcaagaca agatcactca 780 
aatgaaatac gtgactgatg ttggtgtcct gcagcgccac gtttcccgcc acaaccaccg 840 
gaacgaggat gaggagaaca cactctccgt ggactgcaca cggatctcct ttgagtatga 900 
cctccgcctg gtgctctacc agcactggtc cctccatgac agcctgtgca acaccagcta 960 
taccgcagcc aggttcaagc tgtggtctgt gcatggacag aagcggctcc aggagttcct 1020 
tgcagacatg ggtcttcccc tgaagcaggt gaagcagaag ttccaggcca tggacatctc 1080 
cttgaaggag aatttgcggg aaatgattga agagtctgca aataaatttg ggatgaagga 1140 
catgcgcgtg cagactttca gcattcattt tgggttcaag cacaagtttc tggccagcga 1200 
cgtggtcttt gccaccatgt ctttgatgga gagccccgag aaggatggct cagggacaga 1260 
tcacttcatc caggctctgg acagcctctc caggagtaac ctggacaagc tgtaccatgg 1320 
cctggaactc gccaagaagc agctgcgagc cacccagcag accattgcca gctgcctttg 1380 
caccaacctc gtcatctccc aggggccttt cctgtactgc tctctcatgg agggcactcc 1440 
agatgtcatg ctgttctcta ggccggcatc cctaagcctg ctcagcaaac acctgctcaa 1500 
gtcctttgtg tgttcgacaa agaaccggcg ctgcaaactg ctgcccctgg tgatggctgc 1560 
ccccctgagc atggagcatg gcacagtgac cgtggtgggc atccccccag agaccgacag 1620 
ctcggacagg aagaactttt ttgggagggc gtttgagaag gcagcggaaa gcaccagctc 1680 
ccggatgctg cacaaccatt ttgacctctc agtaattgag ctgaaagctg aggatcggag 1740 
caagtttctg gacgcactta tttccctcct gtcctaggaa tttgattctt ccagaatgac 1800 
cttcttattt atgtaactgg ctttcattta gattgtaagt tatggacatg atttgagatg 1860 
tagaagccat tttttattaa ataaaatgct tattttaggc tccgtcccca aaaaaaaaaa 1920 
aaaaaaaaaa aaaaaaaa 1938 



<210> 489 
<211> 219 
<212> PRT 
<213> Homo sapiens 



<400> 489 

Met Ser Glu Thr Ala Pro Ala Ala Pro Ala Ala Pro Ala Pro Ala Glu 
1 5 10 15 

Lys Thr Pro Val Lys Lys Lys Ala Arg Lys Ser Ala Gly Ala Ala Lys 
20 25 30 

Arg Lys Ala Ser Gly Pro Pro Val Ser Glu Leu Ile Thr Lys Ala Val 
35 40 45 

Ala Ala Ser Lys Glu Arg Ser Gly Val Ser Leu Ala Ala Leu Lys Lys 
50 55 60 

Ala Leu Ala Ala Ala Gly Tyr Asp Val Glu Lys Asn Asn Ser Arg Ile 
65 70 75 80 

Lys Leu Gly Leu Lys Ser Leu Val Ser Lys Gly Thr Leu Val Gln Thr 
85 90 95 

Lys Gly Thr Gly Ala Ser Gly Ser Phe Lys Leu Asn Lys Lys Ala Ala 
100 105 110 

Ser Gly Glu Ala Lys Pro Lys Ala Lys Lys Ala Gly Ala Ala Lys Ala 
115 120 125 

Lys Lys Pro Ala Gly Ala Ala Lys Lys Pro Lys Lys Ala Thr Gly Ala 
130 135 140 

Ala Thr Pro Lys Lys Ser Ala Lys Lys Thr Pro Lys Lys Ala Lys Lys 
145 150 155 160 

Pro Ala Ala Ala Ala Gly Ala Lys Lys Ala Lys Ser Pro Lys Lys Ala 
165 170 175 

Lys Ala Ala Lys Pro Lys Lys Ala Pro Lys Ser Pro Ala Lys Ala Lys 
180 185 190 

Ala Val Lys Pro Lys Ala Ala Lys Pro Lys Thr Ala Lys Pro Lys Ala 
195 200 205 

Ala Lys Pro Lys Lys Ala Ala Ala Lys Lys Lys 
210 215 













<210> 490 
<211> 785 
<212> DNA 

<213> Homo sapiens 



<400> 490 

agtcccggcc agtgcctctg cttccggctc gaattgctct cgctcacgct tgccttcaac 60 
atgtccgaga ctgcgcctgc cgcgcccgct gctccggccc ctgccgagaa gactcccgtg 120 
aagaagaagg cccgcaagtc tgcaggtgcg gccaagcgca aagcgtctgg gcccccggtg 180 
tccgagctca ttactaaagc tgttgccgcc tccaaggagc gcagcggcgt atctttggcc 240 
gctctcaaga aagcgctggc agccgctggc tatgacgtgg agaagaacaa cagccgcatc 300 
aagctgggtc tcaagagcct ggtgagcaag ggcaccctgg tgcagaccaa gggcaccggc 360 
gcgtcgggtt ccttcaaact caacaagaag gcggcctctg gggaagccaa gcctaaggct 420 
aaaaaggcag gcgcggccaa ggccaagaag ccagcaggag cggcgaagaa gcccaagaag 480 
gcgacggggg cggccacccc caagaagagc gccaagaaga ccccaaagaa ggcgaagaag 540 
ccggctgcag ctgctggagc caaaaaagcg aaaagcccga aaaaggcgaa agcagccaag 600 
ccaaaaaagg cgcccaagag cccagcgaag gccaaagcag ttaaacccaa ggcggctaaa 660 
ccaaagaccg ccaagcccaa ggcagccaag ccaaagaagg cggcagccaa gaaaaagtag 720 
aaagttcctt tggccaactg cttagaagcc caacacaacc caaaggctct tttcagagcc 780 
accca 785 



<210> 491 
<211> 910 
<212> PRT 

<213> Homo sapiens 



<400> 491 
Met Arg Arg Ser Ala Ala Pro Ser Gln Leu Gln Gly Asn Ser Phe Lys
1 5 10 15
Lys Pro Lys Phe Ile Pro Pro Gly Arg Ser Asn Pro Gly Leu Asn Glu
20 25 30
Glu Ile Thr Lys Leu Asn Pro Asp Ile Lys Leu Phe Glu Gly Val Ala
35 40 45
Ile Asn Asn Thr Phe Leu Pro Ser Gln Asn Asp Leu Arg Ile Cys Ser
50 55 60
Leu Asn Leu Pro Ser Glu Glu Ser Thr Arg Glu Ile Asn Asn Arg Asp
65 70 75 80
Asn Cys Ser Gly Lys Tyr Cys Phe Glu Ala Pro Thr Leu Ala Thr Leu
85 90 95
Asp Pro Pro His Thr Val His Ser Ala Pro Lys Glu Val Ala Val Ser
100 105 110
Lys Glu Gln Glu Glu Lys Ser Asp Ser Leu Val Lys Tyr Phe Ser Val
115 120 125
Val Trp Cys Lys Pro Ser Lys Lys Lys His Lys Lys Trp Glu Gly Asp
130 135 140
Ala Val Leu Ile Val Lys Gly Lys Ser Phe Ile Leu Lys Asn Leu Glu
145 150 155 160
Gly Lys Asp Ile Gly Arg Gly Ile Gly Tyr Lys Phe Lys Glu Leu Glu
165 170 175
Lys Ile Glu Glu Gly Gln Thr Leu Met Ile Cys Gly Lys Glu Ile Glu
180 185 190
Val Met Gly Val Ile Ser Pro Asp Asp Phe Ser Ser Gly Arg Cys Phe
195 200 205
Gln Leu Gly Gly Gly Ser Thr Ala Ile Ser His Ser Ser Gln Val Ala
210 215 220
Arg Lys Cys Phe Ser Asn Pro Phe Lys Ser Val Cys Lys Pro Ser Ser
225 230 235 240
Lys Glu Asn Arg Gln Asn Asp Phe Gln Asn Cys Lys Pro Arg His Asp
245 250 255
Pro Tyr Thr Pro Asn Ser Leu Val Met Pro Arg Pro Asp Lys Asn His
260 265 270
Gln Trp Val Phe Asn Lys Asn Cys Phe Pro Leu Val Asp Val Val Ile
275 280 285
Asp Pro Tyr Leu Val Tyr His Leu Arg Pro His Gln Lys Glu Gly Ile
290 295 300
Ile Phe Leu Tyr Glu Cys Val Met Gly Met Arg Met Asn Gly Arg Cys
305 310 315 320
Gly Ala Ile Leu Ala Asp Glu Met Gly Leu Gly Lys Thr Leu Gln Cys
325 330 335
Ile Ser Leu Ile Trp Thr Leu Gln Cys Gln Gly Pro Tyr Gly Gly Lys
340 345 350
Pro Val Ile Lys Lys Thr Leu Ile Val Thr Pro Gly Ser Leu Val Asn
355 360 365
Asn Trp Lys Lys Glu Phe Gln Lys Trp Leu Gly Ser Glu Arg Ile Lys
370 375 380
Ile Phe Thr Val Asp Gln Asp His Lys Val Glu Glu Phe Ile Lys Ser
385 390 395 400
Ile Phe Tyr Ser Val Leu Ile Ile Ser Tyr Glu Met Leu Leu Arg Ser
405 410 415
Leu Asp Gln Ile Lys Asn Ile Lys Phe Asp Leu Leu Ile Cys Asp Glu
420 425 430
Gly His Arg Leu Lys Asn Ser Ala Ile Lys Thr Thr Thr Ala Leu Ile
435 440 445
Ser Leu Ser Cys Glu Lys Arg Ile Ile Leu Thr Gly Thr Pro Ile Gln
450 455 460
Asn Asp Leu Gln Glu Phe Phe Ala Leu Ile Asp Phe Val Asn Pro Gly
465 470 475 480
Ile Leu Gly Ser Leu Ser Ser Tyr Arg Lys Ile Tyr Glu Glu Pro Ile
485 490 495
Ile Leu Ser Arg Glu Pro Ser Ala Ser Glu Glu Glu Lys Glu Leu Gly
500 505 510
Glu Arg Arg Ala Ala Glu Leu Thr Cys Leu Thr Gly Leu Phe Ile Leu
515 520 525
Arg Arg Thr Gln Glu Ile Ile Asn Lys Tyr Leu Pro Pro Lys Ile Glu
530 535 540
Asn Val Val Phe Cys Arg Pro Gly Ala Leu Gln Ile Glu Leu Tyr Arg
545 550 555 560
Lys Leu Leu Asn Ser Gln Val Val Arg Phe Cys Leu Gln Gly Leu Leu
565 570 575
Glu Asn Ser Pro His Leu Ile Cys Ile Gly Ala Leu Lys Lys Leu Cys
580 585 590
Asn His Pro Cys Leu Leu Phe Asn Ser Ile Lys Glu Lys Glu Cys Ser
595 600 605
Ser Thr Cys Asp Lys Asn Glu Glu Lys Ser Leu Tyr Lys Gly Leu Leu
610 615 620
Ser Val Phe Pro Ala Asp Tyr Asn Pro Leu Leu Phe Thr Glu Lys Glu
625 630 635 640
Ser Gly Lys Leu Gln Val Leu Ser Lys Leu Leu Ala Val Ile His Glu
645 650 655
Leu Arg Pro Thr Glu Lys Val Val Leu Val Ser Asn Tyr Thr Gln Thr
660 665 670
Leu Asn Ile Leu Gln Glu Val Cys Lys Arg His Gly Tyr Ala Tyr Thr
675 680 685
Arg Leu Asp Gly Gln Thr Pro Ile Ser Gln Arg Gln Gln Ile Val Asp
690 695 700
Gly Phe Asn Ser Gln His Ser Ser Phe Phe Ile Phe Leu Leu Ser Ser
705 710 715 720
Lys Ala Gly Gly Val Gly Leu Asn Leu Ile Gly Gly Ser His Leu Ile
725 730 735
Leu Tyr Asp Ile Asp Trp Asn Pro Ala Thr Asp Ile Gln Ala Met Ser
740 745 750
Arg Val Trp Arg Asp Gly Gln Lys Tyr Pro Val His Ile Tyr Arg Leu
755 760 765
Leu Thr Thr Gly Thr Ile Glu Glu Lys Ile Tyr Gln Arg Gln Ile Ser
770 775 780
Lys Gln Gly Leu Cys Gly Ala Val Val Asp Leu Thr Lys Thr Ser Glu
785 790 795 800
His Ile Gln Phe Ser Val Glu Glu Leu Lys Asn Leu Phe Thr Leu His
805 810 815
Glu Ser Ser Asp Cys Val Thr His Asp Leu Leu Asp Cys Glu Cys Thr
820 825 830
Gly Glu Glu Val His Thr Gly Asp Ser Leu Glu Lys Phe Ile Val Ser
835 840 845
Arg Asp Cys Gln Leu Gly Pro His His Gln Lys Ser Asn Ser Leu Lys
850 855 860
Pro Leu Ser Met Ser Gln Leu Lys Gln Trp Lys His Phe Ser Gly Asp
865 870 875 880
His Leu Asn Leu Thr Asp Pro Phe Leu Glu Arg Ile Thr Glu Asn Val
885 890 895
Ser Phe Ile Phe Gln Asn Ile Thr Thr Gln Ala Thr Gly Thr
900 905 910

<210> 492 

<211> 3057 

<212> DNA 

<213> Homo sapiens 

<400> 492 

gagcttctcc ggcggcgagg ggatagctgg ttaccagaag gacttctttg cagggccagt 60 
ggtttctgtc agattttcgc cggtcacgac tgctgaatat gagacgatct gcagcaccaa 120 
gtcagttgca ggggaattcc ttcaaaaaac caaaatttat acctccagga agaagtaatc 180 
caggtctgaa tgaagagatt acaaaactga atccagatat aaaattattt gagggtgttg 240 
caattaataa cacctttctc ccgtcacaaa atgatcttag aatatgcagt ttaaatctgc 300 
ctagtgaaga aagtactaga gaaatcaata acagagataa ttgcagtgga aaatattgtt 360 
ttgaagcacc tacactggca acattagatc cacctcatac agttcattcg gctcctaaag 420 
aagtagcagt gtccaaggaa caagaagaga aatctgatag cctagttaaa tatttcagtg 480 
ttgtttggtg taagccttca aagaaaaaac ataaaaagtg ggaaggtgat gctgttctta 540 
ttgtaaaagg aaagtcattt atattaaaga atttggaagg caaagacatt ggaagaggca 600 
ttggttataa attcaaagag cttgaaaaga ttgaagaggg ccaaacactg atgatttgtg 660 
gaaaagaaat agaagtcatg ggtgtaatct ctccagatga cttcagcagt ggcaggtgtt 720 
ttcagcttgg aggaggaagt actgctatct cgcattcttc tcaggttgcc aggaaatgtt 780 
tctctaaccc tttcaaaagt gtttgtaaac caagttcaaa ggaaaataga cagaatgatt 840 
tccaaaattg caaaccacgc catgacccat atacgccaaa ttccctcgtt atgccacgac 900 
cagataagaa tcaccagtgg gtattcaata agaactgttt ccctcttgtg gatgtagtga 960 
ttgatcctta ccttgtatat catcttcgac cacatcagaa agaaggaatc atattccttt 1020 
atgaatgtgt aatgggaatg agaatgaatg gcagatgtgg agctattctt gctgatgaaa 1080 
tgggtttagg gaagacattg caatgtattt cgctcatctg gaccctgcag tgtcagggac 1140 
cctatggagg caagccagta ataaagaaga cactaattgt cacacctgga agcttggtga 1200 
ataattggaa gaaagaattt caaaaatggc taggaagtga aaggatcaag atatttactg 1260 
ttgatcagga ccacaaagtt gaagaattca tcaagtctat attttattct gttcttatta 1320 
tcagttatga aatgttactt cgttccctgg atcaaattaa gaatataaaa tttgatcttc 1380 
taatctgtga cgaggggcat cgtttgaaga acagtgccat taagacaact acagccctca 1440 
ttagcctttc ttgtgagaaa agaataattc taactggtac tccaattcag aatgatctgc 1500 
aagaattttt tgcattaatt gattttgtaa atccaggaat attaggctct ttgtcatctt 1560 
ataggaaaat atatgaagaa cccatcattt tatcgagaga accttctgct tctgaggaag 1620 
aaaaggagtt aggagaaaga agagcagctg aacttacttg cctcactgga ctctttatcc 1680 
ttagaagaac ccaagaaatt ataaataaat atctcccacc taaaatagag aatgttgtct 1740 
tttgccgacc aggagcacta cagattgagc tttatcgaaa gctgttaaat tctcaggttg 1800 
tcaggttctg ccttcaaggg ttgttggaaa atagtcccca tctaatatgt ataggagctc 1860 
ttaaaaaact gtgcaatcac ccctgccttt tgttcaactc tataaaggaa aaggaatgta 1920 
gctcaacttg tgataaaaat gaagaaaaga gtctatacaa aggcttgcta agtgtgtttc 1980 
ctgctgacta caaccctctc ctgtttactg aaaaggagtc aggaaaacta caggtgttgt 2040 
ccaagctctt agcggttatc cacgaacttc gacctactga aaaggtggtg ttggtatcca 2100 
actatacaca aaccttgaac attttacaag aagtatgtaa gcgtcatgga tatgcttata 2160 






caagacttga tggacaaaca ccaatctctc aaaggcagca gattgttgat ggctttaaca 2220
gtcaacactc ttcttttttt atttttttgt taagttcaaa agctggtggt gtaggactta 2280
acctcattgg aggatctcac ttaattctct atgacattga ttggaatcca gccactgaca 2340
ttcaggcaat gtctagagta tggagagatg gtcagaaata tcctgtacat atttacagac 2400
tcctaactac aggtacaata gaagaaaaga tctatcaaag gcagatcagt aagcaaggtc 2460
tttgtggggc agttgtcgac ctcaccaaga catctgaaca tattcagttt tcagtagaag 2520
aacttaaaaa tttgttcaca ttacatgaaa gttcagattg tgttactcat gatctgcttg 2580
actgtgagtg tacaggagaa gaagttcata caggtgattc gttggaaaaa ttcattgtct 2640
ctagagattg tcagcttggt ccacatcacc agaaatctaa ctccctgaaa cctctttcta 2700
tgtcccagct gaagcaatgg aaacattttt ctggagatca tttaaatctt acagatcctt 2760
ttcttgaaag aataacagaa aatgtgtcat tcatttttca gaatataacc actcaagcta 2820
ctggcacata gtgaaagatt acttctgaca ttccattgct cttcttttga aaattagtat 2880
ggtaattaaa tgtacttttt gaaaattaat agaattattt aaattacagt atatgttgca 2940
aaatatatca cttttgatac aatagtcaaa attgagtggt ttaatgtttt gtaaatatta 3000
agtgtttaaa tgaaaaataa agatgtgctt atatcattgt aaaaaaaaaa aaaaaaa 3057

<210> 493
<211> 209
<212> PRT

<213> Homo sapiens

<400> 493
Met Gly Lys Gly Asp Pro Asn Lys Pro Arg Gly Lys Met Ser Ser Tyr
1 5 10 15
Ala Phe Phe Val Gln Thr Cys Arg Glu Glu His Lys Lys Lys His Pro
20 25 30
Asp Ser Ser Val Asn Phe Ala Glu Phe Ser Lys Lys Cys Ser Glu Arg
35 40 45
Trp Lys Thr Met Ser Ala Lys Glu Lys Ser Lys Phe Glu Asp Met Ala
50 55 60
Lys Ser Asp Lys Ala Arg Tyr Asp Arg Glu Met Lys Asn Tyr Val Pro
65 70 75 80
Pro Lys Gly Asp Lys Lys Gly Lys Lys Lys Asp Pro Asn Ala Pro Lys
85 90 95
Arg Pro Pro Ser Ala Phe Phe Leu Phe Cys Ser Glu His Arg Pro Lys
100 105 110
Ile Lys Ser Glu His Pro Gly Leu Ser Ile Gly Asp Thr Ala Lys Lys
115 120 125
Leu Gly Glu Met Trp Ser Glu Gln Ser Ala Lys Asp Lys Gln Pro Tyr
130 135 140
Glu Gln Lys Ala Ala Lys Leu Lys Glu Lys Tyr Glu Lys Asp Ile Ala
145 150 155 160
Ala Tyr Arg Ala Lys Gly Lys Ser Glu Ala Gly Lys Lys Gly Pro Gly
165 170 175
Arg Pro Thr Gly Ser Lys Lys Lys Asn Glu Pro Glu Asp Glu Glu Glu
180 185 190
Glu Glu Glu Glu Glu Asp Glu Asp Glu Glu Glu Glu Asp Glu Asp Glu
195 200 205
Glu



<210> 494 
<211> 1277 
<212> DNA 

<213> Homo sapiens 
<400> 494 

gatgtggccc gtggcctagc tcgtcaagtt gccgtggcgc ggagaactct gcaaaacaag 60 
aggctgagga ttgcgttaga gataaaccag ttcacgccgg agccccgtga gggaagcgtc 120 
tccgttgggt ccggccgctc tgcgggactc tgaggaaaag ctcgcaccag gtggacgcgg 180 
atctgtcaac atgggtaaag gagaccccaa caagccgcgg ggcaaaatgt cctcgtacgc 240
cttcttcgtg cagacctgcc gggaagagca caagaagaaa cacccggact cttccgtcaa 300
tttcgcggaa ttctccaaga agtgttcgga gagatggaag accatgtctg caaaggagaa 360
gtcgaagttt gaagatatgg caaaaagtga caaagctcgc tatgacaggg agatgaaaaa 420
ttacgttcct cccaaaggtg ataagaaggg gaagaaaaag gaccccaatg ctcctaaaag 480
gccaccatct gccttcttcc tgttttgctc tgaacatcgc ccaaagatca aaagtgaaca 540
ccctggccta tccattgggg atactgcaaa gaaattgggt gaaatgtggt ctgagcagtc 600
agccaaagat aaacaaccat atgaacagaa agcagctaag ctaaaggaga aatatgaaaa 660
ggatattgct gcatatcgtg ccaagggcaa aagtgaagca ggaaagaagg gccctggcag 720
gccaacaggc tcaaagaaga agaacgaacc agaagatgag gaggaggagg aggaagaaga 780
agatgaagat gaggaggaag aggatgaaga tgaagaataa atggctatcc tttaatgatg 840
cgtgtggaat gtgtgtgtgt gctcaggcaa ttattttgct aagaatgtga attcaagtgc 900
agctcaatac tagcttcagt ataaaaactg tacagatttt tgtatagctg ataagattct 960
ctgtagagaa aatactttta aaaaatgcag gttgtagctt tttgatgggc tactcataca 1020
gttagatttt acagcttctg atgttgaatg ttcctaaata tttaatggtt tttttaattt 1080
ctgtgtgtat ggtagcacag caaacttgta ggaattagta tcaatagtaa attttgggtt 1140
ttttaggatg ttgcatttcg tttttttaaa aaaaattttg taataaaatt atgtatatta 1200
tttctattgt ctttgtctta atatgctaag ttaattttca ctttaaaaaa gccatttgaa 1260
gaccaaaaaa aaaaaaa 1277



<210> 495 
<211> 874 
<212> PRT 
<213> Homo sapiens 



<400> 495 
Met Asp Glu Glu Glu Asp Asn Leu Ser Leu Leu Thr Ala Leu Leu Glu
1 5 10 15
Glu Asn Glu Ser Ala Leu Asp Cys Asn Ser Glu Glu Asn Asn Phe Leu
20 25 30
Thr Arg Glu Asn Gly Glu Pro Asp Ala Phe Asp Glu Leu Phe Asp Ala
35 40 45
Asp Gly Asp Gly Glu Ser Tyr Thr Glu Glu Ala Asp Asp Gly Glu Thr
50 55 60
Gly Glu Thr Arg Asp Glu Lys Glu Asn Leu Ala Thr Leu Phe Gly Asp
65 70 75 80
Met Glu Asp Leu Thr Asp Glu Glu Glu Val Pro Ala Ser Gln Ser Thr
85 90 95
Glu Asn Arg Val Leu Pro Ala Pro Ala Pro Arg Arg Glu Lys Thr Asn
100 105 110
Glu Glu Leu Gln Glu Glu Leu Arg Asn Leu Gln Glu Gln Met Lys Ala
115 120 125
Leu Gln Glu Gln Leu Lys Val Thr Thr Ile Lys Gln Thr Ala Ser Pro
130 135 140
Ala Arg Leu Gln Lys Ser Pro Glu Lys Ser Pro Arg Pro Pro Leu Lys
145 150 155 160
Glu Arg Arg Val Gln Arg Ile Gln Glu Ser Thr Cys Phe Ser Ala Glu
165 170 175
Leu Asp Val Pro Ala Leu Pro Arg Thr Lys Arg Val Ala Arg Thr Pro
180 185 190
Lys Ala Ser Pro Pro Asp Pro Lys Ser Ser Ser Ser Arg Met Thr Ser
195 200 205
Ala Pro Ser Gln Pro Leu Gln Thr Ile Ser Arg Asn Lys Pro Ser Gly
210 215 220
Ile Thr Arg Gly Gln Ile Val Gly Thr Pro Gly Ser Ser Gly Glu Thr
225 230 235 240
Thr Gln Pro Ile Cys Val Glu Ala Phe Ser Gly Leu Arg Leu Arg Arg
245 250 255
Pro Arg Val Ser Ser Thr Glu Met Asn Lys Lys Met Thr Gly Arg Lys
260 265 270
Leu Ile Arg Leu Ser Gln Ile Lys Glu Lys Met Ala Arg Glu Lys Leu
275 280 285








Glu Glu Ile Asp Trp Val Thr Phe Gly Val Ile Leu Lys Lys Val Thr
290 295 300
Pro Gln Ser Val Asn Ser Gly Lys Thr Phe Ser Ile Trp Lys Leu Asn
305 310 315 320
Asp Leu Arg Asp Leu Thr Gln Cys Val Ser Leu Phe Leu Phe Gly Glu
325 330 335
Val His Lys Ala Leu Trp Lys Thr Glu Gln Gly Thr Val Val Gly Ile
340 345 350
Leu Asn Ala Asn Pro Met Lys Pro Lys Asp Gly Ser Glu Glu Val Cys
355 360 365
Leu Ser Ile Asp His Pro Gln Lys Val Leu Ile Met Gly Glu Ala Leu
370 375 380
Asp Leu Gly Thr Cys Lys Ala Lys Lys Lys Asn Gly Glu Pro Cys Thr
385 390 395 400
Gln Thr Val Asn Leu Arg Asp Cys Glu Tyr Cys Gln Tyr His Val Gln
405 410 415
Ala Gln Tyr Lys Lys Leu Ser Ala Lys Arg Ala Asp Leu Gln Ser Thr
420 425 430
Phe Ser Gly Gly Arg Ile Pro Lys Lys Phe Ala Arg Arg Gly Thr Ser
435 440 445
Leu Lys Glu Arg Leu Cys Gln Asp Gly Phe Tyr Tyr Gly Gly Val Ser
450 455 460
Ser Ala Ser Tyr Ala Ala Ser Ile Ala Ala Ala Val Ala Pro Lys Lys
465 470 475 480
Lys Ile Gln Thr Thr Leu Ser Asn Leu Val Val Lys Gly Thr Asn Leu
485 490 495
Ile Ile Gln Glu Thr Arg Gln Lys Leu Gly Ile Pro Gln Lys Ser Leu
500 505 510
Ser Cys Ser Glu Glu Phe Lys Glu Leu Met Asp Leu Pro Thr Cys Gly
515 520 525
Ala Arg Asn Leu Lys Gln His Leu Ala Lys Ala Thr Ala Ser Gly Ile
530 535 540
Met Gly Ser Pro Lys Pro Ala Ile Lys Ser Ile Ser Ala Ser Ala Leu
545 550 555 560
Leu Lys Gln Gln Lys Gln Arg Met Leu Glu Met Arg Arg Arg Lys Ser
565 570 575
Glu Glu Ile Gln Lys Arg Phe Leu Gln Ser Ser Ser Glu Val Glu Ser
580 585 590
Pro Ala Val Pro Ser Ser Ser Arg Gln Pro Pro Ala Gln Pro Pro Arg
595 600 605
Thr Gly Ser Glu Phe Pro Arg Leu Glu Gly Ala Pro Ala Thr Met Thr
610 615 620
Pro Lys Leu Gly Arg Gly Val Leu Glu Gly Asp Asp Val Leu Phe Tyr
625 630 635 640
Asp Glu Ser Pro Pro Pro Arg Pro Lys Leu Ser Ala Leu Ala Glu Ala
645 650 655
Lys Lys Leu Ala Ala Ile Thr Lys Leu Arg Ala Lys Gly Gln Val Leu
660 665 670
Thr Lys Thr Asn Pro Asn Ser Ile Lys Lys Lys Gln Lys Asp Pro Gln
675 680 685
Asp Ile Leu Glu Val Lys Glu Arg Val Glu Lys Asn Thr Met Phe Ser
690 695 700
Ser Gln Ala Glu Asp Glu Leu Glu Pro Ala Arg Lys Lys Arg Arg Glu
705 710 715 720
Gln Leu Ala Tyr Leu Glu Ser Glu Glu Phe Gln Lys Ile Leu Lys Ala
725 730 735
Lys Ser Lys His Thr Gly Ile Leu Lys Glu Ala Glu Ala Glu Met Gln
740 745 750
Glu Arg Tyr Phe Glu Pro Leu Val Lys Lys Glu Gln Met Glu Glu Lys
755 760 765
Met Arg Asn Ile Arg Glu Val Lys Cys Arg Val Val Thr Cys Lys Thr
770 775 780
Cys Ala Tyr Thr His Phe Lys Leu Leu Glu Thr Cys Val Ser Glu Gln
785 790 795 800
His Glu Tyr His Trp His Asp Gly Val Lys Arg Phe Phe Lys Cys Pro
805 810 815
Cys Gly Asn Arg Ser Ile Ser Leu Asp Arg Leu Pro Asn Lys His Cys
820 825 830
Ser Asn Cys Gly Leu Tyr Lys Trp Glu Arg Asp Gly Met Leu Lys Glu
835 840 845
Lys Thr Gly Pro Lys Ile Gly Gly Glu Thr Leu Leu Pro Arg Gly Glu
850 855 860
Glu His Ala Lys Phe Leu Asn Ser Leu Lys
865 870



<210> 496
<211> 4532 
<212> DNA 

<213> Homo sapiens 
<400> 496 

gaacgaagaa ggcgtcccgg catcggccaa gattctacat tgctcatctg ggcatctgag 60 
cctccttcga agtttcctgt cacaactgtc ctcttgacag catggatgag gaggaagaca 120 
atctgtctct gctgaccgca ctgctggaag aaaatgagtc agccttggat tgtaattcag 180 
aagaaaataa cttcttgacg cgggaaaatg gcgagcccga cgcatttgat gagctctttg 240 
atgccgacgg cgacggtgaa tcttatacag aagaggctga tgatggagaa acaggagaga 300 
caagagacga aaaggaaaat ctggccactc tctttggaga tatggaggac ttaacagatg 360 
aagaagaagt tcccgcatca cagtcaactg aaaatagggt cctccctgct cctgccccca 420 
ggcgagagaa aacgaatgaa gagttgcaag aggaattaag gaatttgcaa gagcaaatga 480 
aggccttaca agagcagcta aaagtaacaa caattaaaca gacagcaagc ccagcccgtc 540 
tgcaaaaatc ccctgagaag tctccccggc cacctcttaa ggagaggaga gttcagagaa 600 
ttcaggagtc aacatgcttt tctgcggagc ttgatgtccc tgcgctacca agaaccaaga 660 
gggtggctcg aacaccaaag gcttcacctc cagatcccaa aagctcatct tcaaggatga 720 
caagtgcacc ctcccaaccc ctacagacga tttctcggaa caaacctagt gggataacta 780 
gaggtcaaat tgtggggacc ccaggaagtt ctggggaaac gactcaaccc atctgtgtgg 840 
aagccttctc tggtctgcgg ctcaggcggc ctcgagtatc ctccacagaa atgaacaaga 900 
aaatgaccgg ccgaaaactg atcagactgt ctcagatcaa ggaaaagatg gccagagaga 960 
agctggaaga aatagattgg gtgacatttg gggttatatt gaagaaggtt acgccacaga 1020 
gtgtgaatag tggaaaaacc ttcagcatat ggaaactgaa tgatcttcgt gacctgacac 1080 
aatgtgtgtc cttgttctta tttggagaag ttcacaaagc gctctggaag acggagcagg 1140 
ggactgtcgt agggatcctc aatgccaacc ccatgaagcc caaggatggt tcagaggagg 1200 
tgtgtttatc tatcgatcat cctcagaagg tcttaattat gggtgaagct cttgacctgg 1260 
gaacctgtaa agccaagaag aagaatggag agccgtgcac gcagactgtg aatttgcgtg 1320 
actgtgagta ctgtcagtac catgtccagg ctcagtacaa gaagctcagc gcaaagcgtg 1380 
cggatctgca gtccaccttc tctggaggac gaattccaaa gaagtttgcc cgcagaggca 1440
ccagcctcaa agaacggctg tgccaagatg gcttttacta cggaggggtt tcttctgcct 1500 
cgtatgcagc ttcaattgca gcagctgtgg ctcctaagaa gaagattcaa accactctga 1560 
gtaatctggt tgttaagggc acaaacttga tcatccagga aacacggcaa aaactcggaa 1620 
taccccagaa gagcctgtct tgctctgagg agttcaagga actgatggac ctgccgacgt 1680 
gtggagccag gaacttaaaa caacatttag ccaaagccac agcttcaggg attatgggga 1740 
gcccaaaacc agccatcaag tccatctcgg cctcagcact cttgaagcaa cagaagcagc 1800 
ggatgttgga gatgaggaga aggaaatcag aagaaataca gaagcgattt ctgcagagct 1860 
caagtgaagt tgagagccca gctgtgccat cttcatcaag acagccccct gctcagcctc 1920 
cacggacagg atccgagttc cccaggctgg agggagcccc ggccacaatg acgcccaagc 1980 
tggggcgagg tgtcttggaa ggagatgatg ttctctttta tgatgagtca ccaccaccaa 2040 
gaccaaaact gagtgcttta gcagaagcca aaaagttagc tgctatcacc aaattaaggg 2100 
caaaaggcca ggttcttaca aaaacaaacc caaacagcat taagaagaaa caaaaggacc 2160 
ctcaggacat cctggaggtg aaggaacgtg tagaaaaaaa caccatgttt tcttctcaag 2220 
ctgaggatga attggagcct gccaggaaaa aaaggagaga acaacttgcc tatctggaat 2280 
ctgaggaatt tcagaaaatc ctaaaagcaa aatcaaaaca cacaggcatc ctgaaagagg 2340 
ccgaggctga gatgcaggag cgctactttg agccactggt gaaaaaagaa caaatggaag 2400
aaaagatgag aaacatcaga gaagtgaagt gccgtgtcgt gacatgcaag acgtgcgcct 2460 
atacccactt caagctgctg gagacctgcg tcagtgagca gcatgaatac cactggcatg 2520 
atggtgtgaa gaggtttttc aaatgtccct gtggaaacag aagcatctcc ttggacagac 2580 
tcccgaacaa gcactgcagt aactgtggcc tctacaaatg ggaacgggac ggaatgctaa 2640 
aggaaaagac tggtccaaag ataggaggag aaactctgtt accaagagga gaagaacatg 2700 
ctaaatttct gaacagcctt aaataacccg aacttcagac attttcccac agacttcctg 2760 
gcctcctgtg actctggaaa gcaaaggatt ggctgtgtat tgtccattga ttcctgattg 2820 
acgccgtcaa aaacaaatgc ttgttaagcc cataagcttt gcctgcttac tttctgccat 2880 
tgggttggtt tgataccaca tttaacattg acatttaagt ggaaaaccaa gttatcattg 2940 
tcttttctaa gctcagtgtg gatgattgca ttacttcatt cactgaagtt tttgcccaaa 3000 
aattggaagg taaacagaga gctatgtttc tgtatctttt ggttatagag tgttcacttc 3060 
tttatcataa caaaattcta gtgtttatac gaacacccag aggcaaaaga atttggctta 3120 
attctcactc caggtaagta gcttaacttc tgggcttcag ttttctcatc tgtaaaatca 3180 
ggaagattgg actaagtgat cctgaaatgt attttttagc actggatttc tacaaataat 3240 
aaaactttcc catctagata atgatgatca catagtcttg atgtacggac attaaaagcc 3300 
agatttcttc attcaattct gttatctctg ttttactctt tgaaattgat caagccactg 3360 
aatcactttg catttcagtt tatatatata gagagaaaga aggtgtctgc tcttacatta 3420 
ttgtggagcc ctgtgataga aatatgtaaa atctcatatt attttttttt taattttttt 3480 
attttttatg acagggtctc actatgtcac cctggctgga gtgcagtagt gcgatcgcgg 3540 
cacactgcag ccttggcttc cctgggctca agcagtcctc ccacctcagt ctcccaaata 3600 
gctaggacta caggcgtgcg tgaccaagcc cagctaattt ttgcattttt tgtagagatg 3660 
gggttttgcc atgttgctca ggctggtctc aaactcctga gcactagcaa tccacccacc 3720 
tctgtttcca aaaaaaaaaa aaaaatgaaa ggtcaacccc tatgcaaatt accacagcaa 3780 
aggtttcatt caggagattc ttccatctgg gcaacctggt tttccaaata tcatttgacc 3840 
taagtgaatg ttgatactag ctaaagattg ggtaaattgg ttgaattatt gtattgaagc 3900 
ttgagctgta gctaaaagta atttaggttt cccctaagat gttattatgt tagggacata 3960 
acacttttgg gaggttgttg tgggagatgg ttgatttagg ttttcaaaag ctagaaataa 4020 
aatttacatg ccttagattt cataaaattc tgctctaatt gggtggaagg tgctgtatct 4080 
aacttgtgtt cctcctaagg ttatgtccta ataactattc ttttaggagt atacttctac 4140 
tttatagaag gttgcttttc tttttaattt tttctaacaa agaaaagaat aaagtattta 4200 
ttaataagaa ccagaaagca cttgaaactg atgtttttaa tggctcattt agggtagatt 4260 
tatttatctc attaacttaa aacagctatg tgtatgaaat aggtcacaac agaacttgaa 4320 
caccaggttg gtgtctgagc aatccctttc ttatgggaaa aacaatgttc ttgtttgaac 4380 
agagggtatc attgcagtca gtattcacgt gtatattgtt atataagttg tataatatgc 4440 
ttgtaaaggc tgagggtgag ctgtatctgg atgccttttt acaatttgat tttaactttt 4500 
aaaataaatt taaaacataa aaaaaaaaaa aa 4532 

<210> 497 
<211> 876 
<212> PRT 

<213> Homo sapiens 



<400> 497 



Met Glu Leu Ile Thr Ile Leu Glu Lys Thr Val Ser Pro Asp Arg Leu
1 5 10 15
Glu Leu Glu Ala Ala Gln Lys Phe Leu Glu Arg Ala Ala Val Glu Asn
20 25 30
Leu Pro Thr Phe Leu Val Glu Leu Ser Arg Val Leu Ala Asn Pro Gly
35 40 45
Asn Ser Gln Val Ala Arg Val Ala Ala Gly Leu Gln Ile Lys Asn Ser
50 55 60
Leu Thr Ser Lys Asp Pro Asp Ile Lys Ala Gln Tyr Gln Gln Arg Trp
65 70 75 80
Leu Ala Ile Asp Ala Asn Ala Arg Arg Glu Val Lys Asn Tyr Val Leu
85 90 95
Gln Thr Leu Gly Thr Glu Thr Tyr Arg Pro Ser Ser Ala Ser Gln Cys
100 105 110
Val Ala Gly Ile Ala Cys Ala Glu Ile Pro Val Asn Gln Trp Pro Glu
115 120 125
Leu Ile Pro Gln Leu Val Ala Asn Val Thr Asn Pro Asn Ser Thr Glu
130 135 140

His Met Lys Glu Ser Thr Leu Glu Ala Ile Gly Tyr Ile Cys Gln Asp
145 150 155 160
Ile Asp Pro Glu Gln Leu Gln Asp Lys Ser Asn Glu Ile Leu Thr Ala
165 170 175
Ile Ile Gln Gly Met Arg Lys Glu Glu Pro Ser Asn Asn Val Lys Leu
180 185 190
Ala Ala Thr Asn Ala Leu Leu Asn Ser Leu Glu Phe Thr Lys Ala Asn
195 200 205
Phe Asp Lys Glu Ser Glu Arg His Phe Ile Met Gln Val Val Cys Glu
210 215 220
Ala Thr Gln Cys Pro Asp Thr Arg Val Arg Val Ala Ala Leu Gln Asn
225 230 235 240
Leu Val Lys Ile Met Ser Leu Tyr Tyr Gln Tyr Met Glu Thr Tyr Met
245 250 255
Gly Pro Ala Leu Phe Ala Ile Thr Ile Glu Ala Met Lys Ser Asp Ile
260 265 270
Asp Glu Val Ala Leu Gln Gly Ile Glu Phe Trp Ser Asn Val Cys Asp
275 280 285
Glu Glu Met Asp Leu Ala Ile Glu Ala Ser Glu Ala Ala Glu Gln Gly
290 295 300
Arg Pro Pro Glu His Thr Ser Lys Phe Tyr Ala Lys Gly Ala Leu Gln
305 310 315 320
Tyr Leu Val Pro Ile Leu Thr Gln Thr Leu Thr Lys Gln Asp Glu Asn
325 330 335
Asp Asp Asp Asp Asp Trp Asn Pro Cys Lys Ala Ala Gly Val Cys Leu
340 345 350
Met Leu Leu Ala Thr Cys Cys Glu Asp Asp Ile Val Pro His Val Leu
355 360 365
Pro Phe Ile Lys Glu His Ile Lys Asn Pro Asp Trp Arg Tyr Arg Asp
370 375 380
Ala Ala Val Met Ala Phe Gly Cys Ile Leu Glu Gly Pro Glu Pro Ser
385 390 395 400
Gln Leu Lys Pro Leu Val Ile Gln Ala Met Pro Thr Leu Ile Glu Leu
405 410 415
Met Lys Asp Pro Ser Val Val Val Arg Asp Thr Ala Ala Trp Thr Val
420 425 430
Gly Arg Ile Cys Glu Leu Leu Pro Glu Ala Ala Ile Asn Asp Val Tyr
435 440 445
Leu Ala Pro Leu Leu Gln Cys Leu Ile Glu Gly Leu Ser Ala Glu Pro
450 455 460
Arg Val Ala Ser Asn Val Cys Trp Ala Phe Ser Ser Leu Ala Glu Ala
465 470 475 480
Ala Tyr Glu Ala Ala Asp Val Ala Asp Asp Gln Glu Glu Pro Ala Thr
485 490 495
Tyr Cys Leu Ser Ser Ser Phe Glu Leu Ile Val Gln Lys Leu Leu Glu
500 505 510
Thr Thr Asp Arg Pro Asp Gly His Gln Asn Asn Leu Arg Ser Ser Ala
515 520 525
Tyr Glu Ser Leu Met Glu Ile Val Lys Asn Ser Ala Lys Asp Cys Tyr
530 535 540
Pro Ala Val Gln Lys Thr Thr Leu Val Ile Met Glu Arg Leu Gln Gln
545 550 555 560
Val Leu Gln Met Glu Ser His Ile Gln Ser Thr Ser Asp Arg Ile Gln
565 570 575
Phe Asn Asp Leu Gln Ser Leu Leu Cys Ala Thr Leu Gln Asn Val Leu
580 585 590
Arg Lys Val Gln His Gln Asp Ala Leu Gln Ile Ser Asp Val Val Met
595 600 605
Ala Ser Leu Leu Arg Met Phe Gln Ser Thr Ala Gly Ser Gly Gly Val
610 615 620
Gln




Asp 


Ala 


Leu 


Met 


Ala 


val 


C a -Y~ 

ber 


i nr 


Leu 


vai 




vai 


T rtn 

Leu 


biy 


tz o c 










/Z O A 




















C A f\ 


Gly Glu Phe Leu Lys Tyr Met Glu Ala Phe Lys Pro Phe Leu Gly Ile
645 650 655
Gly Leu Lys Asn Tyr Ala Glu Tyr Gln Val Cys Leu Ala Ala Val Gly
660 665 670
Leu Val Gly Asp Leu Cys Arg Ala Leu Gln Ser Asn Ile Ile Pro Phe
675 680 685
Cys Asp Glu Val Met Gln Leu Leu Leu Glu Asn Leu Gly Asn Glu Asn
690 695 700
Val His Arg Ser Val Lys Pro Gln Ile Leu Ser Val Phe Gly Asp Ile
705 710 715 720
Ala Leu Ala Ile Gly Gly Glu Phe Lys Lys Tyr Leu Glu Val Val Leu
725 730 735
Asn Thr Leu Gln Gln Ala Ser Gln Ala Gln Val Asp Lys Ser Asp Tyr
740 745 750
Asp Met Val Asp Tyr Leu Asn Glu Leu Arg Glu Ser Cys Leu Glu Ala
755 760 765
Tyr Thr Gly Ile Val Gln Gly Leu Lys Gly Asp Gln Glu Asn Val His
770 775 780
Pro Asp Val Met Leu Val Gln Pro Arg Val Glu Phe Ile Leu Ser Phe
785 790 795 800
Ile Asp His Ile Ala Gly Asp Glu Asp His Thr Asp Gly Val Val Ala
805 810 815
Cys Ala Ala Gly Leu Ile Gly Asp Leu Cys Thr Ala Phe Gly Lys Asp
820 825 830
Val Leu Lys Leu Val Glu Ala Arg Pro Met Ile His Glu Leu Leu Thr
835 840 845
Glu Gly Arg Arg Ser Lys Thr Asn Lys Ala Lys Thr Leu Ala Thr Trp
850 855 860
Ala Thr Lys Glu Leu Arg Lys Leu Lys Asn Gln Ala
865 870 875



<210> 498 
<211> 4205 
<212> DNA 

<213> Homo sapiens 
<400> 498 

ctccctcgct ccctccctgc gcgccgcctc tcactcacag cctcccttcc ttctttctcc 60
ctccgcctcc cgagcaccag cgcgctctga gctgccccca gggtccctcc cccgccgcca 120
gcagcccatt tggagggagg aagtaaggga agaggagagg aaggggagcc ggaccgacta 180
cccagacaga gccggtgaat gggtttgtgg tgacccccgc cccccacccc accctccctt 240
cccacccgac ccccaacccc catccccagt tcgagccgcc gcccgaaagg ccgggccgtc 300
gtcttaggag gagtcgccgc cgccgccacc tccgccatgg agctgatcac cattctcgag 360
aagaccgtgt ctcccgatcg gctggagctg gaagcggcgc agaagttcct ggagcgtgcg 420
gccgtggaga acctgcccac tttccttgtg gaactgtcca gagtgctggc aaatccagga 480
aacagtcagg ttgccagagt tgcagctggt ctacaaatca agaactcttt gacatctaaa 540
gatccagata tcaaggcaca atatcagcag aggtggcttg ctattgatgc taatgctcga 600
cgagaagtca agaactatgt tttgcagaca ttgggtacag aaacttaccg gcctagttct 660
gcctcacagt gtgtggctgg tattgcttgt gcagagatcc cagtaaacca gtggccagaa 720
ctcattcctc agctggtggc caatgtcaca aaccccaaca gcacagagca catgaaggag 780
tcgacattgg aagccatcgg ttatatttgc caagatatag acccagagca gctacaagat 840
aaatccaatg agattctgac tgccataatc caggggatga ggaaagaaga gcctagtaat 900
aatgtgaagc tagctgctac gaatgcactc ctgaactcat tggagttcac caaagcaaac 960
tttgataaag agtctgaaag gcactttatt atgcaggtgg tctgtgaagc cacacagtgt 1020
ccagatacga gggtacgagt ggctgcttta cagaatctgg tgaagataat gtccttatat 1080
tatcagtaca tggagacata tatgggtcct gctctttttg caatcacaat cgaagcaatg 1140
aaaagtgaca ttgatgaggt ggctttacaa gggatagaat tctggtccaa tgtctgtgat 1200
gaggaaatgg atttggccat tgaagcttca gaggcagcag aacaaggacg gccccctgag 1260
cacaccagca agttttatgc gaagggagca ctacagtatc tggttccaat cctcacacag 1320
acactaacta aacaggacga aaatgatgat gacgatgact ggaacccctg caaagcagca 1380 
ggggtgtgcc tcatgcttct ggccacctgc tgtgaagatg acattgtccc acatgtcctc 1440 
cccttcatta aagaacacat caagaaccca gattggcggt accgggatgc agcagtgatg 1500 
gcttttggtt gtatcttgga aggaccagag cccagtcagc tcaaaccact agttatacag 1560 
gctatgccca ccctaataga attaatgaaa gaccccagtg tagttgttcg agatacagct 1620 
gcatggactg taggcagaat ttgtgagctg cttcctgaag ctgccatcaa tgatgtctac 1680 
ttggctcccc tgctacagtg tctgattgag ggtctcagtg ctgaacccag agtggcttca 1740 
aatgtgtgct gggctttctc cagtctggct gaagctgctt atgaagctgc agacgttgct 1800 
gatgatcagg aagaaccagc tacttactgc ttatcttctt catttgaact catagttcag 1860 
aagctcctag agactacaga cagacctgat ggacaccaga acaacctgag gagttctgca 1920 
tatgaatctc tgatggaaat tgtgaaaaac agtgccaagg attgttatcc tgctgtccag 1980 
aaaacgactt tggtcatcat ggaacgactg caacaggttc ttcagatgga gtcacatatc 2040 
cagagcacat ccgatagaat ccagttcaat gaccttcagt ctttactctg tgcaactctt 2100 
cagaatgttc ttcggaaagt gcaacatcaa gatgctttgc agatctctga tgtggttatg 2160 
gcctccctgt taaggatgtt ccaaagcaca gctgggtctg ggggagtaca agaggatgcc 2220 
ctgatggcag ttagcacact ggtggaagtg ttgggtggtg aattcctcaa gtacatggag 2280 
gcctttaaac ccttcctggg cattggatta aaaaattatg ctgaatacca ggtttgtttg 2340 
gcagctgtgg gcttagtggg agacttgtgc cgtgccctgc aatccaacat catacctttc 2400 
tgtgacgagg tgatgcagct gcttctggaa aatttgggga atgagaacgt ccacaggtct 2460 
gtgaagccgc agattctgtc agtgtttggt gatattgccc ttgctattgg aggagagttt 2520
aaaaaatact tagaggttgt attgaatact cttcagcagg cctcccaagc ccaggtggac 2580 
aagtcagact atgacatggt ggattatctg aatgagctaa gggaaagctg cttggaagcc 2640 
tatactggaa tcgtccaggg attaaagggg gatcaggaga acgtacaccc ggatgtgatg 2700 
ctggtacaac ccagagtaga atttattctg tctttcattg accacattgc tggagatgag 2760 
gatcacacag atggagtagt agcttgtgct gctggactaa taggggactt atgtacagca 2820 
tttgggaagg atgtactgaa attagtagaa gctaggccaa tgatccatga attgttaact 2880 
gaagggcgga gatcgaagac taacaaagca aaaacccttg ctacatgggc aacaaaagaa 2940 
ctgaggaaac tgaagaacca agcttgatct gttaccattg ggatgataac ctgaggaccc 3000 
ccactggaaa tctcccatct tttgaaaaac ctggaagtga ggagtgtgca cggatgctga 3060 
atgtttggga atgagaggat gagtgagtga ggcttgaaaa cacaccacat tgaaaatcct 3120 
gccacagcag cagccgcagc cgccaacagc agcgctgtta gtgagctaag taagcactga 3180 
cttcgtagaa aaccataaca tcggccatct tggaaaagag aaaaacaatg gagttactta 3240 
tttaaaaaaa aagaaagaaa gttatctctt cccaggagag gctagaagta gcttttctgt 3300 
cttttggcca gtgccgagtg gaatgcctgg tttgggggag gaggagggac tgggttcagc 3360
tgtggtgctt tgttgtaaaa ggcagcctgg cctttgctac tgaggagaaa gatggagcct 3420 
gggtctcaag cccaccttcg ctgtaccttt gccacatggt actgtatgct tgccagctag 3480 
aaggagggtc agggattttt tacagtctga gaatgagtgt gtgtgagtga ggcggtatcc 3540 
acattctcaa cttcaagtca ttgcagtttc tttttcccag aaaacaaggg gttagatgtt 3600 
gcatttcata aaactaaccg aagttctgtc tactgatgca gcacaagaga tgtaaaaaaa 3660 
aaaaaaaaaa aaaaaaaaaa aacacacaca cagaggaaag acgctcttta ggttttgttt 3720 
tgtttttttt ttttggtttt gttttttgtt ttttttactc tagggaaaac actgacgaat 3780 
ggtcagagct cctatcctga tcttttcatc aaggcgcctt tcctaataat atggttcaac 3840 
tgtgaatgta gaagtggggg ggagggggga gaaaaagaaa actctggcgt tagaggatat 3900 
agaaaaatat aagtacaatt gttacaaata acgcagactt caaaaacaaa aaaatcacaa 3960 
cccaaacaaa ccaaaattta aatgatcaga attggcagca caaagaaaac gccctctcct 4020 
gacttgtatt gtggcagtct gaacgccccc agaaaattgt gccaaagagt ttagaaaaat 4080 
aaatatacaa taaaagtaaa cacatacaca caaaacagca aacttcaggt aactattttg 4140 
gattgcaaac aggataaatt aaatgttcaa acaatctgat aaaataacca tttggaaact 4200 
gaaaa 4205



<210> 499 
<211> 763 
<212> PRT 

<213> Homo sapiens 
<400> 499 

Met Lys Ala Ser Pro Arg Arg Pro Leu Ile Leu Lys Arg Arg Arg Leu
1 5 10 15

Pro Leu Pro Val Gln Asn Ala Pro Ser Glu Thr Ser Glu Glu Glu Pro
20 25 30

515 520 525

Glu Met Leu Val Ile Gln His Arg Glu Arg Arg Glu Arg Ser Arg Ser
530 535 540

Arg Arg Lys Gln His Leu Leu Pro Pro Cys Val Asp Glu Pro Glu Leu
545 550 555 560

Leu Phe Ser Glu Gly Pro Ser Thr Ser Arg Trp Ala Ala Glu Leu Pro
565 570 575

Phe Pro Ala Asp Ser Ser Asp Pro Ala Ser Gln Leu Ser Tyr Ser Gln
580 585 590

Glu Val Gly Gly Pro Phe Lys Thr Pro Ile Lys Glu Thr Leu Pro Ile
595 600 605

Ser Ser Thr Pro Ser Lys Ser Val Leu Pro Arg Thr Pro Glu Ser Trp
610 615 620

Arg Leu Thr Pro Pro Ala Lys Val Gly Gly Leu Asp Phe Ser Pro Val
625 630 635 640

Gln Thr Ser Gln Gly Ala Ser Asp Pro Leu Pro Asp Pro Leu Gly Leu
645 650 655

Met Asp Leu Ser Thr Thr Pro Leu Gln Ser Ala Pro Pro Leu Glu Ser
660 665 670

Pro Gln Arg Leu Leu Ser Ser Glu Pro Leu Asp Leu Ile Ser Val Pro
675 680 685

Phe Gly Asn Ser Ser Pro Ser Asp Ile Asp Val Pro Lys Pro Gly Ser
690 695 700

Pro Glu Pro Gln Val Ser Gly Leu Ala Ala Asn Arg Ser Leu Thr Glu
705 710 715 720

Gly Leu Val Leu Asp Thr Met Asn Asp Ser Leu Ser Lys Ile Leu Leu
725 730 735

Asp Ile Ser Phe Pro Gly Leu Asp Glu Asp Pro Leu Gly Pro Asp Asn
740 745 750

Ile Asn Trp Ser Gln Phe Ile Pro Glu Leu Gln
755 760



<210> 500 

<211> 3336 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (3317)...(3317)

<223> N=A, T, G, or C 

<400> 500 

cggcggcgac tgcagtctgg agggtccaca cttgtgattc tcaatggaga gtgaaaacgc 60 
agattcataa tgaaagctag cccccgtcgg ccactgattc tcaaaagacg gaggctgccc 120 
cttcctgttc aaaatgcccc aagtgaaaca tcagaggagg aacctaagag atcccctgcc 180 
caacaggagt ctaatcaagc agaggcctcc aaggaagtgg cggagtccaa ctcttgcaag 240 
tttccagctg ggatcaagat tattaaccac cccaccatgc ccaacacgca agtagtggcc 300 
atccccaaca atgctaatat tcacagcatc atcacagcac tgactgccaa gggaaaagag 360
agtggcagta gtgggcccaa caaattcatc ctcatcagct gtgggggagc cccaactcag 420 
cctccaggac tccggcctca aacccaaacc agctatgatg ccaaaaggac agaagtgacc 480 
ctggagacct tgggaccaaa acctgcagct agggatgtga atcttcctag accacctgga 540
gccctttgcg agcagaaacg ggagacctgt gcagatggtg aggcagcagg ctgcactatc 600 
aacaatagcc tatccaacat ccagtggctt cgaaagatga gttctgatgg actgggctcc 660 
cgcagcatca agcaagagat ggaggaaaag gagaattgtc acctggagca gcgacaggtt 720 
aaggttgagg agccttcgag accatcagcg tcctggcaga actctgtgtc tgagcggcca 780 
ccctactctt acatggccat gatacaattc gccatcaaca gcactgagag gaagcgcatg 840 
actttgaaag acatctatac gtggattgag gaccactttc cctactttaa gcacattgcc 900 
aagccaggct ggaagaactc catccgccac aacctttccc tgcacgacat gtttgtccgg 960 
gagacgtctg ccaatggcaa ggtctccttc tggaccattc accccagtgc caaccgctac 1020
ttgacattgg accaggtgtt taagccactg gacccagggt ctccacaatt gcccgagcac 1080
ttggaatcac agcagaaacg accgaatcca gagctccgcc ggaacatgac catcaaaacc 1140 
gaactccccc tgggcgcacg gcggaagatg aagccactgc taccacgggt cagctcatac 1200 
ctggtaccta tccagttccc ggtgaaccag tcactggtgt tgcagccctc ggtgaaggtg 1260 
ccattgcccc tggcggcttc cctcatgagc tcagagcttg cccgccatag caagcgagtc 1320 
cgcattgccc ccaaggtgct gctagctgag gaggggatag ctcctctttc ttctgcagga 1380 
ccagggaaag aggagaaact cctgtttgga gaagggtttt ctcctttgct tccagttcag 1440 
actatcaagg aggaagaaat ccagcctggg gaggaaatgc cacacttagc gagacccatc 1500
aaagtggaga gccctccctt ggaagagtgg ccctccccgg ccccatcttt caaagaggaa 1560 
tcatctcact cctgggagga ttcgtcccaa tctcccaccc caagacccaa gaagtcctac 1620 
agtgggctta ggtccccaac ccggtgtgtc tcggaaatgc ttgtgattca acacagggag 1680 
aggagggaga ggagccggtc tcggaggaaa cagcatctac tgcctccctg tgtggatgag 1740 
ccggagctgc tcttctcaga ggggcccagt acttcccgct gggccgcaga gctcccgttc 1800 
ccagcagact cctctgaccc tgcctcccag ctcagctact cccaggaagt gggaggacct 1860 
tttaagacac ccattaagga aacgctgccc atctcctcca ccccgagcaa atctgtcctc 1920 
cccagaaccc ctgaatcctg gaggctcacg cccccagcca aagtaggggg actggatttc 1980 
agcccagtac aaacctccca gggtgcctct gaccccttgc ctgaccccct ggggctgatg 2040 
gatctcagca ccactccctt gcaaagtgct cccccccttg aatcaccgca aaggctcctc 2100 
agttcagaac ccttagacct catctccgtc ccctttggca actcttctcc ctcagatata 2160 
gacgtcccca agccaggctc cccggagcca caggtttctg gccttgcagc caatcgttct 2220 
ctgacagaag gcctggtcct ggacacaatg aatgacagcc tcagcaagat cctgctggac 2280 
atcagctttc ctggcctgga cgaggaccca ctgggccctg acaacatcaa ctggtcccag 2340 
tttattcctg agctacagta gagccctgcc cttgcccctg tgctcaagct gtccaccatc 2400 
ccgggcactc caaggctcag tgcaccccaa gcctctgagt gaggacagca ggcagggact 2460
gttctgctcc tcatagctcc ctgctgcctg attatgcaaa agtagcagtc acaccctagc 2520 
cactgctggg accttgtgtt ccccaagagt atctgattcc tctgctgtcc ctgccaggag 2580 
ctgaagggtg ggaacaacaa aggcaatggt gaaaagagat taggaacccc ccagcctgtt 2640 
tccattctct gcccagcagt ctcttacctt ccctgatctt tgcagggtgg tccgtgtaaa 2700 
tagtataaat tctccaaatt atcctctaat tataaatgta agcttatttc cttagatcat 2760 
tatccagaga ctgccagaag gtgggtagga tgacctgggg tttcaattga cttctgttcc 2820 
ttgcttttag ttttgataga agggaagacc tgcagtgcac ggtttcttcc aggctgaggt 2880 
acctggatct tgggttcttc actgcaggga cccagacaag tggatctgct tgccagagtc 2940 
ctttttgccc ctccctgcca cctccccgtg tttccaagtc agctttcctg caagaagaaa 3000 
tcctggttaa aaaagtcttt tgtattgggt caggagttga atttggggtg ggaggatgga 3060 
tgcaactgaa gcagagtgtg ggtgcccaga tgtgcgctat tagatgtttc tctgataatg 3120 
tccccaatca taccagggag actggcattg acgagaactc aggtggaggc ttgagaaggc 3180 
cgaaagggcc cctgacctgc ctggcttcct tagcttgccc ctcagctttg caaagagcca 3240 
ccctaggccc cagctgaccg catgggtgtg agccagcttg agaacactaa ctactcaata 3300 
aaagcgaagg tggaccnaaa aaaaaaaaaa aaaaaa 3336 

<210> 501 
<211> 952 
<212> PRT 
<213> Homo sapiens 
Ser Asp Ile
1 5 10 15

Ala Thr Leu Leu Lys Thr Ser Leu Arg Lys Gly Asp Thr Trp Tyr Leu
20 25 30

Val Asp Ser Arg Trp Phe Lys Gln Trp Lys Lys Tyr Val Gly Phe Asp
35 40 45

Ser Trp Asp Lys Tyr Gln Met Gly Asp Gln Asn Val Tyr Pro Gly Pro
50 55 60

Ile Asp Asn Ser Gly Leu Leu Lys Asp Gly Asp Ala Gln Ser Leu Lys
65 70 75 80

Glu His Leu Ile Asp Glu Leu Asp Tyr Ile Leu Leu Pro Thr Glu Gly
85 90 95

Trp Asn Lys Leu Val Ser Trp Tyr Thr Leu Met Glu Gly Gln Glu Pro
100 105 110

Ile Ala Arg Lys Val Val Glu Gln Gly Met Phe Val Lys His Cys Lys
115 120 125

Val Glu Val Tyr Leu Thr Glu Leu Lys Leu Cys Glu Asn Gly Asn Met
130 135 140

Asn Asn Val Val Thr Arg Arg Phe Ser Lys Ala Asp Thr Ile Asp Thr
145 150 155 160

Ile Glu Lys Glu Ile Arg Lys Ile Phe Ser Ile Pro Asp Glu Lys Glu
165 170 175

Thr Arg Leu Trp Asn Lys Tyr Met Ser Asn Thr Phe Glu Pro Leu Asn
180 185 190

Lys Pro Asp Ser Thr Ile Gln Asp Ala Gly Leu Tyr Gln Gly Gln Val
195 200 205

Leu Val Ile Glu Gln Lys Asn Glu Asp Gly Thr Trp Pro Arg Gly Pro
210 215 220

Ser Thr Pro Asn Val Lys Asn Ser Asn Tyr Cys Leu Pro Ser Tyr Thr
225 230 235 240

Ala Tyr Lys Asn Tyr Asp Tyr Ser Glu Pro Gly Arg Asn Asn Glu Gln
245 250 255

Pro Gly Leu Cys Gly Leu Ser Asn Leu Gly Asn Thr Cys Phe Met Asn
260 265 270

Ser Ala Ile Gln Cys Leu Ser Asn Thr Pro Pro Leu Thr Glu Tyr Phe
275 280 285

Leu Asn Asp Lys Tyr Gln Glu Glu Leu Asn Phe Asp Asn Pro Leu Gly
290 295 300

Met Arg Gly Glu Ile Ala Lys Ser Tyr Ala Glu Leu Ile Lys Gln Met
305 310 315 320

Trp Ser Gly Lys Phe Ser Tyr Val Thr Pro Arg Ala Phe Lys Thr Gln
325 330 335

Val Gly Arg Phe Ala Pro Gln Phe Ser Gly Tyr Gln Gln Gln Asp Cys
340 345 350

Gln Glu Leu Leu Ala Phe Leu Leu Asp Gly Leu His Glu Asp Leu Asn
355 360 365

Arg Ile Arg Lys Lys Pro Tyr Ile Gln Leu Lys Asp Ala Asp Gly Arg
370 375 380

Pro Asp Lys Val Val Ala Glu Glu Ala Trp Glu Asn His Leu Lys Arg
385 390 395 400

Asn Asp Ser Ile Ile Val Asp Ile Phe His Gly Leu Phe Lys Ser Thr
405 410 415

Leu Val Cys Pro Glu Cys Ala Lys Ile Ser Val Thr Phe Asp Pro Phe
420 425 430

Cys Tyr Leu Thr Leu Pro Leu Pro Met Lys Lys Glu Arg Thr Leu Glu
435 440 445

Val Tyr Leu Val Arg Met Asp Pro Leu Thr Lys Pro Met Gln Tyr Lys
450 455 460

Val Val Val Pro Lys Ile Gly Asn Ile Leu Asp Leu Cys Thr Ala Leu
465 470 475 480

Ser Ala Leu Ser Gly Ile Pro Ala Asp Lys Met Ile Val Thr Asp Ile
485 490 495

Tyr Asn His Arg Phe His Arg Ile Phe Ala Met Asp Glu Asn Leu Ser
500 505 510

Ser Ile Met Glu Arg Asp Asp Ile Tyr Val Phe Glu Ile Asn Ile Asn
515 520 525

Arg Thr Glu Asp Thr Glu His Val Ile Ile Pro Val Cys Leu Arg Glu
530 535 540

Lys Phe Arg His Ser Ser Tyr Thr His His Thr Gly Ser Ser Leu Phe
545 550 555 560

Gly Gln Pro Phe Leu Met Ala Val Pro Arg Asn Asn Thr Glu Asp Lys
565 570 575

Leu Tyr Asn Leu Leu Leu Leu Arg Met Cys Arg Tyr Val Lys Ile Ser
580 585 590

Thr Glu Thr Glu Glu Thr Glu Gly Ser Leu His Cys Cys Lys Asp Gln
595 600 605

Asn Ile Asn Gly Asn Gly Pro Asn Gly Ile His Glu Glu Gly Ser Pro
610 615 620

Ser Glu Met Glu Thr Asp Glu Pro Asp Asp Glu Ser Ser Gln Asp Gln
625 630 635 640

Glu Leu Pro Ser Glu Asn Glu Asn Ser Gln Ser Glu Asp Ser Val Gly
645 650 655

Gly Asp Asn Asp Ser Glu Asn Gly Leu Cys Thr Glu Asp Thr Cys Lys
660 665 670

Gly Gln Leu Thr Gly His Lys Lys Arg Leu Phe Thr Phe Gln Phe Asn
675 680 685

Asn Leu Gly Asn Thr Asp Ile Asn Tyr Ile Lys Asp Asp Thr Arg His
690 695 700

Ile Arg Phe Asp Asp Arg Gln Leu Arg Leu Asp Glu Arg Ser Phe Leu
705 710 715 720

Ala Leu Asp Trp Asp Pro Asp Leu Lys Lys Arg Tyr Phe Asp Glu Asn
725 730 735

Ala Ala Glu Asp Phe Glu Lys His Glu Ser Val Glu Tyr Lys Pro Pro
740 745 750

Lys Lys Pro Phe Val Lys Leu Lys Asp Cys Ile Glu Leu Phe Thr Thr
755 760 765

Lys Glu Lys Leu Gly Ala Glu Asp Pro Trp Tyr Cys Pro Asn Cys Lys
770 775 780

Glu His Gln Gln Ala Thr Lys Lys Leu Asp Leu Trp Ser Leu Pro Pro
785 790 795 800

Val Leu Val Val His Leu Lys Arg Phe Ser Tyr Ser Arg Tyr Met Arg
805 810 815

Asp Lys Leu Asp Thr Leu Val Asp Phe Pro Ile Asn Asp Leu Asp Met
820 825 830

Ser Glu Phe Leu Ile Asn Pro Asn Ala Gly Pro Cys Arg Tyr Asn Leu
835 840 845

Ile Ala Val Ser Asn His Tyr Gly Gly Met Gly Gly Gly His Tyr Thr
850 855 860

Ala Phe Ala Lys Asn Lys Asp Asp Gly Lys Trp Tyr Tyr Phe Asp Asp
865 870 875 880

Ser Ser Val Ser Thr Ala Ser Glu Asp Gln Ile Val Ser Lys Ala Ala
885 890 895

Tyr Val Leu Phe Tyr Gln Arg Gln Asp Thr Phe Ser Gly Thr Gly Phe
900 905 910

Phe Pro Leu Asp Arg Glu Thr Lys Gly Ala Ser Ala Ala Thr Gly Ile
915 920 925

Pro Leu Glu Ser Asp Glu Asp Ser Asn Asp Asn Asp Asn Asp Ile Glu
930 935 940

Asn Glu Asn Cys Met His Thr Asn
945 950



<210> 502 

<211> 4611 

<212> DNA 

<213> Homo sapiens 



<400> 502 

tggaagaaga tggcggaagg cggagcggcg gatctggaca cccagcggtc tgacatcgcg 60
acgctgctca aaacctcgct ccggaaaggg gacacctggt acctagtcga tagtcgctgg 120
ttcaaacagt ggaaaaaata tgttggcttt gacagttggg acaaatacca gatgggagat 180
caaaatgtgt atcctggacc cattgataac tctggacttc tcaaagatgg tgatgcccag 240
tcacttaagg aacatcttat tgatgaattg gattacatac tgttgccaac tgaaggttgg 300
aataaacttg tcagctggta cacattgatg gaaggtcaag agccaatagc acgaaaggtg 360
gttgaacagg gtatgtttgt aaagcactgc aaagtagaag tatatctcac agaattgaag 420
ctatgtgaaa atggaaacat gaataatgtt gtaactcgaa gatttagcaa agctgacaca 480
atagatacaa ttgaaaagga aataagaaaa atcttcagta ttccagatga aaaggagacc 540
agattgtgga acaaatacat gagtaacaca tttgaaccac tgaataaacc agacagcacc 600
attcaggatg ctggtttata ccaaggacag gtattagtga tagaacagaa aaatgaagat 660 
ggaacatggc caaggggtcc ttctactcct aatgtgaaaa actcaaatta ctgtcttcca 720 
tcatataccg cttataagaa ctatgattat tcggaacctg gaagaaacaa tgaacagcca 780 
ggcctctgtg gcctaagtaa cttgggaaat acgtgtttca tgaactcagc tattcagtgt 840 
ttgagcaaca cacctccact tactgagtat ttcctcaatg ataagtatca agaagaactg 900 
aattttgaca atcccttagg aatgagaggt gaaatagcta aatcttatgc cgaactgatc 960 
aagcaaatgt ggtctggaaa gtttagctac gtcaccccaa gagcctttaa gacacaggta 1020 
ggacgttttg cacctcagtt ctctggatat cagcagcaag actgtcaaga actgttagct 1080
ttcctattag atggattaca tgaggatttg aatagaatta ggaaaaaacc atatatacaa 1140 
ttaaaagatg cagatggaag gccagataag gtggttgccg aagaagcctg ggaaaaccat 1200 
ttaaaacgaa atgattctat catagtagat atatttcatg gccttttcaa atcaacttta 1260 
gtttgtcctg agtgtgctaa gatttcagta acatttgatc ctttttgtta cttgacactt 1320 
ccattgccca tgaaaaaaga acgcaccttg gaagtttact tagttagaat ggatccgctt 1380 
accaaaccta tgcagtacaa agtggttgtc cccaaaattg gaaacatatt agatctttgt 1440 
acagcattgt ctgctttgtc aggaatacct gcagataaga tgatagttac tgatatatac 1500 
aatcatagat ttcacagaat attcgctatg gatgaaaacc ttagtagtat tatggaacgg 1560 
gatgatattt atgtgtttga aattaacatc aataggacag aagatacaga gcacgtgatt 1620 
attcctgttt gcctaagaga aaaattcaga cactcgagtt atacccacca tactggttct 1680 
tcactttttg gtcagccctt tcttatggct gtaccacgaa acaatactga agacaaactt 1740 
tataatctcc tgctcttgag aatgtgccga tatgtcaaaa tatctactga aactgaagaa 1800 
actgaaggat ccctacactg ctgtaaggac caaaatatta atgggaatgg cccaaatggc 1860 
atacatgaag aaggctcacc aagtgaaatg gaaacagatg agccagatga tgaatccagc 1920 
caggatcaag aacttccctc agagaatgaa aacagtcagt ctgaagattc agttggagga 1980 
gataatgatt ctgaaaatgg attatgtact gaggatactt gcaaaggtca actcacggga 2040 
cacaaaaaac gattgtttac attccagttc aacaacttag gcaatactga tatcaactac 2100 
atcaaagatg ataccaggca tataagattt gatgataggc agcttaggct agatgaaaga 2160 
tcttttcttg ctttggattg ggatcctgat ttgaaaaaaa gatattttga tgaaaatgct 2220 
gctgaggact ttgaaaaaca tgaaagtgtg gagtataaac ctcctaaaaa accctttgtg 2280 
aaattaaaag attgcattga actttttaca acaaaagaaa agctaggtgc tgaagatccc 2340 
tggtattgtc cgaattgtaa agaacatcag caagccacaa agaaattgga tttatggtcc 2400 
ctgcctccag tacttgtagt acatctcaag cgattttctt acagtcgata catgagagac 2460 
aagttggata ccttagttga ttttcctatc aatgacttgg atatgtcgga attcttaatt 2520 
aatccaaatg caggtccttg ccgctataat ctgattgctg tttccaacca ctatggaggg 2580 
atgggaggag gacactatac tgcttttgca aaaaataaag atgatggaaa atggtactat 2640 
tttgatgaca gtagtgtctc cactgcatct gaagaccaaa ttgtgtccaa agcagcatat 2700 
gtactcttct accagagaca agacactttc agtggaactg gcttttttcc tcttgaccga 2760 
gaaactaaag gtgcttcagc tgccactggc atcccattag aaagtgatga agatagcaat 2820 
gataatgaca atgatataga aaatgaaaac tgtatgcaca ctaactaatg aaagtcctag 2880 
aagccataaa agagacactt tcctgctggt ggtatctatg gaaatgatga agttacccac 2940 
cacattaaaa caaaagtctg agatggggag tttcagataa ccgaatgtaa atcctttatc 3000 
agattttaac ttgtgcagta cttgaagtga aacacaatga aaactttaac agaaattgtc 3060 
tcttaataca tttacagtct tgtatttaca agctaaatat atataggaaa tcacaaataa 3120 
atccctttta agtttgctgc tgttttgatt aattatgttt cctttaattt tttggatgtg 3180 
agttgatgac aaatgttaaa tttgtggatg ttggtcattt attttgctct aataatactg 3240 
caagaaaact atggctgaac ttagtttttg gataatctag ttcatgtgca ttaggctgcc 3300 
gtatatacta ctatgatatt tagctgcagt atcaatgtgg cataaatact gcggcagcat 3360 
tcttcgaaaa ccgcattttc aaatttatcc ccgtaatcaa tgtgggccta ttaaaaaacc 3420 
aaatcatgat acaagaaaca accttggaaa agttatttcc ctttgtaata cctttaaaaa 3480 
tccagagtat tatattcatt ttactgccac tgtggaaaca ggttctagat ttcatactcc 3540 
atctaaagtc atttttcagt tacatgtaag tattatttac tgctattggg atctattctt 3600 
cagatattca aaaacatttt ttgctttaaa atgcatatct ttaattgggt gttggtccaa 3660 
aattaaaatt tttgctgtct gtttttctct accccatttt gtggtaattt ggcaacttga 3720 
gctctcccaa ttattgtaat ataaggacag acataatagt attctgtacc catagtaata 3780 
attgcatcaa gcttagatga gaaatttttt tcatatactg gcctttaaat cattaatgga 3840 
caattggcta taaaggtagg tctgttaact ttctttgtgt gttcctgatg gaattcacca 3900 
tagccttaca gcttttctca gaaggtaact tgttataaga gaaatggcat ttgcatgttc 3960 
cagagtcaga acctgtacag tatataaagc ataaacacct tgaatttgat tttagttcac 4020 
cacattcaag gatccaggat gccaaaaata tgtgtgaaca tttgaaacat ttttatttgc 4080 
tgctttttcc ctttgatata gttgaaagaa tgtattgtaa attggcatac aatactgcct 4140 
ttaatagaaa acctaaatgc tggggcacat ttcacatatc gactacctga gaaattgctt 4200
tgtgtcccac tgttattaaa gcaaaaagta taatgttctt cacttgaact aatcaaatac 4260
aatagattat aaattgtgta ttgtaaataa aagttcactc tgtgagtgca cattttggta 4320
aattatttat ttatgttagc atttaaaagt ttaaaaaaaa tgcatttgac caggataaat 4380
gaggtatgtt gatcatggct ttgctttata tcttgatatt aaagctggtt tatcatcctg 4440
gtattttaaa agctgctttg gggttttttt tttctttttt ttttttttaa tgtaaactaa 4500
cctcctgcat gacagaggtt aaaattttgt ctgcacttga gttcacttga gtttacattt 4560
gaaatgtgca tgtttattta tatagatttc aataaagcta ttcaaggcct t 4611



<210> 503
<211> 187
<212> PRT

<213> Homo sapiens
<400> 503

Met Val Gly Ser Leu Asn Cys Ile Val Ala Val Ser Gln Asn Met Gly
1 5 10 15

Ile Gly Lys Asn Gly Asp Leu Pro Trp Pro Pro Leu Arg Asn Glu Phe
20 25 30

Arg Tyr Phe Gln Arg Met Thr Thr Thr Ser Ser Val Glu Gly Lys Gln
35 40 45

Asn Leu Val Ile Met Gly Lys Lys Thr Trp Phe Ser Ile Pro Glu Lys
50 55 60

Asn Arg Pro Leu Lys Gly Arg Ile Asn Leu Val Leu Ser Arg Glu Leu
65 70 75 80

Lys Glu Pro Pro Gln Gly Ala His Phe Leu Ser Arg Ser Leu Asp Asp
85 90 95

Ala Leu Lys Leu Thr Glu Gln Pro Glu Leu Ala Asn Lys Val Asp Met
100 105 110

Val Trp Ile Val Gly Gly Ser Ser Val Tyr Lys Glu Ala Met Asn His
115 120 125

Pro Gly His Leu Lys Leu Phe Val Thr Arg Ile Met Gln Asp Phe Glu
130 135 140

Ser Asp Thr Phe Phe Pro Glu Ile Asp Leu Glu Lys Tyr Lys Leu Leu
145 150 155 160

Pro Glu Tyr Pro Gly Val Leu Ser Asp Val Gln Glu Glu Lys Gly Ile
165 170 175

Lys Tyr Lys Phe Glu Val Tyr Glu Lys Asn Asp
180 185



<210> 504 
<211> 3900 
<212> DNA 

<213> Homo sapiens 



<400> 504 

cgatggccct gcccagtccc agacagaacc tactacgtgc ggcggcagct ggggcgggaa 60
ggcgggcgct gggggcgctg cggccgctgc agcgcagggt ccacctggtc ggctgcacct 120
gtggaggagg aggtggattt caggcttccc gtagactgga agaatcggct caaaaccgct 180
tgcctcgcag gggctgagct ggaggcagcg aggccgcccg acgcaggctt ccggcgagac 240
atggcagggc aaggatggca gcccggcggc agggcccggc gaggagcgcg aacccgcggc 300
cgcagttccc aggcgtctgc gggcgcgagc acgccgcgac cctgcgtgcg ccggggcggg 360
ggggcggggc ctcgcctgca caaataggga cgagggggcg gggcggccac aatttcgcgc 420
caaacttgac cgcgcgttct gctgtaacga gcgggctcgg aggtcctccc gctgctgtca 480
tggttggttc gctaaactgc atcgtcgctg tgtcccagaa catgggcatc ggcaagaacg 540
gggacctgcc ctggccaccg ctcaggaatg aattcagata tttccagaga atgaccacaa 600
cctcttcagt agaaggtaaa cagaatctgg tgattatggg taagaagacc tggttctcca 660
ttcctgagaa gaatcgacct ttaaagggta gaattaattt agttctcagc agagaactca 720
aggaacctcc acaaggagct cattttcttt ccagaagtct agatgatgcc ttaaaactta 780
ctgaacaacc agaattagca aataaagtag acatggtctg gatagttggt ggcagttctg 840
tttataagga agccatgaat cacccaggcc atcttaaact atttgtgaca aggatcatgc 900
aagactttga aagtgacacg ttttttccag aaattgattt ggagaaatat aaacttctgc 960
cagaataccc aggtgttctc tctgatgtcc aggaggagaa aggcattaag tacaaatttg 1020 
aagtatatga gaagaatgat taatatgaag gtgttttcta gtttaagttg ttccccctcc 1080 
ctctgaaaaa agtatgtatt tttacattag aaaaggtttt ttgttgactt tagatctata 1140 
attatttcta agcaacttgt ttttattccc cactactctt ctctctatca gataccattt 1200 
atgagacatt cttgctataa ctaagtgctt ctccaagacc ccaactgagt ccccagcacc 1260 
tgctacagtg agctgccatt ccacacccat cacatgtggc actcttgcca gtccttgaca 1320 
ttgtcgggct tttcacatgt tggtaatatt tattaaagat gaagatccac atacccttca 1380 
actgagcagt ttcactagtg gaaataccaa aagcttccta cgtgtatatc cagaggtttg 1440 
tagataaatg ttgccacctt gtttgtaaca gtgaaaaatt gaaaacaacc tggaagtcca 1500 
gtgatgggaa aatgagtatg tttctgtctt agattgggga acccaaagca gattgcaaga 1560 
ctgaaatttc agtgaaagca gtgtatttgc taggtcatac cagaaatcat caattgaggt 1620 
acggagaaac tgaactgaga aggtaagaaa agcaatttaa agtcagcgag caggttctca 1680 
ttgataacaa gctccatact gctgagatac agggaaatgg aggggggaaa gctggagtat 1740 
tgatcccgcc cccctccttg gttgtcagct ccctgtcctg tgtgtgggcg gaacatagtc 1800 
cagctgctct atagcaagtc tcaggtgttt gcagtaagaa gctgctggca tgcacgggaa 1860 
cagtgaatgc caaacactta aagcaattcg atgtttaagt atgtaagttc tttttttttt 1920 
agacagcgtt tcgctcttgt tgcccaggct agcatgcaat ggtgtgacct cggcttactg 1980 
caacctccgc cttcccagat tcaagcgatt ctcctgcctc aggctcccaa gtagctagga 2040 
ccaggtgcgc gccaccacgc ccggctaatt tttgtatttt gtatttttag tagagatggg 2100 
gtttcaccat gttggtcagg ctagtctcga actcgtgacc gcaagcgatt cacccacctc 2160 
agcctcccaa agtgctggga ttaccggctt gagccaccac acccggcaca tcttcattct 2220 
ttttatgtag taaaaagtat aaggccacac atggtttatt tgaagtattt tataatttaa 2280 
aaaaatacag aagcaggaaa accaattata agttcaagtg agggatgatg gttgcttgaa 2340 
ccaaagggtt gcatgtagta agaaattgtg atttaagata tattttaaag ttataagtag 2400 
caggatattc tgatggagtt tgactttggt tttgggccca gggagtttca gatgcctttg 2460 
agaaatgaat gaagtagaga gaaaataaaa gaaaaaccag ccaggcacag tggctcacac 2520 
ctgtaatccc agcgctttgg gaggctaagg caggcagatc acttgagacc agcttgggca 2580 
acatggcaaa gccccatctc tacaaaaaac acaaaaatta gctgggcatt gtggcgcaca 2640 
cctgtattcc catctagtca ggaagctgag atggaagaat taattgagcc cacgagttca 2700 
aggctgcagt gagtcgtgat tgtgccactg cactccagcc ggggtgacag aagagacctt 2760 
gtctcgaaaa cgaatctgaa aacaatggaa ccatgccttc ataattctag aaagttattt 2820 
tcaactgata aatctatatt cacccaaata atcaagggtg aaggtaaaat aatacatttt 2880 
tagacaagca aagactcagg ggttacctcc atgtgccctt tttagggaag ctgttggaga 2940 
aaatactcca gcaaaatgaa ggagtacaca aaccagagaa tgacatgaat ccagcaaata 3000 
ggatccaaca caggcaatat tccagctatg gagctagctt taaaaaggaa cagtaaaaat 3060 
attaatcggt tagctgggtg gaatggccca tgcctgtagt cccagctact caggaggctc 3120
agcagcagga cgacttgagc ccaagagttc cagaccagcc tggccacctt agtgagatcc 3180 
cttctcttaa aaataataac ttattgccag atttggggca tttggaaaga agttcattga 3240 
agataaagca aaagtaaaaa aaaaaaaaaa aaaaacaagg ggaaagggtt ggttaggcaa 3300 
tcattctagg gcagaaagaa gtacaggata ggaagagcat aatacactgt ttttctcaac 3360
aaggagcagt atgtacacag tcataatgat gtgactgctt agcccctaaa tatggtaact 3420 
actctgggac aatatgggag gaaaagtgaa gattgtgatg gtgtaagagc taatcctcat 3480 
ctgtcatatc cagaaatcac tatataatat ataataatga aatgactaag ttatgtgagg 3540 
aaaaaaacag aagacattgc taaaagagtt aaaagtcatt gctctggaga attaggaggg 3600 
atggggcagg ggactgttag gatgcattat aaactgaaaa gcctttttaa aattttatgt 3660 
attaatatat gcattcactt gaaaaactaa aaaaaaacaa taatttggaa aaacccatga 3720 
aggtaactaa cggaaggaaa aactaagaga atgaaaagta tttgcctctg gaaagaacaa 3780 
ctggcaggac tgttgttttc attgtaagac ttttggagcc atttaattgt acttaaccat 3840 
tttcatctat ttctttaata agaacaattc catcttaata aagagttaca cttgttaata 3900 



<210> 505 

<211> 1616 

<212> PRT 

<213> Homo sapiens 



<400> 505

Met Pro Ala Arg Thr Ala Pro Ala Arg Val Pro Thr Leu Ala Val Pro 

1 5 10 15 

Ala lie Ser Leu Pro Asp Asp Val Arg Arg Arg Leu Lys Asp Leu Glu 
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Arg 


Asp 


Ser 


Leu 






35 




Leu 


His 


Glu 


Phe 




50 






Glu 


Thr 


Lys 


Leu 


65 








Lys 
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Lys 


Ser 


His 


Ala 


Tyr 


Asn 








100 


Ala 


Arg 


Ser 


Glu 






115 




Pro 


Lys 


Pro 


Leu 




130 






Gly 


Glu 


Ala 


Lys 


145 








Ser 


Thr 


Arg 


Gin 


Lys 


Arg 


Lys 


Pro 








180 


lie 


Lys 


Glu 


Glu 






195 




Ser 


Arg 


Glu 


Arg 




210 






Ala 


Lys 


Ser 


Gly 


225 








Glu 


Glu 


Lys 


Arg 


Lys 


Leu 


Lys 


Glu 








260 


Asp 


Glu Asp 


Glu 






275 




Gin 


Pro 


Lys 


Asp 




290 






Glu 


Lys 


Val 


Asn 


305 








Glu 


Glu 


Lys 


Arg 


Met 


Ala 


Arg 


Ala 








340 


Cys 


He 


Gin 


Cys 






355 




Gin 


His 


Pro 


Pro 




370 






Lys 


Leu 


Ser 


He 


385 








Ala 


Leu 


Pro 


Gin 


Gly 


His 


Leu 


Cys 








420 


Leu 


Phe 


Phe 


Ser 






435 




Leu 


Glu 


Gly 


Gly 




450 






Trp 


He 


Thr 


Gly 


465 








Thr 


Ser 


Phe 


Ala 


Pro 


He 


Phe 


Gly 








500 



Thr Glu Lys Glu 
40 

Leu Gin Thr Glu 
55 

Arg Lys Glu Glu 
70 

Leu Leu Asn Lys 
85 

Arg Glu Val Asn 

Ala Arg Arg Val 
120 

Ser Lys Pro Arg 
135 

Pro Glu Pro Ser 
150 

Thr Thr He Thr 
165 

Gin Glu Glu Ser 

Asp Lys Asp Gin 
200 

Val Ala Arg Pro 
215 

Thr Arg Thr Glu 
230 

Leu Arg Ser Gin 
245 

Glu Pro Asp Arg 

Asp Gly Asp Glu 
280 

Leu Ala Ala Lys 
295 

Pro Gin He Ser 
310 

Arg Lys Thr Thr 
325 

Lys Thr Val Met 

Gly Gin Tyr Leu 
360 

Asp Ala Val Asp 
375 

Phe Asp Ala Asn 
390 

His Lys Leu Thr 
405 

Pro He Asp Thr 

Gly Ser Ala Lys 
440 

Val Asn Gly Lys 
455 

Phe Asp Gly Gly 
470 

Glu Tyr lie Leu 
485 

Leu Met Gin Glu 



Cys 


val 


Lys 


GlU 


lie 


Lys 


Asn 


Gin 








60 


Leu 


Ser 


blU 


Glu 






75 




Asp 


Leu 


Ser 


Leu 




90 






Gly 


Arg 


Leu 


Glu 


105 








Gly 


Met 


TV "1 _ 

Ala 


Asp 


Thr 


Pro 


Arg 


Arg 








140 


Pro 


Ser 


Pro 


Arg 






155 




Ser 


His 


Phe 


Ala 




170 






Glu 


Arg 


Ala 


Lys 


185 








Asp 


Glu 


Lys 


Arg 


Leu 


Pro 


Ala 


Glu 








220 


Lys 


Glu 


Glu 


Glu 






235 




Thr 


Lys 


Glu 


Pro 




250 






Glu 


Ala 


Arg 


Ala 


265 








Lys 


Asp 


Glu 


Lys 


Arg Arg 


Pro 


Glu 








300 


Asp 


Glu 


Lys 


Asp 






315 




Pro 


Lys 


Glu 


Pro 




330 






Asn 


Ser 


Lys 


Thr 


345 








Asp 


Asp 


Pro 


Asp 


Glu 


Pro 


Gin 


Met 








380 


Glu 


Ser 


Gly 


Pne 






395 




Cys 


Phe 


Ser 


Val 




410 






Gly 


Leu 


He 


Glu 


425 








Pro 


He 


Tyr 


Asp 


Asn 


Leu 


Gly 


Pro 








4 60 


Glu 


Lys 


Ala 


Leu 






475 




Met 


Asp 


Pro 


Ser 




490 






Lys 


He 


Tyr 


He 


505 









30 

Lys Leu Asn Leu 
45 

Leu Cys Asp Leu 

Gly Tyr Leu Ala 
80 

Glu Asn Gly Ala 
95 

Asn Gly Asn Gin 
110 

Ala Asn Ser Pro 
125 

Ser Lys Ser Asp 

He Thr Arg Lys 
160 

Lys Gly Pro Ala 
175 

Ser Asp Glu Ser 
190 

Arg Arg Val Thr 
205 

Glu Pro Glu Arg 

Arg Asp Glu Lys 
240 

Thr Pro Lys Gln 
255 

Gly Val Gln Ala 
270 

Lys His Arg Ser 
285 

Glu Lys Glu Pro 

Glu Asp Glu Lys 
320 

Thr Glu Lys Lys 
335 

His Pro Pro Lys 

350 
Leu Lys Tyr Gly 
365 

Leu Thr Asn Glu 

Glu Ser Tyr Glu 
400 

Tyr Cys Lys His 
415 

Lys Asn Ile Glu 
430 

Asp Asp Pro Ser 
445 

Ile Asn Glu Trp 

Ile Gly Phe Ser 
480 

Pro Glu Tyr Ala 
495 

Ser Lys Ile Val 
510 
Val Glu Phe Leu Gln Ser Asn Ser Asp Ser Thr Tyr Glu Asp Leu Ile 

515 520 525 

Asn Lys Ile Glu Thr Thr Val Pro Pro Ser Gly Leu Asn Leu Asn Arg 

530 535 540 

Phe Thr Glu Asp Ser Leu Leu Arg His Ala Gln Phe Val Val Glu Gln 
545 550 555 560 

Val Glu Ser Tyr Asp Glu Ala Gly Asp Ser Asp Glu Gln Pro Ile Phe 

565 570 575 

Leu Thr Pro Cys Met Arg Asp Leu Ile Lys Leu Ala Gly Val Thr Leu 

580 585 590 

Gly Gln Arg Arg Ala Gln Ala Arg Arg Gln Thr Ile Arg His Ser Thr 

595 600 605 

Arg Glu Lys Asp Arg Gly Pro Thr Lys Ala Thr Thr Thr Lys Leu Val 

610 615 620 

Tyr Gln Ile Phe Asp Thr Phe Phe Ala Glu Gln Ile Glu Lys Asp Asp 
625 630 635 640 

Arg Glu Asp Lys Glu Asn Ala Phe Lys Arg Arg Arg Cys Gly Val Cys 

645 650 655 

Glu Val Cys Gln Gln Pro Glu Cys Gly Lys Cys Lys Ala Cys Lys Asp 

660 665 670 

Met Val Lys Phe Gly Gly Ser Gly Arg Ser Lys Gln Ala Cys Gln Glu 

675 680 685 

Arg Arg Cys Pro Asn Met Ala Met Lys Glu Ala Asp Asp Asp Glu Glu 

690 695 700 

Val Asp Asp Asn Ile Pro Glu Met Pro Ser Pro Lys Lys Met His Gln 
705 710 715 720 

Gly Lys Lys Lys Lys Gln Asn Lys Asn Arg Ile Ser Trp Val Gly Glu 

725 730 735 

Ala Val Lys Thr Asp Gly Lys Lys Ser Tyr Tyr Lys Lys Val Cys Ile 

740 745 750 

Asp Ala Glu Thr Leu Glu Val Gly Asp Cys Val Ser Val Ile Pro Asp 

755 760 765 

Asp Ser Ser Lys Pro Leu Tyr Leu Ala Arg Val Thr Ala Leu Trp Glu 

770 775 780 

Asp Ser Ser Asn Gly Gln Met Phe His Ala His Trp Phe Cys Ala Gly 
785 790 795 800 

Thr Asp Thr Val Leu Gly Ala Thr Ser Asp Pro Leu Glu Leu Phe Leu 

805 810 815 

Val Asp Glu Cys Glu Asp Met Gln Leu Ser Tyr Ile His Ser Lys Val 

820 825 830 

Lys Val Ile Tyr Lys Ala Pro Ser Glu Asn Trp Ala Met Glu Gly Gly 

835 840 845 

Met Asp Pro Glu Ser Leu Leu Glu Gly Asp Asp Gly Lys Thr Tyr Phe 

850 855 860 

Tyr Gln Leu Trp Tyr Asp Gln Asp Tyr Ala Arg Phe Glu Ser Pro Pro 
865 870 875 880 

Lys Thr Gln Pro Thr Glu Asp Asn Lys Phe Lys Phe Cys Val Ser Cys 

885 890 895 

Ala Arg Leu Ala Glu Met Arg Gln Lys Glu Ile Pro Arg Val Leu Glu 

900 905 910 

Gln Leu Glu Asp Leu Asp Ser Arg Val Leu Tyr Tyr Ser Ala Thr Lys 

915 920 925 

Asn Gly Ile Leu Tyr Arg Val Gly Asp Gly Val Tyr Leu Pro Pro Glu 

930 935 940 

Ala Phe Thr Phe Asn Ile Lys Leu Ser Ser Pro Val Lys Arg Pro Arg 
945 950 955 960 

Lys Glu Pro Val Asp Glu Asp Leu Tyr Pro Glu His Tyr Arg Lys Tyr 

965 970 975 

Ser Asp Tyr Ile Lys Gly Ser Asn Leu Asp Ala Pro Glu Pro Tyr Arg 

980 985 990 

Ile Gly Arg Ile Lys Glu Ile Phe Cys Pro Lys Lys Ser Asn Gly Arg 
995 1000 1005 

Pro Asn Glu Thr Asp Ile Lys Ile Arg Val Asn Lys Phe Tyr Arg Pro 

1010 1015 1020 

Glu Asn Thr His Lys Ser Thr Pro Ala Ser Tyr His Ala Asp Ile Asn 
1025 1030 1035 1040 

Leu Leu Tyr Trp Ser Asp Glu Glu Ala Val Val Asp Phe Lys Ala Val 

1045 1050 1055 

Gln Gly Arg Cys Thr Val Glu Tyr Gly Glu Asp Leu Pro Glu Cys Val 

1060 1065 1070 

Gln Val Tyr Ser Met Gly Gly Pro Asn Arg Phe Tyr Phe Leu Glu Ala 

1075 1080 1085 

Tyr Asn Ala Lys Ser Lys Ser Phe Glu Asp Pro Pro Asn His Ala Arg 

1090 1095 1100 

Ser Pro Gly Asn Lys Gly Lys Gly Lys Gly Lys Gly Lys Gly Lys Pro 
1105 1110 1115 1120 

Lys Ser Gln Ala Cys Glu Pro Ser Glu Pro Glu Ile Glu Ile Lys Leu 

1125 1130 1135 

Pro Lys Leu Arg Thr Leu Asp Val Phe Ser Gly Cys Gly Gly Leu Ser 

1140 1145 1150 

Glu Gly Phe His Gln Ala Gly Ile Ser Asp Thr Leu Trp Ala Ile Glu 

1155 1160 1165 

Met Trp Asp Pro Ala Ala Gln Ala Phe Arg Leu Asn Asn Pro Gly Ser 

1170 1175 1180 

Thr Val Phe Thr Glu Asp Cys Asn Ile Leu Leu Lys Leu Val Met Ala 
1185 1190 1195 1200 

Gly Glu Thr Thr Asn Ser Arg Gly Gln Arg Leu Pro Gln Lys Gly Asp 

1205 1210 1215 

Val Glu Met Leu Cys Gly Gly Pro Pro Cys Gln Gly Phe Ser Gly Met 

1220 1225 1230 

Asn Arg Phe Asn Ser Arg Thr Tyr Ser Lys Phe Lys Asn Ser Leu Val 

1235 1240 1245 

Val Ser Phe Leu Ser Tyr Cys Asp Tyr Tyr Arg Pro Arg Phe Phe Leu 

1250 1255 1260 

Leu Glu Asn Val Arg Asn Phe Val Ser Phe Lys Arg Ser Met Val Leu 
1265 1270 1275 1280 

Lys Leu Thr Leu Arg Cys Leu Val Arg Met Gly Tyr Gln Cys Thr Phe 

1285 1290 1295 

Gly Val Leu Gln Ala Gly Gln Tyr Gly Val Ala Gln Thr Arg Arg Arg 

1300 1305 1310 

Ala Ile Ile Leu Ala Ala Ala Pro Gly Glu Lys Leu Pro Leu Phe Pro 

1315 1320 1325 

Glu Pro Leu His Val Phe Ala Pro Arg Ala Cys Gln Leu Ser Val Val 

1330 1335 1340 

Val Asp Asp Lys Lys Phe Val Ser Asn Ile Thr Arg Leu Ser Ser Gly 
1345 1350 1355 1360 

Pro Phe Arg Thr Ile Thr Val Arg Asp Thr Met Ser Asp Leu Pro Glu 

1365 1370 1375 

Val Arg Asn Gly Ala Ser Ala Leu Glu Ile Ser Tyr Asn Gly Glu Pro 

1380 1385 1390 

Gln Ser Trp Phe Gln Arg Gln Leu Arg Gly Ala Gln Tyr Gln Pro Ile 

1395 1400 1405 

Leu Arg Asp His Ile Cys Lys Asp Met Ser Ala Leu Val Ala Ala Arg 

1410 1415 1420 

Met Arg His Ile Pro Leu Ala Pro Gly Ser Asp Trp Arg Asp Leu Pro 
1425 1430 1435 1440 

Asn Ile Glu Val Arg Leu Ser Asp Gly Thr Met Ala Arg Lys Leu Arg 

1445 1450 1455 

Tyr Thr His His Asp Arg Lys Asn Gly Arg Ser Ser Ser Gly Ala Leu 

1460 1465 1470 

Arg Gly Val Cys Ser Cys Val Glu Ala Gly Lys Ala Cys Asp Pro Ala 
1475 1480 1485 
Ala Arg Gln Phe Asn Thr Leu Ile Pro Trp Cys Leu Pro His Thr Gly 

1490 1495 1500 

Asn Arg His Asn His Trp Ala Gly Leu Tyr Gly Arg Leu Glu Trp Asp 
1505 1510 1515 1520 

Gly Phe Phe Ser Thr Thr Val Thr Asn Pro Glu Pro Met Gly Lys Gln 

1525 1530 1535 

Gly Arg Val Leu His Pro Glu Gln His Arg Val Val Ser Val Arg Glu 

1540 1545 1550 

Cys Ala Arg Ser Gln Gly Phe Pro Asp Thr Tyr Arg Leu Phe Gly Asn 

1555 1560 1565 

Ile Leu Asp Lys His Arg Gln Val Gly Asn Ala Val Pro Pro Pro Leu 

1570 1575 1580 

Ala Lys Ala Ile Gly Leu Glu Ile Lys Leu Cys Met Leu Ala Lys Ala 
1585 1590 1595 1600 

Arg Glu Ser Ala Ser Ala Lys Ile Lys Glu Glu Glu Ala Ala Lys Asp 
1605 1610 1615 



<210> 506 
<211> 5434 
<212> DNA 

<213> Homo sapiens 
<400> 506 

cgtccgcgtg gggggggtgt gtgcccgcct tgcgcatgcg tgttccctgg gcatggccgg 60 
ctccgttcca tccttctgca cagggtatcg cctctctccg tttggtacat cccctcctcc 120 
cccacgcccg gactggggtg gtagacgcgc ctccgctcat cgcccctccc catcggtttc 180 
cgcgcgaaaa gccggggcgc ctgcgctgcc gccgccgcgt ctgctgaagc ctccgagatg 240 
ccggcgcgta ccgccccagc ccgggtgccc acactggccg tcccggccat ctcgctgccc 300 
gacgatgtcc gcaggcggct caaagatttg gaaagagaca gcttaacaga aaaggaatgt 360 
gtgaaggaga aattgaatct cttgcacgaa tttctgcaaa cagaaataaa gaatcagtta 420 
tgtgacttgg aaaccaaatt acgtaaagaa gaattatccg aggagggcta cctggctaaa 480 
gtcaaatccc ttttaaataa agatttgtcc ttggagaacg gtgctcatgc ttacaaccgg 540 
gaagtgaatg gacgtctaga aaacgggaac caagcaagaa gtgaagcccg tagagtggga 600 
atggcagatg ccaacagccc ccccaaaccc ctttccaaac ctcgcacgcc caggaggagc 660 
aagtccgatg gagaggctaa gcctgaacct tcacctagcc ccaggattac aaggaaaagc 720 
accaggcaaa ccaccatcac atctcatttt gcaaagggcc ctgccaaacg gaaacctcag 780 
gaagagtctg aaagagccaa atcggatgag tccatcaagg aagaagacaa agaccaggat 840 
gagaagagac gtagagttac atccagagaa cgagttgcta gaccgcttcc tgcagaagaa 900 
cctgaaagag caaaatcagg aacgcgcact gaaaaggaag aagaaagaga tgaaaaagaa 960 
gaaaagagac tccgaagtca aaccaaagaa ccaacaccca aacagaaact gaaggaggag 1020 
ccggacagag aagccagggc aggcgtgcag gctgacgagg acgaagatgg agacgagaaa 1080 
gatgagaaga agcacagaag tcaacccaaa gatctagctg ccaaacggag gcccgaagaa 1140 
aaagaacctg aaaaagtaaa tccacagatt tctgatgaaa aagacgagga tgaaaaggag 1200 
gagaagagac gcaaaacgac ccccaaagaa ccaacggaga aaaaaatggc tcgcgccaaa 1260 
acagtcatga actccaagac ccaccctccc aagtgcattc agtgcgggca gtacctggac 1320 
gaccctgacc tcaaatatgg gcagcaccca ccagacgcgg tggatgagcc acagatgctg 1380 
acaaatgaga agctgtccat ctttgatgcc aacgagtctg gctttgagag ttatgaggcg 1440 
cttccccagc acaaactgac ctgcttcagt gtgtactgta agcacggtca cctgtgtccc 1500 
atcgacaccg gcctcatcga gaagaatatc gaactcttct tttctggttc agcaaaacca 1560 
atctatgatg atgacccgtc tcttgaaggt ggtgttaatg gcaaaaatct tggccccata 1620 
aatgaatggt ggatcactgg ctttgatgga ggtgaaaagg ccctcatcgg cttcagcacc 1680 
tcatttgccg aatacattct gatggatccc agtcccgagt atgcgcccat atttgggctg 1740 
atgcaggaga agatctacat cagcaagatt gtggtggagt tcctgcagag caattccgac 1800 
tcgacctatg aggacctgat caacaagatc gagaccacgg ttcctccttc tggcctcaac 1860 
ttgaaccgct tcacagagga ctccctcctg cgacacgcgc agtttgtggt ggagcaggtg 1920 
gagagttatg acgaggccgg ggacagtgat gagcagccca tcttcctgac gccctgcatg 1980 
cgggacctga tcaagctggc tggggtcacg ctgggacaga ggcgagccca ggcgaggcgg 2040 
cagaccatca ggcattctac cagggagaag gacaggggac ccacgaaagc caccaccacc 2100 
aagctggtct accagatctt cgatactttc ttcgcagagc aaattgaaaa ggatgacaga 2160 
gaagacaagg agaacgcctt taagcgccgg cgatgtggcg tctgtgaggt gtgtcagcag 2220 
cctgagtgtg ggaaatgtaa agcctgcaag gacatggtta aatttggtgg cagtggacgg 2280 
agcaagcagg cttgccaaga gcggaggtgt cccaatatgg ccatgaagga ggcagatgac 2340 
gatgaggaag tcgatgataa catcccagag atgccgtcac ccaaaaaaat gcaccagggg 2400 
aagaagaaga aacagaacaa gaatcgcatc tcttgggtcg gagaagccgt caagactgat 2460 
gggaagaaga gttactataa gaaggtgtgc attgatgcgg aaaccctgga agtgggggac 2520 
tgtgtctctg ttattccaga tgattcctca aaaccgctgt atctagcaag ggtcacggcg 2580 
ctgtgggagg acagcagcaa cgggcagatg tttcacgccc actggttctg cgctgggaca 2640 
gacacagtcc tcggggccac gtcggaccct ctggagctgt tcttggtgga tgaatgtgag 2700 
gacatgcagc tttcatatat ccacagcaaa gtgaaagtca tctacaaagc cccctccgaa 2760 
aactgggcca tggagggagg catggatccc gagtccctgc tggaggggga cgacgggaag 2820 
acctacttct accagctgtg gtatgatcaa gactacgcga gattcgagtc ccctccaaaa 2880 
acccagccaa cagaggacaa caagttcaaa ttctgtgtga gctgtgcccg tctggctgag 2940 
atgaggcaaa aagaaatccc cagggtcctg gagcagctcg aggacctgga tagccgggtc 3000 
ctctactact cagccaccaa gaacggcatc ctgtaccgag ttggtgatgg tgtgtacctg 3060 
ccccctgagg ccttcacgtt caacatcaag ctgtccagtc ccgtgaaacg cccacggaag 3120 
gagcccgtgg atgaggacct gtacccagag cactaccgga aatactccga ctacatcaaa 3180 
ggcagcaacc tggatgcccc tgagccctac cgaattggcc ggatcaaaga gatcttctgt 3240 
cccaagaaga gcaacggcag gcccaatgag actgacatca aaatccgggt caacaagttc 3300 
tacaggcctg agaacaccca caagtccact ccagcgagct accacgcaga catcaacctg 3360 
ctctactgga gcgacgagga ggccgtggtg gacttcaagg ctgtgcaggg ccgctgcacc 3420 
gtggagtatg gggaggacct gcccgagtgc gtccaggtgt actccatggg cggccccaac 3480 
cgcttctact tcctcgaggc ctataatgca aagagcaaaa gctttgaaga tcctcccaac 3540 
catgcccgta gccctggaaa caaagggaag ggcaagggaa aagggaaggg caagcccaag 3600 
tcccaagcct gtgagccgag cgagccagag atagagatca agctgcccaa gctgcggacc 3660 
ctggatgtgt tttctggctg cggggggttg tcggagggat tccaccaagc aggcatctct 3720 
gacacgctgt gggccatcga gatgtgggac cctgcggccc aggcgttccg gctgaacaac 3780 
cccggctcca cagtgttcac agaggactgc aacatcctgc tgaagctggt catggctggg 3840 
gagaccacca actcccgcgg ccagcggctg ccccagaagg gagacgtgga gatgctgtgc 3900 
ggcgggccgc cctgccaggg cttcagcggc atgaaccgct tcaattcgcg cacctactcc 3960 
aagttcaaaa actctctggt ggtttccttc ctcagctact gcgactacta ccggccccgg 4020 
ttcttcctcc tggagaatgt caggaacttt gtctccttca agcgctccat ggtcctgaag 4080 
ctcaccctcc gctgcctggt ccgcatgggc tatcagtgca ccttcggcgt gctgcaggcc 4140 
ggtcagtacg gcgtggccca gactaggagg cgggccatca tcctggccgc ggcccctgga 4200 
gagaagctcc ctctgttccc ggagccactg cacgtgtttg ctccccgggc ctgccagctg 4260 
agcgtggtgg tggatgacaa gaagtttgtg agcaacataa ccaggttgag ctcgggtcct 4320 
ttccggacca tcacggtgcg agacacgatg tccgacctgc cggaggtgcg gaatggagcc 4380 
tcggcactgg agatctccta caacggggag cctcagtcct ggttccagag gcagctccgg 4440 
ggcgcacagt accagcccat cctcagggac cacatctgta aggacatgag tgcattggtg 4500 
gctgcccgca tgcggcacat ccccttggcc ccagggtcag actggcgcga tctgcccaac 4560 
atcgaggtgc ggctctcaga cggcaccatg gccaggaagc tgcggtatac ccaccatgac 4620 
aggaagaacg gccgcagcag ctctggggcc ctccgtgggg tctgctcctg cgtggaagcc 4680 
ggcaaagcct gcgaccccgc agccaggcag ttcaacaccc tcatcccctg gtgcctgccc 4740 
cacaccggga accggcacaa ccactgggct ggcctctatg gaaggctcga gtgggacggc 4800 
ttcttcagca caaccgtcac caaccccgag cccatgggca agcagggccg cgtgctccac 4860 
ccagagcagc accgtgtggt gagcgtgcgg gagtgtgccc gctcccaggg cttccctgac 4920 
acctaccggc tcttcggcaa catcctggac aagcaccggc aggtgggcaa tgccgtgcca 4980 
ccgcccctgg ccaaagccat tggcttggag atcaagcttt gtatgttggc caaagcccga 5040 
gagagtgcct cagctaaaat aaaggaggag gaagctgcta aggactagtt ctgccctccc 5100 
gtcacccctg tttctggcac caggaatccc caacatgcac tgatgttgtg tttttaacat 5160 
gtcaatctgt ccgttcacat gtgtggtaca tggtgtttgt ggccttggct gacatgaagc 5220 
tgttgtgtga ggttcgctta tcaactaatg atttagtgat caaattgtgc agtactttgt 5280 
gcattctgga ttttaaaagt tttttattat gcattatatc aaatctacca ctgtatgagt 5340 
ggaaattaag actttatgta gtttttatat gttgtaatat ttcttcaaat aaatctctcc 5400 
tataaaccaa aaaaaaaaaa aaaaaaaaaa aaaa 5434 

<210> 507 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<400> 507 
Leu 
Pro 
Pro 
1 
10 
Val 
Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Val 


Ala 


Ala 


Gly Arg Arg Ala 








20 










25 










30 






Ala Gly Ala Ser Val Ala Thr Glu Leu Arg Cys Gln Cys Leu Gln Thr 

35 40 45 

Leu Gln Gly Ile His Pro Lys Asn Ile Gln Ser Val Asn Val Lys Ser 

50 55 60 

Pro Gly Pro His Cys Ala Gln Thr Glu Val Ile Ala Thr Leu Lys Asn 
65 70 75 80 

Gly Arg Lys Ala Cys Leu Asn Pro Ala Ser Pro Ile Val Lys Lys Ile 

85 90 95 

Ile Glu Lys Met Leu Asn Ser Asp Lys Ser Asn 

100 105 













<210> 508 

<211> 1103 

<212> DNA 

<213> Homo sapiens 

<400> 508 

cacagagccc gggccgcagg cacctcctcg ccagctcttc cgctcctctc acagccgcca 60 
gacccgcctg ctgagcccca tggcccgcgc tgctctctcc gccgccccca gcaatccccg 120 
gctcctgcga gtggcactgc tgctcctgct cctggtagcc gctggccggc gcgcagcagg 180 
agcgtccgtg gccactgaac tgcgctgcca gtgcttgcag accctgcagg gaattcaccc 240 
caagaacatc caaagtgtga acgtgaagtc ccccggaccc cactgcgccc aaaccgaagt 300 
catagccaca ctcaagaatg ggcggaaagc ttgcctcaat cctgcatccc ccatagttaa 360 
gaaaatcatc gaaaagatgc tgaacagtga caaatccaac tgaccagaag ggaggaggaa 420 
gctcactggt ggctgttcct gaaggaggcc ctgcccttat aggaacagaa gaggaaagag 480 
agacacagct gcagaggcca cctggattgt gcctaatgtg tttgagcatc gcttaggaga 540 
agtcttctat ttatttattt attcattagt tttgaagatt ctatgttaat attttaggtg 600 
taaaataatt aagggtatga ttaactctac ctgcacactg tcctattata ttcattcttt 660 
ttgaaatgtc aaccccaagt tagttcaatc tggattcata tttaatttga aggtagaatg 720 
ttttcaaatg ttctccagtc attatgttaa tatttctgag gagcctgcaa catgccagcc 780 
actgtgatag aggctggcgg atccaagcaa atggccaatg agatcattgt gaaggcaggg 840 
gaatgtatgt gcacatctgt tttgtaactg tttagatgaa tgtcagttgt tatttattga 900 
aatgatttca cagtgtgtgg tcaacatttc tcatgttgaa actttaagaa ctaaaatgtt 960 
ctaaatatcc cttggacatt ttatgtcttt cttgtaaggc atactgcctt gtttaatggt 1020 
agttttacag tgtttctggc ttagaacaaa ggggcttaat tattgatgtt ttcatagaga 1080 
atataaaaat aaagcactta tag 1103 

<210> 509 
<211> 107 
<212> PRT 
<213> Homo sapiens 



<400> 509 



Met Ala Arg Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu 

1 5 10 15 

Arg Val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala 

20 25 30 

Ala Gly Ala Pro Leu Ala Thr Glu Leu Arg Cys Gln Cys Leu Gln Thr 

35 40 45 

Leu Gln Gly Ile His Leu Lys Asn Ile Gln Ser Val Lys Val Lys Ser 

50 55 60 

Pro Gly Pro His Cys Ala Gln Thr Glu Val Ile Ala Thr Leu Lys Asn 
65 70 75 80 

Gly Gln Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Lys Lys Ile 

85 90 95 

Ile Glu Lys Met Leu Lys Asn Gly Lys Ser Asn 

100 105 



<210> 510 

<211> 1110 

<212> DNA 

<213> Homo sapiens 



<400> 510 

gacagagccc gggccacgga gctccttgcc agctctcctc ctcgcacagc cgctcgaacc 60 
gcctgctgag ccccatggcc cgcgccacgc tctccgccgc ccccagcaat ccccggctcc 120 
tgcgggtggc gctgctgctc ctgctcctgg tggccgccag ccggcgcgca gcaggagcgc 180 
ccctggccac tgaactgcgc tgccagtgct tgcagaccct gcagggaatt cacctcaaga 240 
acatccaaag tgtgaaggtg aagtcccccg gaccccactg cgcccaaacc gaagtcatag 300 
ccacactcaa gaatgggcag aaagcttgtc tcaaccccgc atcgcccatg gttaagaaaa 360 
tcatcgaaaa gatgctgaaa aatggcaaat ccaactgacc agaaggaagg aggaagctta 420 
ttggtggctg ttcctgaagg aggccctgcc ttacaggaac agaagaggaa agagagacac 480 
agctgcagag gccacctggc ttgcgcctaa tgtgtttgag catacttagg agaagtcttc 540 
tatttattta tttatttatt tatttgtttg ttttagaaga ttctatgtta atattttatg 600 
tgtaaaataa ggttatgatt gaatctactt gcacactctc ccattatatt tattgtttat 660 
tttaggtcaa acccaagtta gttcaatcct gattcatatt taatttgaag atagaaggtt 720 
tgcagatatt ctctagtcat ttgttaatat ttcttcgtga tgacatatca catgtcagcc 780 
actgtgatag aggctgagga atccaagaaa atggccagta agatcaatgt gacggcaggg 840 
aaatgtatgt gtgtctattt tgtaactgta aagatgaatg tcagttgtta tttattgaaa 900 
tgatttcaca gtgtgtggtc aacatttctc atgttgaagc tttaagaact aaaatgttct 960 
aaatatccct tggcatttta tgtctttctt gtaagatact gccttgttta atgttaatta 1020 
tgcagtgttt ccctctgtgt tagagcagag aggtttcgat atttattgat gttttcacaa 1080 
agaacaggaa aataaaatat ttaaaaatat 1110 



<210> 511 

<211> 99 

<212> PRT 

<213> Homo sapiens 



<400> 511 



Met Thr Ser Lys Leu Ala Val Ala Leu Leu Ala Ala Phe Leu Ile Ser 

1 5 10 15 

Ala Ala Leu Cys Glu Gly Ala Val Leu Pro Arg Ser Ala Lys Glu Leu 

20 25 30 

Arg Cys Gln Cys Ile Lys Thr Tyr Ser Lys Pro Phe His Pro Lys Phe 

35 40 45 

Ile Lys Glu Leu Arg Val Ile Glu Ser Gly Pro His Cys Ala Asn Thr 

50 55 60 

Glu Ile Ile Val Lys Leu Ser Asp Gly Arg Glu Leu Cys Leu Asp Pro 
65 70 75 80 

Lys Glu Asn Trp Val Gln Arg Val Val Glu Lys Phe Leu Lys Arg Ala 

85 90 95 

Glu Asn Ser 



<210> 512 

<211> 1666 

<212> DNA 

<213> Homo sapiens 

<400> 512 

ctccataagg cacaaacttt cagagacagc agagcacaca agcttctagg acaagagcca 60 
ggaagaaacc accggaagga accatctcac tgtgtgtaaa catgacttcc aagctggccg 120 
tggctctctt ggcagccttc ctgatttctg cagctctgtg tgaaggtgca gttttgccaa 180 
ggagtgctaa agaacttaga tgtcagtgca taaagacata ctccaaacct ttccacccca 240 
aatttatcaa agaactgaga gtgattgaga gtggaccaca ctgcgccaac acagaaatta 300 
ttgtaaagct ttctgatgga agagagctct gtctggaccc caaggaaaac tgggtgcaga 360 
gggttgtgga gaagtttttg aagagggctg agaattcata aaaaaattca ttctctgtgg 420 
tatccaagaa tcagtgaaga tgccagtgaa acttcaagca aatctacttc aacacttcat 480 
gtattgtgtg ggtctgttgt agggttgcca gatgcaatac aagattcctg gttaaatttg 540 
aatttcagta aacaatgaat agtttttcat tgtaccatga aatatccaga acatacttat 600 
atgtaaagta ttatttattt gaatctacaa aaaacaacaa ataattttta aatataagga 660 
ttttcctaga tattgcacgg gagaatatac aaatagcaaa attgaggcca agggccaaga 720 
gaatatccga actttaattt caggaattga atgggtttgc tagaatgtga tatttgaagc 780 
atcacataaa aatgatggga caataaattt tgccataaag tcaaatttag ctggaaatcc 840 
tggatttttt tctgttaaat ctggcaaccc tagtctgcta gccaggatcc acaagtcctt 900 
gttccactgt gccttggttt ctcctttatt tctaagtgga aaaagtatta gccaccatct 960 
tacctcacag tgatgttgtg aggacatgtg gaagcacttt aagttttttc atcataacat 1020 
aaattatttt caagtgtaac ttattaacct atttattatt tatgtattta tttaagcatc 1080 
aaatatttgt gcaagaattt ggaaaaatag aagatgaatc attgattgaa tagttataaa 1140 
gatgttatag taaatttatt ttattttaga tattaaatga tgttttatta gataaatttc 1200 
aatcagggtt tttagattaa acaaacaaac aattgggtac ccagttaaat tttcatttca 1260 
gataaacaac aaataatttt ttagtataag tacattattg tttatctgaa attttaattg 1320 
aactaacaat cctagtttga tactcccagt cttgtcattg ccagctgtgt tggtagtgct 1380 
gtgttgaatt acggaataat gagttagaac tattaaaaca gccaaaactc cacagtcaat 1440 
attagtaatt tcttgctggt tgaaacttgt ttattatgta caaatagatt cttataatat 1500 
tatttaaatg actgcatttt taaatacaag gctttatatt tttaacttta agatgttttt 1560 
atgtgctctc caaatttttt ttactgtttc tgattgtatg gaaatataaa agtaaatatg 1620 
aaacatttaa aatataattt gttgtcaaag taaaaaaaaa aaaaaa 1666 

<210> 513 
<211> 106 
<212> PRT 

<213> Homo sapiens 
<400> 513 



Met Ala His Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu 

1 5 10 15 

Arg Val Ala Leu Leu Leu Leu Leu Leu Val Gly Ser Arg Arg Ala Ala 

20 25 30 

Gly Ala Ser Val Val Thr Glu Leu Arg Cys Gln Cys Leu Gln Thr Leu 

35 40 45 

Gln Gly Ile His Leu Lys Asn Ile Gln Ser Val Asn Val Arg Ser Pro 

50 55 60 

Gly Pro His Cys Ala Gln Thr Glu Val Ile Ala Thr Leu Lys Asn Gly 
65 70 75 80 

Lys Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Gln Lys Ile Ile 

85 90 95 

Glu Lys Ile Leu Asn Lys Gly Ser Thr Asn 

100 105 















<210> 514 
<211> 1064 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (27) ... (27) 
<223> N=A, T, G, or C 

<400> 514 

cacagccggg tcgcaggcac ctccccngcc agctctcccg cattctgcac agcttcccga 60 
cgcgtctgct gagccccatg gcccacgcca cgctctccgc cgcccccagc aatccccggc 120 
tcctgcgggt ggcgctgctg ctcctgctcc tggtgggcag ccggcgcgca gcaggagcgt 180 
ccgtggtcac tgaactgcgc tgccagtgct tgcagacact gcagggaatt cacctcaaga 240 
acatccaaag tgtgaatgta aggtcccccg gaccccactg cgcccaaacc gaagtcatag 300 
ccacactcaa gaatgggaag aaagcttgtc tcaaccccgc atcccccatg gttcagaaaa 360 
tcatcgaaaa gatactgaac aaggggagca ccaactgaca ggagagaagt aagaagctta 420 
tcagcgtatc attgacactt cctgcagggt ggtccctgcc cttaccagag ctgaaaatga 480 
aaaagagaac agcagctttc tagggacagc tggaaaggga cttaatgtgt ttgactattt 540 
cttacgaggg ttctacttat ttatgtattt atttttgaaa gcttgtattt taatatttta 600 
catgctgtta tttaaagatg tgagtgtgtt tcatcaaaca tagctcagtc ctgattattt 660 
aattggaata tgatgggttt taaatgtgtc attaaactaa tatttagtgg gagaccataa 720 
tgtgtcagcc accttgataa atgacagggt ggggaactgg agggtngggg gattgaaatg 780 
caagcaatta gtggatcact gttagggtaa gggaatgtat gtacacatct attttttata 840 
cttttttttt taaaaaagaa tgtcagttgt tatttattca aattatctca cattatgtgt 900 
tcaacatttt tatgctgaag tttcccttag acattttatg tcttgcttgt agggcataat 960 
gccttgttta atgtccattc tgcagcgttt ctctttccct tggaaaagag aatttatcat 1020 
tactgttaca tttgtacaaa tgacatgata ataaaagttt tatg 1064 

<210> 515 

<211> 99 

<212> PRT 

<213> Homo sapiens 

<400> 515 

Met Lys Val Ser Ala Ala Leu Leu Cys Leu Leu Leu Ile Ala Ala Thr 

1 5 10 15 

Phe Ile Pro Gln Gly Leu Ala Gln Pro Asp Ala Ile Asn Ala Pro Val 

20 25 30 

Thr Cys Cys Tyr Asn Phe Thr Asn Arg Lys Ile Ser Val Gln Arg Leu 

35 40 45 

Ala Ser Tyr Arg Arg Ile Thr Ser Ser Lys Cys Pro Lys Glu Ala Val 

50 55 60 

Ile Phe Lys Thr Ile Val Ala Lys Glu Ile Cys Ala Asp Pro Lys Gln 
65 70 75 80 

Lys Trp Val Gln Asp Ser Met Asp His Leu Asp Lys Gln Thr Gln Thr 
85 90 95 

Pro Lys Thr 



<210> 516 
<211> 757 
<212> DNA 
<213> Homo sapiens 



<400> 516 

ggaaccgaga ggctgagact aacccagaaa catccaattc tcaaactgaa gctcgcactc 60 
tcgcctccag catgaaagtc tctgccgccc ttctgtgcct gctgctcata gcagccacct 120 
tcattcccca agggctcgct cagccagatg caatcaatgc cccagtcacc tgctgttata 180 
acttcaccaa taggaagatc tcagtgcaga ggctcgcgag ctatagaaga atcaccagca 240 
gcaagtgtcc caaagaagct gtgatcttca agaccattgt ggccaaggag atctgtgctg 300 
accccaagca gaagtgggtt caggattcca tggaccacct ggacaagcaa acccaaactc 360 
cgaagacttg aacactcact ccacaaccca agaatctgca gctaacttat tttcccctag 420 
ctttccccag acaccctgtt ttattttatt ataatgaatt ttgtttgttg atgtgaaaca 480 
ttatgcctta agtaatgtta attcttattt aagttattga tgttttaagt ttatctttca 540 
tggtactagt gttttttaga tacagagact tggggaaatt gcttttcctc ttgaaccaca 600 
gttctacccc tgggatgttt tgagggtctt tgcaagaatc attaatacaa agaatttttt 660 
ttaacattcc aatgcattgc taaaatatta ttgtggaaat gaatattttg taactattac 720 
accaaataaa tatatttttg tacaaaaaaa aaaaaaa 757 

<210> 517 
<211> 415 
<212> PRT 
<213> Homo sapiens 
<400> 517 

Met Glu Leu Arg Lys Tyr Gly Pro Gly Arg Leu Ala Gly Thr Val Ile 

1 5 10 15 

Gly Gly Ala Ala Gln Ser Lys Ser Gln Thr Lys Ser Asp Ser Ile Thr 

20 25 30 

Lys Glu Phe Leu Pro Gly Leu Tyr Thr Ala Pro Ser Ser Pro Phe Pro 

35 40 45 

Pro Ser Gln Val Ser Asp His Gln Val Leu Asn Asp Ala Glu Val Ala 

50 55 60 

Ala Leu Leu Glu Asn Phe Ser Ser Ser Tyr Asp Tyr Gly Glu Asn Glu 
65 70 75 80 

Ser Asp Ser Cys Cys Thr Ser Pro Pro Cys Pro Gln Asp Phe Ser Leu 

85 90 95 

Ile Asn Phe Asp Arg Ala Phe Leu Pro Ala Leu Tyr Ser Leu Leu Phe 

100 105 110 

Leu Leu Gly Leu Leu Gly Asn Gly Ala Val Ala Ala Val Leu Leu Ser 

115 120 125 

Arg Arg Thr Ala Leu Ser Ser Thr Asp Thr Phe Leu Leu His Leu Ala 

130 135 140 

Val Ala Asp Thr Leu Leu Val Leu Thr Leu Pro Leu Trp Ala Val Asp 

145 150 155 160 

Ala Ala Val Gln Trp Val Phe Gly Ser Gly Leu Cys Lys Val Ala Gly 

165 170 175 

Ala Leu Phe Asn Ile Asn Phe Tyr Ala Gly Ala Leu Leu Leu Ala Cys 

180 185 190 

Ile Ser Phe Asp Arg Tyr Leu Asn Ile Val His Ala Thr Gln Leu Tyr 

195 200 205 

Arg Arg Gly Pro Pro Ala Arg Val Thr Leu Thr Cys Leu Ala Val Trp 

210 215 220 

Gly Leu Cys Leu Leu Phe Ala Leu Pro Asp Phe Ile Phe Leu Ser Ala 

225 230 235 240 

His His Asp Glu Arg Leu Asn Ala Thr His Cys Gln Tyr Asn Phe Pro 

245 250 255 

Gln Val Gly Arg Thr Ala Leu Arg Val Leu Gln Leu Val Ala Gly Phe 

260 265 270 

Leu Leu Pro Leu Leu Val Met Ala Tyr Cys Tyr Ala His Ile Leu Ala 

275 280 285 

Val Leu Leu Val Ser Arg Gly Gln Arg Arg Leu Arg Ala Met Arg Leu 

290 295 300 

Val Val Val Val Val Val Ala Phe Ala Leu Cys Trp Thr Pro Tyr His 

305 310 315 320 

Leu Val Val Leu Val Asp Ile Leu Met Asp Leu Gly Ala Leu Ala Arg 

325 330 335 

Asn Cys Gly Arg Glu Ser Arg Val Asp Ala Lys Ser Val Thr Ser Gly 

340 345 350 

Leu Gly Tyr Met His Cys Cys Leu Asn Pro Leu Leu Tyr Ala Phe Val 

355 360 365 

Gly Val Lys Phe Arg Glu Arg Met Trp Met Leu Leu Leu Arg Leu Gly 

370 375 380 

Cys Pro Asn Gln Arg Gly Leu Gln Arg Gln Pro Ser Ser Ser Arg Arg 

385 390 395 400 

Asp Ser Ser Trp Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu 

405 410 415 



<210> 518 

<211> 1703 

<212> DNA 

<213> Homo sapiens 
<400> 518 

atggagttga ggaagtacgg ccctggaaga ctggcgggga cagttatagg aggagctgct 60 
cagagtaaat cacagactaa atcagactca atcacaaaag agttcctgcc aggcctttac 120 
acagcccctt cctccccgtt cccgccctca caggtgagtg accaccaagt gctaaatgac 180 
gccgaggttg ccgccctcct ggagaacttc agctcttcct atgactatgg agaaaacgag 240 
agtgactcgt gctgtacctc cccgccctgc ccacaggact tcagcctgaa cttcgaccgg 300 
gccttcctgc cagccctcta cagcctcctc tttctgctgg ggctgctggg caacggcgcg 360 
gtggcagccg tgctgctgag ccggcggaca gccctgagca gcaccgacac cttcctgctc 420 
cacctagctg tagcagacac gctgctggtg ctgacactgc cgctctgggc agtggacgct 480 
gccgtccagt gggtctttgg ctctggcctc tgcaaagtgg caggtgccct cttcaacatc 540 
aacttctacg caggagccct cctgctggcc tgcatcagct ttgaccgcta cctgaacata 600 
gttcatgcca cccagctcta ccgccggggg cccccggccc gcgtgaccct cacctgcctg 660 
gctgtctggg ggctctgcct gcttttcgcc ctcccagact tcatcttcct gtcggcccac 720 
cacgacgagc gcctcaacgc cacccactgc caatacaact tcccacaggt gggccgcacg 780 
gctctgcggg tgctgcagct ggtggctggc tttctgctgc ccctgctggt catggcctac 840 
tgctatgccc acatcctggc cgtgctgctg gtttccaggg gccagcggcg cctgcgggcc 900 
atgcggctgg tggtggtggt cgtggtggcc tttgccctct gctggacccc ctatcacctg 960 
gtggtgctgg tggacatcct catggacctg ggcgctttgg cccgcaactg tggccgagaa 1020 
agcagggtag acgtggccaa gtcggtcacc tcaggcctgg gctacatgca ctgctgcctc 1080 
aacccgctgc tctatgcctt tgtaggggtc aagttccggg agcggatgtg gatgctgctc 1140 
ttgcgcctgg gctgccccaa ccagagaggg ctccagaggc agccatcgtc ttcccgccgg 1200 
gattcatcct ggtctgagac ctcagaggcc tcctactcgg gcttgtgagg ccggaatccg 1260 
ggctcccctt tcgcccacag tctgacttcc ccgcattcca ggctcctccc tccctctgcc 1320 
ggctctggct ctccccaata tcctcgctcc cgggactcac tggcagcccc agcaccacca 1380 
ggtctcccgg gaagccaccc tcccagctct gaggactgca ccattgctgc tccttagctg 1440 
ccaagcccca tcctgccgcc cgaggtggct gcctggagcc ccactgccct tctcatttgg 1500 
aaactaaaac ttcatcttcc ccaagtgcgg ggagtacaag gcatggcgta gagggtgctg 1560 
ccccatgaag ccacagccca ggcctccagc tcagcagtga ctgtggccat ggtccccaag 1620 
acctctatat ttgctctttt atttttatgt ctaaaatcct gcttaaaact tttcaataaa 1680 
caagatcgtc aggaccaaaa aaa 1703 



<210> 519 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Agilent oligonucleotide 
<400> 519 

gtactttgtg tttaatatat ctgggtgatg gatcacaaca catcaataaa ctgacttacc 60 



<210> 520 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic sequence 
<400> 520 

atttacttgc atatgtaaac cattgctgtg ccattcaatg tttgatgcat aattggacct 60 



<210> 521 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic sequence 
<400> 521 

agatggagga agcatctgag tttgagacca tggctgttac agggatcatg taaacttgct 60 

<210> 522 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic sequence 
<400> 522 

tggagattta aacactgagg tttctgttca aactgtgagt tctgttcttt gtgagaaatt 60 

<210> 523 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic sequence 
<400> 523 

tgtacctgaa aaatgggaag agtcgggacc acagtttatt accaattctg aggaagtccg 60 

<210> 524 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic sequence 
<400> 524 

atccatttac tcagctggag aggagacatc aagaacatgc cagacacatt tcttttggtg 60 

<210> 525 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic sequence 
<400> 525 

ctggatgttg aattgccacc tgtttgctgt gacatagata tttaaatttc ttagtgcttc 60 

<210> 526 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> synthetic sequence 
<400> 526 

ctggaaacct ctggaggtca tctcggctgt tcctgagaaa taaaaagcct gtcatttcaa 60 

<210> 527 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic sequence 
<400> 527 

tgttgcagtc tcctttaagc attctgttga ccctggatgt tgaattgcca cctgtttgct 60 



<210> 528 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic sequence 
<400> 528 

actatttctt cagagtttgt catatactgc ttgtcatctg catgtctact cagcatttga 60 



<210> 529 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic sequence 
<400> 529 

agactgttaa aacaaaaagt tctgttcctg cctcagatga cgcctatcca gaaatagaaa 60 



<210> 530 
<211> 1863 
<212> DNA 

<213> Homo sapiens 



<400> 530 

gtgcagcctg atgcgcaacg tggggactca ggcgcgctgg gcggcaggag ttgcttccgg 60
ccgtgttggt ggtctgaatt gagaagccgc gactaaggga agatggagac aatactggag 120
cagcagcggc gctatcatga ggagaaggaa cggctcatgg acgtcatggc taaagagatg 180
ctcaccaaga agtccacgct ccgggaccag atcaattctg atcaccgcac tcgggccatg 240
caagataggt atatggaggt cagtgggaac ctgagggatt tgtatgatga taaggatgga 300
ttacgaaagg aggagctcaa tgccatttca ggacccaatg agtttgctga attctataat 360
agactcaagc aaataaagga attccaccgg aagcacccaa atgagatctg tgtgccaatg 420
tcagtggaat ttgaggaact cctgaaggct cgagagaatc caagtgaaga ggcacaaaac 480
ttggtggagt tcacagatga ggagggatat ggtcgttatc tcgatctcca tgactgttac 540
ctcaagtaca ttaacctgaa ggcatctgag aagctggatt atatcacata cctgtccatc 600
tttgaccaat tatttgacat tcctaaagaa aggaagaatg cagagtataa gagataccta 660
gagatgctgc ttgagtacct tcaggattac acagatagag tgaagcctct ccaagatcag 720
aatgaacttt ttgggaagat tcaggctgag tttgagaaga aatgggagaa tgggaccttt 780
cctggatggc cgaaagagac aagcagtgcc ctgacccatg ctggagccca tcttgacctc 840
tctgcattct cctcctggga ggagttggct tctctgggtt tggacagatt gaaatctgct 900
ctcttagctt taggcttgaa atgtggcggg accctagaag agcgagccca gagactattc 960
agtaccaaag gaaagtccct ggagtcactt gatacctctt tgtttgccaa aaatcccaag 1020
tcaaagggca ccaagcgaga cactgaaagg aacaaagaca ttgcttttct agaagcccag 1080
atctatgaat atgtagagat tctcggggaa cagcgacatc tcactcatga aaatgtacag 1140
cgcaagcaag ccaggacagg agaagagcga gaagaagagg aagaagagca gatcagtgag 1200
agtgagagtg aagatgaaga gaacgagatc atttacaacc ccaaaaacct gccacttggc 1260
tgggatggca aacctattcc ctactggctg tataagcttc atggcctaaa tatcaactac 1320
aactgtgaga tttgtggaaa ctacacctac cgagggccca aagccttcca gcgacacttt 1380
gctgaatggc gtcatgctca tggcatgagg tgtttgggca tcccaaacac tgctcacttt 1440
gctaatgtga cacagattga agatgctgtc tccttgtggg ccaaactgaa attgcagaag 1500
gcttcagaac gatggcagcc tgacactgag gaagaatatg aagactcaag tgggaatgtt 1560
gtgaataaga agacatacga ggatctgaaa agacaaggac tgctctagtg ttgagggatg 1620
tagctcagct tttgggctag cccaggcttc cctaagatct gctttttcta tttctcccaa 1680
ccaaatcctc ttaaagaccc tttgctatgt agtctcatgg tctagcatgc atcttgtaga 1740
aacaaggcat gctggcagat tgcagggttg agatgtgttt tatctgtttt atattttaaa 1800
agattctgcc agaaaataaa accagacctt gttctaaaaa aaaaaaaaaa aaaaaaaaaa 1860
aaa 1863



<210> 531
<211> 501
<212> PRT

<213> Homo sapiens



<400> 531

Met Glu Thr Ile Leu Glu Gln Gln Arg Arg Tyr His Glu Glu Lys Glu
1 5 10 15

Arg Leu Met Asp Val Met Ala Lys Glu Met Leu Thr Lys Lys Ser Thr
20 25 30

Leu Arg Asp Gln Ile Asn Ser Asp His Arg Thr Arg Ala Met Gln Asp
35 40 45

Arg Tyr Met Glu Val Ser Gly Asn Leu Arg Asp Leu Tyr Asp Asp Lys
50 55 60

Asp Gly Leu Arg Lys Glu Glu Leu Asn Ala Ile Ser Gly Pro Asn Glu
65 70 75 80

Phe Ala Glu Phe Tyr Asn Arg Leu Lys Gln Ile Lys Glu Phe His Arg
85 90 95

Lys His Pro Asn Glu Ile Cys Val Pro Met Ser Val Glu Phe Glu Glu
100 105 110

Leu Leu Lys Ala Arg Glu Asn Pro Ser Glu Glu Ala Gln Asn Leu Val
115 120 125

Glu Phe Thr Asp Glu Glu Gly Tyr Gly Arg Tyr Leu Asp Leu His Asp
130 135 140

Cys Tyr Leu Lys Tyr Ile Asn Leu Lys Ala Ser Glu Lys Leu Asp Tyr
145 150 155 160

Ile Thr Tyr Leu Ser Ile Phe Asp Gln Leu Phe Asp Ile Pro Lys Glu
165 170 175

Arg Lys Asn Ala Glu Tyr Lys Arg Tyr Leu Glu Met Leu Leu Glu Tyr
180 185 190

Leu Gln Asp Tyr Thr Asp Arg Val Lys Pro Leu Gln Asp Gln Asn Glu
195 200 205

Leu Phe Gly Lys Ile Gln Ala Glu Phe Glu Lys Lys Trp Glu Asn Gly
210 215 220

Thr Phe Pro Gly Trp Pro Lys Glu Thr Ser Ser Ala Leu Thr His Ala
225 230 235 240

Gly Ala His Leu Asp Leu Ser Ala Phe Ser Ser Trp Glu Glu Leu Ala
245 250 255

Ser Leu Gly Leu Asp Arg Leu Lys Ser Ala Leu Leu Ala Leu Gly Leu
260 265 270

Lys Cys Gly Gly Thr Leu Glu Glu Arg Ala Gln Arg Leu Phe Ser Thr
275 280 285

Lys Gly Lys Ser Leu Glu Ser Leu Asp Thr Ser Leu Phe Ala Lys Asn
290 295 300

Pro Lys Ser Lys Gly Thr Lys Arg Asp Thr Glu Arg Asn Lys Asp Ile
305 310 315 320

Ala Phe Leu Glu Ala Gln Ile Tyr Glu Tyr Val Glu Ile Leu Gly Glu
325 330 335

Gln Arg His Leu Thr His Glu Asn Val Gln Arg Lys Gln Ala Arg Thr
340 345 350

Gly Glu Glu Arg Glu Glu Glu Glu Glu Glu Gln Ile Ser Glu Ser Glu
355 360 365

Ser Glu Asp Glu Glu Asn Glu Ile Ile Tyr Asn Pro Lys Asn Leu Pro
370 375 380

Leu Gly Trp Asp Gly Lys Pro Ile Pro Tyr Trp Leu Tyr Lys Leu His
385 390 395 400

Gly Leu Asn Ile Asn Tyr Asn Cys Glu Ile Cys Gly Asn Tyr Thr Tyr
405 410 415

Arg Gly Pro Lys Ala Phe Gln Arg His Phe Ala Glu Trp Arg His Ala
420 425 430

His Gly Met Arg Cys Leu Gly Ile Pro Asn Thr Ala His Phe Ala Asn
435 440 445

Val Thr Gln Ile Glu Asp Ala Val Ser Leu Trp Ala Lys Leu Lys Leu
450 455 460

Gln Lys Ala Ser Glu Arg Trp Gln Pro Asp Thr Glu Glu Glu Tyr Glu
465 470 475 480

Asp Ser Ser Gly Asn Val Val Asn Lys Lys Thr Tyr Glu Asp Leu Lys
485 490 495

Arg Gln Gly Leu Leu
500





<210> 532 
<211> 3326 
<212> DNA 

<213> Homo sapiens 
<400> 532 

ctgagcctcc cgtgccggcc ggccggccgg gggaacaggc gggcgctcgg ggggcgctcg 60
gggggcgggg ggagttccgg ttccggttct ttgtgcggct gcatcggcgg ctccgggaag 120
atggcggccc gggcgggttt ccagtctgtg gctccaagcg gcggcgccgg agcctcagga 180
ggggcgggcg cggctgctgc cttgggcccg ggcggaactc cggggcctcc tgtgcgaatg 240
ggcccggctc cgggtcaagg gctgtaccgc tccccgatgc ccggagcggc ctatccgaga 300
ccaggtatgt tgccaggcag ccgaatgaca cctcagggac cttccatggg accccctggc 360
tatgggggga acccttcagt ccgacctggc ctggcccagt cagggatgga tcagtcccgc 420
aagagacctg cccctcagca gatccagcag gtccagcagc aggcggtcca aaatcgaaac 480
cacaatgcaa agaaaaagaa gatggctgac aaaattctac ctcaaaggat tcgtgaactg 540
gtaccagaat cccaggccta tatggatctc ttggcttttg aaaggaaact ggaccagact 600
atcatgagga aacggctaga tatccaagag gccttgaaac gtcccatcaa gcaaaaacgg 660
aagctgcgaa ttttcatttc taacactttc aatccggcta agtcagatgc cgaggatggg 720
gaagggacgg tggcttcctg ggagcttcgg gtagaaggac ggctcctgga ggattcagcc 780
ttgtccaaat atgatgccac taaacaaaag aggaagttct cttccttttt taagtccttg 840
gtgattgaac tggacaaaga cctgtatggg ccagacaacc atctggtaga atggcacagg 900
accgccacta cccaggagac cgatggcttt caggtgaagc ggccgggaga cgtgaatgta 960
cggtgtactg tcctactgat gctggattac cagcctcccc agtttaaatt agacccccgc 1020
ctagctcgac tcctgggcat ccatacccag actcgtccag tgatcatcca agcactgtgg 1080
caatatatta agacacataa gctccaggac cctcacgagc gggagtttgt catctgtgac 1140
aagtacctgc agcagatctt tgagtctcaa cgtatgaagt tttcagagat ccctcagcgg 1200
ctccatgcct tgcttatgcc accagaacct atcatcatta atcatgtcat cagtgttgac 1260
ccgaatgatc agaaaaagac agcttgttat gacattgatg ttgaagtgga tgacaccttg 1320
aagacccaga tgaattcttt tctgctgtcc actgccagcc aacaggagat tgctactcta 1380
gacaacaaga tccatgagac aatagaaacc atcaaccagc tgaagactca gcgggagttc 1440
atgctgagct ttgccagaga ccctcagggt ttcatcaatg actggcttca gtcccagtgc 1500
agggacctca agacaatgac tgatgtggtg ggtaacccag aggaggagcg ccgagctgag 1560
ttctacttcc agccctgggc tcaggaggct gtgtgccgat acttctactc caaggtgcag 1620
cagagacgac aagaattaga gcaagccctg ggaatccgga atacataggg cctctcccac 1680
agccctgatt cgactgcacc aattcttgat ttgggccctg tgctgcctgc ctcatagtat 1740
ctgccttggt cttgcttggg gcgttccagg ggatgctgtt ggttcaagga caacaccaga 1800
atgaagaggg tctcacaaga cacctgttat cctcttcttt caccctatct cttcccaccc 1860
ccagcttccc tttgccccac aaagttccca tgtgcctgta ccctcccctg gtctacatag 1920
gacctctaga tagtgttaga gagagaacat gtagtggtaa tgagtgcttg gaatggattg 1980
ggcctcaggc caggtggtct tcaaggggac cagctaactg atcctgccct tcagagaccc 2040
aggagttggg agctttcgct ccttctccaa gactcaggcc tgtgggcact ctataagcta 2100
gttgatcttg gctctcctga taacagaatc caatttcctt ccttccctcc acaggtttgg 2160
aacaaactct cccttcactt gttgccctgt agcactacag aaaccctggt tcttgggctc 2220
cactgagccc caggtcagtc cccagccctc tgggttggcc tgctgtcagt gcttctctca 2280
ctccttagtt ggggtccaca tcagtattgg agttttgttc tttattgctc cctcccagac 2340
actccctgtg gctgcccttt gtgattccct cagatctgcc ctaatcccgg gcatttgggt 2400
gggggaatct tgcctttccc tttcagagcc ccagggatct catctgggga actgtcattg 2460
ccagcagagg ctgttccttc ctgctgtttg gagatgtgac tcattcattc actcactcca 2520
ccctgcctct gcatccctta atggagaaac gggcctaaaa ccaaacgggt aaaaagccct 2580
gggccatccc tgtcttcctg tcccttgtct gcccagttga cacctactgg tgacttctag 2640
ggcactgagg agtgaaagcg cctagggctg gagaatagcg ctgagttggg tttgtgactc 2700
ttccctctcc ctgcctcaca ggattgtgac tccccagccc ctgccctcaa agcttcagac 2760
ccctcaggta gcagcaggac cttgtgatct tggccccttg gatctgagat ggtttttgca 2820
tctttccagg agagcctcac attcttcttc caggttgtat cacccccgag ttagcatatc 2880
ccaggctcgc agactcaaca cagcaagggt gggagacagc tgggcacaaa gggggaattc 2940
cgttcagcat gggctctaaa cccacagaac tgacaaagcc cctgcttccc caccccctcc 3000
tcaggctcct gcgagcacac ccccaccccc aaatccctcc ctgttctaca ctggggacag 3060
cagaattttc tccccgtctt ccccttcctg ccattttccc tcccttgaaa ggttgacact 3120
ggacaacctt ggggcagctg agccctggcc gcctcctggc tggaaccatg agaaggaagc 3180
tcagtacttc ccacagtgtc cctgttgata actgttttta ttaactgaat tgtttttttc 3240
atggaccaaa cttttttttg tactgtcccc ttattgatgt tacccagttt taataaaaga 3300
atcttctgaa aaaaaaaaaa aaaaaa 3326



<210> 533 
<211> 515 
<212> PRT 

<213> Homo sapiens 
<400> 533 

Met Ala Ala Arg Ala Gly Phe Gln Ser Val Ala Pro Ser Gly Gly Ala
1 5 10 15

Gly Ala Ser Gly Gly Ala Gly Ala Ala Ala Ala Leu Gly Pro Gly Gly
20 25 30

Thr Pro Gly Pro Pro Val Arg Met Gly Pro Ala Pro Gly Gln Gly Leu
35 40 45

Tyr Arg Ser Pro Met Pro Gly Ala Ala Tyr Pro Arg Pro Gly Met Leu
50 55 60

Pro Gly Ser Arg Met Thr Pro Gln Gly Pro Ser Met Gly Pro Pro Gly
65 70 75 80

Tyr Gly Gly Asn Pro Ser Val Arg Pro Gly Leu Ala Gln Ser Gly Met
85 90 95

Asp Gln Ser Arg Lys Arg Pro Ala Pro Gln Gln Ile Gln Gln Val Gln
100 105 110

Gln Gln Ala Val Gln Asn Arg Asn His Asn Ala Lys Lys Lys Lys Met
115 120 125

Ala Asp Lys Ile Leu Pro Gln Arg Ile Arg Glu Leu Val Pro Glu Ser
130 135 140

Gln Ala Tyr Met Asp Leu Leu Ala Phe Glu Arg Lys Leu Asp Gln Thr
145 150 155 160

Ile Met Arg Lys Arg Leu Asp Ile Gln Glu Ala Leu Lys Arg Pro Ile
165 170 175

Lys Gln Lys Arg Lys Leu Arg Ile Phe Ile Ser Asn Thr Phe Asn Pro
180 185 190

Ala Lys Ser Asp Ala Glu Asp Gly Glu Gly Thr Val Ala Ser Trp Glu
195 200 205

Leu Arg Val Glu Gly Arg Leu Leu Glu Asp Ser Ala Leu Ser Lys Tyr
210 215 220

Asp Ala Thr Lys Gln Lys Arg Lys Phe Ser Ser Phe Phe Lys Ser Leu
225 230 235 240

Val Ile Glu Leu Asp Lys Asp Leu Tyr Gly Pro Asp Asn His Leu Val
245 250 255

Glu Trp His Arg Thr Ala Thr Thr Gln Glu Thr Asp Gly Phe Gln Val
260 265 270

Lys Arg Pro Gly Asp Val Asn Val Arg Cys Thr Val Leu Leu Met Leu
275 280 285

Asp Tyr Gln Pro Pro Gln Phe Lys Leu Asp Pro Arg Leu Ala Arg Leu
290 295 300

Leu Gly Ile His Thr Gln Thr Arg Pro Val Ile Ile Gln Ala Leu Trp
305 310 315 320

Gln Tyr Ile Lys Thr His Lys Leu Gln Asp Pro His Glu Arg Glu Phe
325 330 335

Val Ile Cys Asp Lys Tyr Leu Gln Gln Ile Phe Glu Ser Gln Arg Met
340 345 350

Lys Phe Ser Glu Ile Pro Gln Arg Leu His Ala Leu Leu Met Pro Pro
355 360 365

Glu Pro Ile Ile Ile Asn His Val Ile Ser Val Asp Pro Asn Asp Gln
370 375 380

Lys Lys Thr Ala Cys Tyr Asp Ile Asp Val Glu Val Asp Asp Thr Leu
385 390 395 400

Lys Thr Gln Met Asn Ser Phe Leu Leu Ser Thr Ala Ser Gln Gln Glu
405 410 415

Ile Ala Thr Leu Asp Asn Lys Ile His Glu Thr Ile Glu Thr Ile Asn
420 425 430

Gln Leu Lys Thr Gln Arg Glu Phe Met Leu Ser Phe Ala Arg Asp Pro
435 440 445

Gln Gly Phe Ile Asn Asp Trp Leu Gln Ser Gln Cys Arg Asp Leu Lys
450 455 460

Thr Met Thr Asp Val Val Gly Asn Pro Glu Glu Glu Arg Arg Ala Glu
465 470 475 480

Phe Tyr Phe Gln Pro Trp Ala Gln Glu Ala Val Cys Arg Tyr Phe Tyr
485 490 495

Ser Lys Val Gln Gln Arg Arg Gln Glu Leu Glu Gln Ala Leu Gly Ile
500 505 510

Arg Asn Thr
515



<210> 534 
<211> 1662 
<212> DNA 

<213> Homo sapiens 
<400> 534 

gttaaagccg gtcggaactg ctccggaggg cacgggctcc gtaggcacca actgcaagga 60 

cccctccccc tgcgggcgct cccatggcac agttcgcgtt cgagagtgac ctgcactcgc 120 

tgcttcagct ggatgcaccc atccccaatg caccccctgc gcgctggcag cgcaaagcca 180 

aggaagccgc aggcccggcc ccctcaccca tgcgggccgc caaccgatcc cacagcgccg 240

gcaggactcc gggccgaact cctggcaaat ccagttccaa ggttcagacc actcctagca 300 

aacctggcgg tgaccgctat atcccccatc gcagtgctgc ccagatggag gtggccagct 360 

tcctcctgag caaggagaac cagcctgaaa acagccagac gcccaccaag aaggaacatc 420 

agaaagcctg ggctttgaac ctgaacggtt ttgatgtaga ggaagccaag atccttcggc 480 

tcagtggaaa accacaaaat gcgccagagg gttatcagaa cagactgaaa gtactctaca 540 

gccaaaaggc cactcctggc tccagccgga agacctgccg ttacattcct tccctgccag 600

accgtatcct ggatgcgcct gaaatccgaa atgactatta cctgaacctt gtggattgga 660

gttctgggaa tgtactggcc gtggcactgg acaacagtgt gtacctgtgg agtgcaagct 720

ctggtgacat cctgcagctt ttgcaaatgg agcagcctgg ggaatatata tcctctgtgg 780

cctggatcaa agagggcaac tacttggctg tgggcaccag cagtgctgag gtgcagctat 840

gggatgtgca gcagcagaaa cggcttcgaa atatgaccag tcactctgcc cgagtgggct 900

ccctaagctg gaacagctat atcctgtcca gtggttcacg ttctggccac atccaccacc 960

atgatgttcg ggtagcagaa caccatgtgg ccacactgag tggccacagc caggaagtgt 1020

gtgggctgcg ctgggcccca gatggacgac atttggccag tggtggtaat gataacttgg 1080

tcaatgtgtg gcctagtgct cctggagagg gtggctgggt tcctctgcag acattcaccc 1140

agcatcaagg ggctgtcaag gccgtagcat ggtgtccctg gcagtccaat gtcctggcaa 1200

caggaggggg caccagtgat cgacacattc gcatctggaa tgtgtgctct ggggcctgtc 1260

tgagtgccgt ggatgcccat tcccaggtgt gctccatcct ctggtctccc cattacaagg 1320

agctcatctc aggccatggc tttgcacaga accagctagt tatttggaag tacccaacca 1380

tggccaaggt ggctgaactc aaaggtcaca catcccgggt cctgagtctg accatgagcc 1440

cagatggggc cacagtggca tccgcagcag cagatgagac cctgaggcta tggcgctgtt 1500

ttgagttgga ccctgcgcgg cggcgggagc gggagaaggc cagtgcagcc aaaagcagcc 1560

tcatccacca aggcatccgc tgaagaccaa cccatcacct cagttgtttt ttatttttct 1620

aataaagtca tgtctccctt caaaaaaaaa aaaaaaaaaa aa 1662



<210> 535
<211> 499
<212> PRT

<213> Homo sapiens



<400> 535

Met Ala Gln Phe Ala Phe Glu Ser Asp Leu His Ser Leu Leu Gln Leu
1 5 10 15

Asp Ala Pro Ile Pro Asn Ala Pro Pro Ala Arg Trp Gln Arg Lys Ala
20 25 30

Lys Glu Ala Ala Gly Pro Ala Pro Ser Pro Met Arg Ala Ala Asn Arg
35 40 45

Ser His Ser Ala Gly Arg Thr Pro Gly Arg Thr Pro Gly Lys Ser Ser
50 55 60

Ser Lys Val Gln Thr Thr Pro Ser Lys Pro Gly Gly Asp Arg Tyr Ile
65 70 75 80

Pro His Arg Ser Ala Ala Gln Met Glu Val Ala Ser Phe Leu Leu Ser
85 90 95

Lys Glu Asn Gln Pro Glu Asn Ser Gln Thr Pro Thr Lys Lys Glu His
100 105 110

Gln Lys Ala Trp Ala Leu Asn Leu Asn Gly Phe Asp Val Glu Glu Ala
115 120 125

Lys Ile Leu Arg Leu Ser Gly Lys Pro Gln Asn Ala Pro Glu Gly Tyr
130 135 140

Gln Asn Arg Leu Lys Val Leu Tyr Ser Gln Lys Ala Thr Pro Gly Ser
145 150 155 160

Ser Arg Lys Thr Cys Arg Tyr Ile Pro Ser Leu Pro Asp Arg Ile Leu
165 170 175

Asp Ala Pro Glu Ile Arg Asn Asp Tyr Tyr Leu Asn Leu Val Asp Trp
180 185 190

Ser Ser Gly Asn Val Leu Ala Val Ala Leu Asp Asn Ser Val Tyr Leu
195 200 205

Trp Ser Ala Ser Ser Gly Asp Ile Leu Gln Leu Leu Gln Met Glu Gln
210 215 220

Pro Gly Glu Tyr Ile Ser Ser Val Ala Trp Ile Lys Glu Gly Asn Tyr
225 230 235 240

Leu Ala Val Gly Thr Ser Ser Ala Glu Val Gln Leu Trp Asp Val Gln
245 250 255

Gln Gln Lys Arg Leu Arg Asn Met Thr Ser His Ser Ala Arg Val Gly
260 265 270

Ser Leu Ser Trp Asn Ser Tyr Ile Leu Ser Ser Gly Ser Arg Ser Gly
275 280 285

His Ile His His His Asp Val Arg Val Ala Glu His His Val Ala Thr
290 295 300

Leu Ser Gly His Ser Gln Glu Val Cys Gly Leu Arg Trp Ala Pro Asp
305 310 315 320

Gly Arg His Leu Ala Ser Gly Gly Asn Asp Asn Leu Val Asn Val Trp
325 330 335

Pro Ser Ala Pro Gly Glu Gly Gly Trp Val Pro Leu Gln Thr Phe Thr
340 345 350

Gln His Gln Gly Ala Val Lys Ala Val Ala Trp Cys Pro Trp Gln Ser
355 360 365

Asn Val Leu Ala Thr Gly Gly Gly Thr Ser Asp Arg His Ile Arg Ile
370 375 380

Trp Asn Val Cys Ser Gly Ala Cys Leu Ser Ala Val Asp Ala His Ser
385 390 395 400

Gln Val Cys Ser Ile Leu Trp Ser Pro His Tyr Lys Glu Leu Ile Ser
405 410 415

Gly His Gly Phe Ala Gln Asn Gln Leu Val Ile Trp Lys Tyr Pro Thr
420 425 430

Met Ala Lys Val Ala Glu Leu Lys Gly His Thr Ser Arg Val Leu Ser
435 440 445

Leu Thr Met Ser Pro Asp Gly Ala Thr Val Ala Ser Ala Ala Ala Asp
450 455 460

Glu Thr Leu Arg Leu Trp Arg Cys Phe Glu Leu Asp Pro Ala Arg Arg
465 470 475 480

Arg Glu Arg Glu Lys Ala Ser Ala Ala Lys Ser Ser Leu Ile His Gln
485 490 495

Gly Ile Arg